High quality food bacterial genomic ressources from fermented vegetables

DOI

This dataset is associated with a collection of 39 complete bacterial genomes (157 replicons in total) of high sequencing quality and submitted in May 2024 to European Nucleotide Archive (ENA). These strains belong predominantly to homo- and hetero-lactic acid bacteria pivotal in the fermentation process of various vegetables, although other taxa (e.g. Hafnia, Rahnella, Bacillus, Enterococcus and Pseudomonas) were also considered. This dataset also includes the production of genomes for lactic acid bacteria species that have rarely been sequenced, such as Levilactobacillus cerevisiae, Levilactobacillus yonginensis or Pediococcus parvulus. The strains demonstrated optimal growth performance during vegetable fermentation and their genome was sequenced using a combination of third generation long-reads ONT and short-reads Illumina technologies. These bacterial ressources and associated genomic data were characterized during the Agence Nationale de la Recherche (ANR) Metasimfood project (https://www.metasimfood.inrae.fr/) and are subsequently made publicly available for academic research purpose on fermented foods. These strains will be used in the frame of ANR metasimfood project, as core microbial ressources for the design of microbial consortia for pant-based fermented foods. The various files of the dataset summarize either genomic data (metasimfood_genomic_data.csv) such as accession numbers, size of replicons, sequencing coverage; or metadata associated with the strains (metasimfood_metadata_strains.csv) such as isolation sources, availibility, owner's contac. Specific genomic features detection such as prophages and plasmid (metasimfood_provirus_summary.csv; metasimfood_provirus-results.csv; metasimfood_plasmid_results.csv) are also available.

Identifier
DOI https://doi.org/10.57745/O4OJA2
Metadata Access https://entrepot.recherche.data.gouv.fr/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.57745/O4OJA2
Provenance
Creator Karimi, Elham ORCID logo; Lima, Alice ORCID logo; Tap, Julien ORCID logo; Gendre, Julia; Dugat-Bony, Eric ORCID logo; Marquet, Gwendoline; Valence-Bertel, Florence (ORCID: 0000-0002-4834-086X); Chuat, Victoria; Peltier, Emilien ORCID logo; Loux, Valentin (ORCID: 0000-0002-8268-915X); Chaillou, Stephane ORCID logo
Publisher Recherche Data Gouv
Contributor Chaillou, Stephane
Publication Year 2024
Funding Reference Agence nationale de la recherche
Rights etalab 2.0; info:eu-repo/semantics/openAccess; https://spdx.org/licenses/etalab-2.0.html
OpenAccess true
Contact Chaillou, Stephane (UMR1319, MICALIS Institute, INRAE, AgroParisTech, Université Paris-Saclay, Domaine de Vilvert, 78350, Jouy-en-Josas)
Representation
Resource Type Dataset
Format text/comma-separated-values; text/tab-separated-values
Size 12475; 11352; 11953; 16568; 8553
Version 1.0
Discipline Life Sciences; Biospheric Sciences; Ecology; Geosciences; Medicine; Natural Sciences