Orthofinder results: raw and annotated orthogroups and list of unassigned proteins

DOI

Raw and annotated Orthofinder results on 61 nematode and 2 tardigrade (used as outgroup) species. The annotated list of species with description of the abbreviated names is available at https://doi.org/10.15454/IIAQOW

Four files are provided and described below:

1- Orthogroups.GeneCount.tab

Raw Orthofinder result in tab-separated values format: number of proteins per orthogroup per species.

2- Orthogroups-Gene-Counts.xlsx Number of genes per orthogroup per species.

Annotated Orthofinder results in xlsx format.The Excel file is made of 3 different sheets described below:

  • sheet 1 'Orthogroups.GeneCount': The full results with species sorted by taxonomy and the last column 'Total' indicating the total number of proteins in this orthogroup

  • sheet 2 'PPN-spec': Orthogroups specific to plant-parasitic nematodes (PPNs). Species sorted by taxonomy. The last three columns are as follows: 'Total'= total number of proteins in the orthogroup; 'Nb.species'= number of species in the orthogroup; 'Nb.tylenchida'= number of tylenchida speces in the orthogroup.

The two last rows are as follows: 'Total in OG'= total number of proteins from this species in PPN-specific orthogroups; 'Total singleton'= total number of single-copy species-specific proteins.

  • sheet 3 'Minc-PPN-spec': PPN-specific orthogroups that contain at least one M. incognita protein . The last three columns and two rows are as explained in sheet 2.

3- Orthogroups.tab

Raw Orthofinder result in tab-separated format, orthogroup composition with accession numbers of all the proteins and species prefix.

4- Orthogroups_UnassignedGenes.tab

Proteins that could not be assigned to any orthogroup and thus correspond to species-specific singletons.

Identifier
DOI https://doi.org/10.15454/ZAYJBC
Metadata Access https://entrepot.recherche.data.gouv.fr/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.15454/ZAYJBC
Provenance
Creator Danchin, Etienne; Grynberg, Priscila; Togawa, Roberto
Publisher Recherche Data Gouv
Contributor Danchin, Etienne
Publication Year 2020
Rights etalab 2.0; info:eu-repo/semantics/openAccess; https://spdx.org/licenses/etalab-2.0.html
OpenAccess true
Contact Danchin, Etienne (INRA - Institut National de la Recherche Agronomique)
Representation
Resource Type Dataset
Format application/vnd.openxmlformats-officedocument.spreadsheetml.sheet; text/tab-separated-values
Size 16997175; 10056584; 45525114; 61638778
Version 1.1
Discipline Geosciences; Life Sciences; Ecology; Plant Science; Biology; Omics