High variability in SSU rDNA gene copy number among planktonic foraminifera revealed by single-cell qPCR

Metabarcoding has become the workhorse of community ecology. Sequencing a taxonomically informative DNA fragment from environmental samples gives fast access to community composition across taxonomic groups, but it relies on the assumption that the number of sequences for each taxon correlates with its abundance in the sampled community. However, gene copy number varies among and within taxa, and the extent of this variability must therefore be considered when interpreting community composition data derived from environmental sequencing. Here we used single-cell qPCR to measure the SSU rDNA gene copy number of 139 specimens from five species of planktonic foraminifera. We found that the average gene copy number ranged from ~4 000 to ~50 000 among species, and that individuals of the same species can carry from ~300 to more than 350 000 gene copies. This variability cannot be explained by differences in cell size. After considering all plausible sources of bias, we conclude that it likely reflects dynamic genomic processes acting during the life cycle. We used the observed variability to model its impact on metabarcoding and found that applying a correction factor at species level may correct the derived relative abundances, provided sufficiently large populations have been sampled.
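The idea of a species-level correction factor can be illustrated with a minimal sketch. It is not the authors' model: it assumes that reads scale linearly with gene copies and ignores within-species variability and sample size, which the abstract identifies as important. The species names, read counts and the function `correct_relative_abundance` are hypothetical. The two copy numbers are simply the ends of the range of species averages quoted above.

```python
def correct_relative_abundance(read_counts, mean_copy_number):
    """Divide each species' read count by its mean gene copy number,
    then normalise so the corrected values sum to 1."""
    corrected = {}
    for species, reads in read_counts.items():
        copies = mean_copy_number[species]
        if copies <= 0:
            raise ValueError(f"copy number for {species} must be positive")
        corrected[species] = reads / copies
    total = sum(corrected.values())
    if total == 0:
        raise ValueError("no reads to normalise")
    return {species: value / total for species, value in corrected.items()}


# Hypothetical sample: both species contribute the same number of cells,
# but species_B carries 12.5 times more gene copies per cell (assumed values).
reads = {"species_A": 40_000, "species_B": 500_000}
copies = {"species_A": 4_000, "species_B": 50_000}

# Uncorrected relative abundances, taken directly from read counts.
raw = {species: r / sum(reads.values()) for species, r in reads.items()}
corrected = correct_relative_abundance(reads, copies)

print("raw:", raw)
print("corrected:", corrected)
```

In this constructed case the raw read proportions strongly under-represent species_A, while the corrected proportions are equal. With real data the correction would only be expected to hold on average, for sufficiently large populations.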

Supplementary Material 1. Geographic origin, taxonomic identification and SSU rDNA gene copy number quantification obtained by qPCR for the 139 specimens. Sequences are provided for the fragment 45E-47F.

Identifier
DOI https://doi.org/10.1594/PANGAEA.938692
Related Identifier References https://doi.org/10.1038/s43705-021-00067-3
Metadata Access https://ws.pangaea.de/oai/provider?verb=GetRecord&metadataPrefix=datacite4&identifier=oai:pangaea.de:doi:10.1594/PANGAEA.938692
Provenance
Creator Milivojević, Tamara; Rahman, Shirin Nurshan; Raposo, Débora; Siccha, Michael; Kucera, Michal; Morard, Raphael
Publisher PANGAEA
Publication Year 2021
Rights Creative Commons Attribution 4.0 International; https://creativecommons.org/licenses/by/4.0/
OpenAccess true
Representation
Resource Type Dataset
Format text/tab-separated-values
Size 2505 data points
Discipline Earth System Research
Spatial Coverage (-67.000W, -38.773S, 17.584E, 72.000N)
Temporal Coverage Begin 2015-01-31T12:05:00Z
Temporal Coverage End 2017-08-19T17:35:00Z