This study is a first effort to compile the largest possible body of data available from different plankton databases as well as from individual published or unpublished datasets regarding diatom distribution in the world ocean. The data obtained originate from time series studies as well as spatial studies. This effort is supported by the Marine Ecosystem Data (MAREDAT) project, which aims at building consistent data sets for the main PFTs (Plankton Functional Types) in order to help validate biogeochemical ocean models by using converted C biomass from abundance data. Diatom abundance data were obtained from various research programs with the associated geolocation and date of collection, as well as with a taxonomic information ranging from group down to species. Minimum, maximum and average cell size information were mined from the literature for each taxonomic entry, and all abundance data were subsequently converted to biovolume and C biomass using the same methodology.
The attached zip file contains raw data files submitted by the authors and a NetCDF file. Progressively, raw data will be imported into PANGAEA as distinct data publications related to the original sources (journal or data publications). A new version of the raw data file was uploaded on 2016-12-06.