TY - JOUR
T1 - Every fifth published metagenome is not available to science
AU - Eckert, Ester M.
AU - Di Cesare, Andrea
AU - Fontaneto, Diego
AU - Berendonk, Thomas U.
AU - Bürgmann, Helmut
AU - Cytryn, Eddie
AU - Fatta-Kassinos, Despo
AU - Franzetti, Andrea
AU - Joakim Larsson, D. G.
AU - Manaia, Célia M.
AU - Pruden, Amy
AU - Singer, Andrew C.
AU - Udikovic-Kolic, Nikolina
AU - Corno, Gianluca
PY - 2020/4
Y1 - 2020/4
N2 - Have you ever sought to use metagenomic DNA sequences reported in scientific publications? Were you successful? Here, we reveal that metagenomes from no fewer than 20% of the papers found in our literature search, published between 2016 and 2019, were not deposited in a repository or were simply inaccessible. The proportion of inaccessible data within the literature has been increasing year-on-year. Noncompliance with Open Data is best predicted by the scientific discipline of the journal. The number of citations, journal type (e.g., Open Access or subscription journals), and publisher are not good predictors of data accessibility. However, many publications in high-impact factor journals do display a higher likelihood of accessible metagenomic data sets. Twenty-first century science demands compliance with the ethical standard of data sharing of metagenomes and DNA sequence data more broadly. Data accessibility must become one of the routine and mandatory components of manuscript submissions-a requirement that should be applicable across the increasing number of disciplines using metagenomics. Compliance must be ensured and reinforced by funders, publishers, editors, reviewers, and, ultimately, the authors.
AB - Have you ever sought to use metagenomic DNA sequences reported in scientific publications? Were you successful? Here, we reveal that metagenomes from no fewer than 20% of the papers found in our literature search, published between 2016 and 2019, were not deposited in a repository or were simply inaccessible. The proportion of inaccessible data within the literature has been increasing year-on-year. Noncompliance with Open Data is best predicted by the scientific discipline of the journal. The number of citations, journal type (e.g., Open Access or subscription journals), and publisher are not good predictors of data accessibility. However, many publications in high-impact factor journals do display a higher likelihood of accessible metagenomic data sets. Twenty-first century science demands compliance with the ethical standard of data sharing of metagenomes and DNA sequence data more broadly. Data accessibility must become one of the routine and mandatory components of manuscript submissions-a requirement that should be applicable across the increasing number of disciplines using metagenomics. Compliance must be ensured and reinforced by funders, publishers, editors, reviewers, and, ultimately, the authors.
UR - http://www.scopus.com/inward/record.url?scp=85083544680&partnerID=8YFLogxK
U2 - 10.1371/journal.pbio.3000698
DO - 10.1371/journal.pbio.3000698
M3 - Article
C2 - 32243442
AN - SCOPUS:85083544680
SN - 1544-9173
VL - 18
JO - PLoS Biology
JF - PLoS Biology
IS - 4
M1 - e3000698
ER -