Opportunities for diversified usage of metabarcoding data for fungal biogeography through increased metadata quality
Loading...
Date
Authors
Harris, Mathew Andrew
Slippers, Bernard
Kemler, Martin
Greve, Michelle
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Abstract
The widely adopted use of metabarcoding techniques and the ability to sequence microbial
communities directly from environmental samples have advanced the field of fungal ecology.
The growth of publicly available big data offers opportunities for collating data from
different sources to explore biogeographical and macroecological patterns of fungal groups
over large spatial scales. This requires reliable and high-quality metadata associated with
the raw sequencing data. We assessed the accuracy of submitted metadata linked to
terrestrial plant-associated fungal genetic marker sequences, extracted from NCBI’s Bio-
Project web-portal. The amount of correctly captured, missing, and incorrectly supplied
metadata was determined. The quality of submitter-defined metadata was of a variable
quality, with some adhering to metadata standards, and others not capturing metadata
for certain attributes or, when metadata was captured, duplicating metadata across samples,
or only partially meeting metadata requirements. This ultimately limits the ability to
find, and subsequently re-use, sequence data. The rapid accumulation of metabarcoding
data and the ability to directly compare samples taken from different studies holds opportunities
for gaining a deeper understanding of fungal biogeographical patterns and their
drivers. Standardised vocabularies for metadata attributes during submission to public repositories
like NCBI’s Sequence Read Archive, coupled with adequate incentives for the
data providers, would facilitate the Findability, Accessibility, Interoperability, and Reusability
(FAIR) data principles and ultimately enable metabarcoding sequence data to be
readily utilized to perform large scale global biogeographical studies on the kingdom Fungi.
Description
Keywords
High-throughput sequencing, Metadata, Fungi, Conservation, Big data, INSDC, Macroecology, Species, SDG-15: Life on land
Sustainable Development Goals
SDG-15:Life on land
Citation
Harris, M.A., Slippers, B., Kemler, M. et al. 2023, 'Opportunities for diversified usage of metabarcoding data for fungal biogeography through increased metadata quality', Fungal Biology Reviews, vol. 46, art. 100329, pp. 1-15.
https://DOI.org/10.1016/j.fbr.2023.100329.