This site needs JavaScript to work properly. Please enable it to take advantage of the complete set of features!
Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

NIH NLM Logo
Log in
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Oct 30;8(1):1188.
doi: 10.1038/s41467-017-01312-x.

Algorithm for post-clustering curation of DNA amplicon data yields reliable biodiversity estimates

Affiliations

Algorithm for post-clustering curation of DNA amplicon data yields reliable biodiversity estimates

Tobias Guldberg Frøslev et al. Nat Commun. .

Abstract

DNA metabarcoding is promising for cost-effective biodiversity monitoring, but reliable diversity estimates are difficult to achieve and validate. Here we present and validate a method, called LULU, for removing erroneous molecular operational taxonomic units (OTUs) from community data derived by high-throughput sequencing of amplified marker genes. LULU identifies errors by combining sequence similarity and co-occurrence patterns. To validate the LULU method, we use a unique data set of high quality survey data of vascular plants paired with plant ITS2 metabarcoding data of DNA extracted from soil from 130 sites in Denmark spanning major environmental gradients. OTU tables are produced with several different OTU definition algorithms and subsequently curated with LULU, and validated against field survey data. LULU curation consistently improves α-diversity estimates and other biodiversity metrics, and does not require a sequence reference database; thus, it represents a promising method for reliable biodiversity estimation.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Fig. 1
Fig. 1
Effects of curation with the LULU algorithm for clustering methods at 97% level. OTU table metrics before (red = raw) and after (blue = curated) curation with LULU. a correspondence of OTU (plant ITS2 sequence data) richness vs. plant richness for each of the 130 sampling sites, b total number of OTUs compared to total plant species recorded (564 species, dashed line), c percentage of OTUs having taxonomically redundant annotation, d OTU β-diversity (total richness/mean site richness) compared to plant β-diversity (17.23, dashed line), e distribution of best reference database (GenBank) match for OTUs retained and discarded by LULU
Fig. 2
Fig. 2
LULU curation workflow. (1) The user constructs an OTU table. (2) The user constructs a match list. (3) OTU table and match list is fed to the LULU algorithm

References

    1. McGill BJ, Dornelas M, Gotelli NJ, Magurran AE. Fifteen forms of biodiversity trend in the anthropocene. Trends Ecol. Evol. 2015;30:104–113. doi: 10.1016/j.tree.201411006. - DOI - PubMed
    1. Thomas JA, et al. Comparative losses of British butterflies, birds, and plants and the global extinction crisis. Science. 2004;303:1879–1881. doi: 10.1126/science.1095046. - DOI - PubMed
    1. Thomsen PF, Willerslev E. Environmental DNA–an emerging tool in conservation for monitoring past and present biodiversity. Biol. Conserv. 2015;183:4–18. doi: 10.1016/j.biocon.2014年11月01日9. - DOI
    1. Bálint M, et al. Millions of reads, thousands of taxa: microbial community structure and associations analyzed via marker genes. FEMS Microbiol. Rev. 2016;40:686–700. doi: 10.1093/femsre/fuw017. - DOI - PubMed
    1. Taberlet P, Coissac E, Pompanon F, Brochmann C, Willerslev E. Towards next-generation biodiversity assessment using DNA metabarcoding. Mol. Ecol. 2012;21:2045–2050. doi: 10.1111/j.1365-294X.2012.05470.x. - DOI - PubMed

Publication types

Cite

AltStyle によって変換されたページ (->オリジナル) /