Most cited article - PubMed ID 29020743
Development and validation of a multi-locus DNA metabarcoding method to identify endangered species in complex samples
The nuclear ribosomal DNA Internal Transcribed Spacer (ITS) region is used as a universal fungal barcode marker, but often lacks a significant DNA barcoding gap between sister taxa. Here we tested the reliability of protein-coding low-copy genes as alternative barcode markers. Mock communities of three unrelated agaric genera (Dermoloma, Hodophilus, and Russula) representing lineages of closely related species were sequenced on the Illumina platform, targeting the ITS1 and ITS2 regions, the gene for the second largest subunit of RNA polymerase II (rpb2), and the transcription elongation factor 1-alpha gene (ef1-α). Species representation and relative abundances were similar across all tested barcode regions, despite the lower copy number of the protein-coding markers. ITS1 and ITS2 required more sophisticated sequence filtering because they produced a high number of chimeric sequences, which required reference-based chimera removal, and had a higher number of sequence variants per species. Although clustering of filtered ITS sequences yielded, on average, a higher number of correctly clustered units at optimal similarity thresholds, these thresholds varied substantially among genera. Best-fitted thresholds of low-copy markers were more consistent across genera but frequently lacked species resolution due to low intraspecific variability. At some thresholds, we observed multiple species lumped together and, at the same time, species split into multiple partial clusters; this should be taken into account when assessing the best clustering thresholds and the taxonomic identity of clusters. To achieve the best taxonomic resolution and improve species detection, we recommend combining different markers and applying additional reference-based sorting of clusters.
The current availability of rpb2 and ef1-α reference sequences in public databases is far from being complete for all fungal groups, but a combined marker approach can be used for group-specific studies that can build reference data for their own purposes.
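The lumping and splitting described in the abstract can be illustrated with a minimal sketch. It assumes each sequence from a mock community already carries a known species label and a cluster ID obtained at one similarity threshold; the function name `assess_clusters` and the toy labels are hypothetical and are not taken from the study.

```python
from collections import defaultdict

def assess_clusters(assignments):
    """Classify clusters at a single threshold.

    assignments: iterable of (species, cluster_id) pairs, one per sequence.
    Returns the lumped clusters (more than one species), the split species
    (present in more than one cluster), and the correctly clustered units
    (one species in exactly one cluster).
    """
    species_in_cluster = defaultdict(set)
    clusters_of_species = defaultdict(set)
    for species, cluster in assignments:
        species_in_cluster[cluster].add(species)
        clusters_of_species[species].add(cluster)

    lumped = {c for c, sp in species_in_cluster.items() if len(sp) > 1}
    split = {s for s, cl in clusters_of_species.items() if len(cl) > 1}
    correct = {
        c for c, sp in species_in_cluster.items()
        if len(sp) == 1 and len(clusters_of_species[next(iter(sp))]) == 1
    }
    return {"lumped": lumped, "split": split, "correct": correct}

# Hypothetical toy data: species A is split across clusters 1 and 2,
# and cluster 2 lumps species A and B together.
toy = [("species A", 1), ("species A", 2), ("species B", 2), ("species C", 3)]
result = assess_clusters(toy)
```

In this toy case a single threshold produces a lumped cluster and a split species at once, which is the situation the abstract warns about when choosing a best-fitted threshold.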
- Keywords
- amplicon abundance, chimera, sympatric species, threshold
- Publication type
- Journal Article
DNA metabarcoding has great potential for species identification in complex samples such as food supplements and traditional medicines. Such a method would help Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES) enforcement officers combat wildlife crime by preventing illegal trade in endangered plant and animal species. The objective of this research was to develop a multi-locus DNA metabarcoding method for forensic wildlife species identification and to evaluate the applicability and reproducibility of this approach across different laboratories. The method uses 12 DNA barcode markers that have demonstrated universal applicability across a wide range of plant and animal taxa and that facilitate the identification of species in samples containing degraded DNA. It was developed using Illumina MiSeq amplicon sequencing of well-defined experimental mixtures, for which a bioinformatics pipeline with a user-friendly web interface was built. The performance of the method was assessed in an international validation trial by 16 laboratories, in which it was found to be highly reproducible and sensitive enough to identify species present in a mixture at 1% dry weight content. The multi-locus DNA metabarcoding method assessed in this study provides reliable and detailed data on the composition of complex food products, including information on the presence of CITES-listed species. The method can provide improved resolution for species identification, and verifying species with multiple DNA barcodes contributes to enhanced quality assurance.
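The idea of verifying a species with multiple DNA barcodes can be sketched as follows. This is an illustration only: the abstract does not state the decision rule used by the published pipeline, so the function name `species_support`, the `min_markers` parameter, and the example hits are all assumptions.

```python
from collections import defaultdict

def species_support(hits, min_markers=2):
    """Keep species identified by at least `min_markers` distinct barcode markers.

    hits: iterable of (marker, species) identifications.
    min_markers: assumed cut-off, not a value from the study.
    Returns a dict mapping each retained species to its sorted list of markers.
    """
    markers = defaultdict(set)
    for marker, species in hits:
        markers[species].add(marker)
    return {s: sorted(m) for s, m in markers.items() if len(m) >= min_markers}

# Hypothetical identifications: species X is supported by two markers,
# species Y by only one, so only species X passes the assumed cut-off.
hits = [("COI", "species X"), ("cyt b", "species X"), ("rbcL", "species Y")]
supported = species_support(hits)
```

Requiring agreement between markers trades some sensitivity for confidence; a species seen with a single barcode is not necessarily absent, only unverified under this assumed rule.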
- Keywords
- CITES, COI, Customs agencies, DNA metabarcoding, Endangered species, Traditional medicines, cyt b, matK, mini-barcodes, rbcL
- MeSH
- DNA, Plant / genetics
- Endangered Species *
- Reproducibility of Results
- Plants / classification / genetics
- DNA Barcoding, Taxonomic *
- Computational Biology
- Animals
- Check Tag
- Animals
- Publication type
- Journal Article
- Research Support, Non-U.S. Gov't
- Names of Substances
- DNA, Plant