EukRef: Phylogenetic curation of ribosomal RNA to enhance understanding of eukaryotic diversity and distribution

2018 Sep;16(9):e2005849. Epub 2018 Sep 17

Language: English. Country: United States. Medium: electronic-eCollection

Document type: journal article, grant-supported work, Research Support, U.S. Gov't, Non-P.H.S.

Persistent link: https://www.medvik.cz/link/pmid30222734

Environmental sequencing has greatly expanded our knowledge of micro-eukaryotic diversity and ecology by revealing previously unknown lineages and their distribution. However, the value of these data is critically dependent on the quality of the reference databases used to assign an identity to environmental sequences. Existing databases contain errors and struggle to keep pace with rapidly changing eukaryotic taxonomy, the influx of novel diversity, and computational challenges related to assembling the high-quality alignments and trees needed for accurate characterization of lineage diversity. EukRef (eukref.org) is an ongoing community-driven initiative that addresses these challenges by bringing together taxonomists with expertise spanning the eukaryotic tree of life and microbial ecologists, who use environmental sequence data, to develop reliable reference databases across the diversity of microbial eukaryotes. EukRef organizes and facilitates rigorous mining and annotation of sequence data by providing protocols, guidelines, and tools. The EukRef pipeline and tools allow users interested in a particular group of microbial eukaryotes to retrieve all sequences belonging to that group from the International Nucleotide Sequence Database Collaboration (INSDC) databases (GenBank, the European Nucleotide Archive [ENA], or the DNA DataBank of Japan [DDBJ]), to place those sequences in a phylogenetic tree, and to curate taxonomic and environmental information for the group. We provide guidelines to facilitate the process and to standardize taxonomic annotations. The final outputs of this process are (1) a reference tree and alignment, (2) a reference sequence database, including taxonomic and environmental information, and (3) a list of putative chimeras and other artifactual sequences. These products will be useful for the broad community as they become publicly available (at eukref.org) and are shared with existing reference databases.


