Bipartite Graphs for Visualization Analysis of Microbiome Data
Status PubMed-not-MEDLINE Jazyk angličtina Země Spojené státy americké Médium electronic-ecollection
Typ dokumentu časopisecké články
PubMed
27279729
PubMed Central
PMC4888752
DOI
10.4137/ebo.s38546
PII: ebo-suppl.1-2016-017
Knihovny.cz E-zdroje
- Klíčová slova
- 16S rRNA, OTU table, bipartite graph, graph modularity, metagenomics, visualization analysis,
- Publikační typ
- časopisecké články MeSH
Visualization analysis plays an important role in metagenomics research. Proper and clear visualization can help researchers get their first insights into data and by selecting different features, also revealing and highlighting hidden relationships and drawing conclusions. To prevent the resulting presentations from becoming chaotic, visualization techniques have to properly tackle the high dimensionality of microbiome data. Although a number of different methods based on dimensionality reduction, correlations, Venn diagrams, and network representations have already been published, there is still room for further improvement, especially in the techniques that allow visual comparison of several environments or developmental stages in one environment. In this article, we represent microbiome data by bipartite graphs, where one partition stands for taxa and the other stands for samples. We demonstrated that community detection is independent of taxonomical level. Moreover, focusing on higher taxonomical levels and the appropriate merging of samples greatly helps improving graph organization and makes our presentations clearer than other graph and network visualizations. Capturing labels in the vertices also brings the possibility of clearly comparing two or more microbial communities by showing their common and unique parts.
Zobrazit více v PubMed
Land M, Hauser L, Jun S, et al. Insights from 20 years of bacterial genome sequencing. Funct Integr Genomics. 2015;15(2):141–61. PubMed PMC
Simon C, Daniel R. Metagenomic analyses: past and future trends. Appl Environ Microbiol. 2011;77(4):1153–61. PubMed PMC
Fierer N, Breitbart M, Nulton J, et al. Metagenomic and small-subunit rRNA analyses reveal the genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol. 2007;73(21):7059–66. PubMed PMC
Lozupone C, Knight R. UniFrac: a new phylogenetic method for comparing microbial communities. Appl Environ Microbiol. 2005;71(12):8228–35. PubMed PMC
Hamady M, Lozupone C, Knight R. Fast UniFrac: facilitating high-throughput phylogenetic analyses of microbial communities including analysis of pyrosequencing and PhyloChip data. ISME J. 2009;4(1):17–27. PubMed PMC
Scholz M, Lo C, Chain P. Next generation sequencing and bioinformatic bottlenecks: the current state of metagenomic data analysis. Curr Opin Biotechnol. 2012;23(1):9–15. PubMed
Klindworth A, Pruesse E, Schweer T, et al. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res. 2012;41(1):e1. PubMed PMC
Zhou J, Wu L, Deng Y, et al. Reproducibility and quantitation of amplicon sequencing-based detection. ISME J. 2011;5(8):1303–13. PubMed PMC
Kim O, Cho Y, Lee K, et al. Introducing EzTaxon-e: a prokaryotic 16S rRNA gene sequence database with phylotypes that represent uncultured species. Int J Syst Evol Microbiol. 2012;62(3):716–21. PubMed
Huson DH, Auch AF, Qi J, Schuster SC. MEGAN analysis of metagenomic data. Genome Res. 2007;17(3):377–86. PubMed PMC
Meyer F, Paarmann D, D’Souza M, et al. The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinformatics. 2008;9(1):e386. PubMed PMC
Caporaso J, Kuczynski J, Stombaugh J, et al. QIIME allows analysis of high-throughput community sequencing data. Nat Methods. 2010;7(5):335–6. PubMed PMC
Wilkinson L, Friendly M. The history of the cluster heat map. Am Stat. 2009;63(2):179–84.
Gonzalez A, Knight R. Advancing analytical algorithms and pipelines for billions of microbial sequences. Curr Opin Biotechnol. 2012;23(1):64–71. PubMed PMC
Faust K, Raes J. Microbial interactions: from networks to models. Nat Rev Microbiol. 2012;10(8):538–50. PubMed
Khanipov K, Golovko G, Rojas M, et al. CoCo: an application to store high-throughput sequencing data in compact text and binary file formats; 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM); Washington, DC: IEEE; 2015. pp. 1117–22.
Brenner J, Putonti C. HAsh-MaP-ERadicator: filtering non-target sequences from next generation sequencing reads; 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM); Washington, DC: IEEE; 2015. pp. 1100–1.
Altschul S, Gish W, Miller W, et al. Basic local alignment search tool. J Mol Biol. 1990;215(3):403–10. PubMed
Videnska P, Rahman M, Faldynova M, et al. Characterization of egg laying hen and broiler fecal microbiota in poultry farms in Croatia, Czech Republic, Hungary and Slovenia. PLoS One. 2014;9(10):e110076. PubMed PMC
Sedlar K, Skutkova H, Videnska P, et al. Bipartite graphs for metagenomic data analysis and visualization; 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM); Washington, DC: IEEE; 2015. pp. 1123–8.
Videnska P, Sedlar K, Lukac M, et al. Succession and replacement of bacterial populations in the caecum of egg laying hens over their whole life. PLoS One. 2014;9(12):e115142. PubMed PMC
Edgar R. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26(19):2460–1. PubMed
Wang Q, Garrity G, Tiedje J, et al. Classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl Environ Microbiol. 2007;73(16):5261–7. PubMed PMC
TEAM R . Core. R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing; 2014.
Bastian M, Heymann S, Jacomy M. Gephi: an open source software for exploring and manipulating networks; Proceedings of the International AAAI Conference on Weblogs and Social Media; Menlo Park, CA: Association for the Advancement of Artificial Intelligence; 2009.
Jacomy M, Venturini T, Heymann S, Bastian M. ForceAtlas2, a continuous graph layout algorithm for handy network visualization designed for the Gephi software. PLoS One. 2014;9(6):e98679. PubMed PMC
Blondel V, Guillaume J, Lambiotte R, et al. Fast unfolding of communities in large networks. J Stat Mech. 2008;2008(10):10008.
Lambiotte R, Delvenne J, Barahona M. Random walks, Markov processes and the multiscale modular organization of complex networks. IEEE Trans Netw Sci Eng. 2014;1(2):76–90.