Hunting Down Frame Shifts: Ecological Analysis of Diverse Functional Gene Sequences
Status PubMed-not-MEDLINE Jazyk angličtina Země Švýcarsko Médium electronic-ecollection
Typ dokumentu časopisecké články
PubMed
26635739
PubMed Central
PMC4656815
DOI
10.3389/fmicb.2015.01267
Knihovny.cz E-zdroje
- Klíčová slova
- FrameBot, Frameshift, amplicon sequencing, benzoate dioxygenase, biphenyl dioxygenase, functional genes,
- Publikační typ
- časopisecké články MeSH
Functional gene ecological analyses using amplicon sequencing can be challenging as translated sequences are often burdened with shifted reading frames. The aim of this work was to evaluate several bioinformatics tools designed to correct errors which arise during sequencing in an effort to reduce the number of frameshifts (FS). Genes encoding for alpha subunits of biphenyl (bphA) and benzoate (benA) dioxygenases were used as model sequences. FrameBot, a FS correction tool, was able to reduce the number of detected FS to zero. However, up to 44% of sequences were discarded by FrameBot as non-specific targets. Therefore, we proposed a de novo mode of FrameBot for FS correction, which works on a similar basis as common chimera identifying platforms and is not dependent on reference sequences. By nature of FrameBot de novo design, it is crucial to provide it with data as error free as possible. We tested the ability of several publicly available correction tools to decrease the number of errors in the data sets. The combination of maximum expected error filtering and single linkage pre-clustering proved to be the most efficient read processing approach. Applying FrameBot de novo on the processed data enabled analysis of BphA sequences with minimal losses of potentially functional sequences not homologous to those previously known. This experiment also demonstrated the extensive diversity of dioxygenases in soil. A script which performs FrameBot de novo is presented in the supplementary material to the study or available at https://github.com/strejcem/FBdenovo. The tool was also implemented into FunGene Pipeline available at http://fungene.cme.msu.edu/FunGenePipeline/.
Zobrazit více v PubMed
Barriault D., Lepine F., Mohammadi M., Milot S., Leberre N., Sylvestre M. (2004). Revisiting the regiospecificity of Burkholderia xenovorans LB400 biphenyl dioxygenase toward 2,2′-dichlorobiphenyl and 2,3,2′,3′-tetrachlorobiphenyl. PubMed DOI
Barriault D., Sylvestre M. (2004). Evolution of the biphenyl dioxygenase BphA from PubMed DOI
Bopp L. H. (1986). Degradation of highly chlorinated PCBs by DOI
Camacho C., Coulouris G., Avagyan V., Ma N., Papadopoulos J., Bealer K., et al. (2009). BLAST+: architecture and applications. PubMed DOI PMC
Caporaso J. G., Kuczynski J., Stombaugh J., Bittinger K., Bushman F. D., Costello E. K., et al. (2010). QIIME allows analysis of high-throughput community sequencing data. PubMed DOI PMC
Chang H.-K., Mohseni P., Zylstra G. J. (2003). Characterization and regulation of the genes for a novel anthranilate 1,2-dioxygenase from PubMed DOI PMC
Cole J. R., Wang Q., Fish J. A., Chai B., Mcgarrell D. M., Sun Y., et al. (2014). Ribosomal Database Project: data and tools for high throughput rRNA analysis. PubMed DOI PMC
Denonfoux J., Parisot N., Dugat-Bony E., Biderre-Petit C., Boucher D., Morgavi D. P., et al. (2013). Gene capture coupled to high-throughput sequencing as a strategy for targeted metagenome exploration. PubMed DOI PMC
Edgar R. C. (2004). MUSCLE: multiple sequence alignment with high accuracy and high throughput. PubMed DOI PMC
Edgar R. C. (2010). Search and clustering orders of magnitude faster than BLAST. PubMed DOI
Edgar R. C., Haas B. J., Clemente J. C., Quince C., Knight R. (2011). UCHIME improves sensitivity and speed of chimera detection. PubMed DOI PMC
Fish J. A., Chai B., Wang Q., Sun Y., Brown C. T., Tiedje J. M., et al. (2013). FunGene: the Functional Gene Pipeline and Repository. PubMed DOI PMC
Furukawa K., Hayase N., Taira K., Tomizuka N. (1989). Molecular relationship of chromosomal genes encoding biphenyl/polychlorinated biphenyl catabolism: some soil bacteria possess a highly conserved bph operon. PubMed PMC
Furukawa K., Suenaga H., Goto M. (2004). Biphenyl dioxygenases: functional versatilities and directed evolution. PubMed DOI PMC
Furusawa Y., Nagarajan V., Tanokura M., Masai E., Fukuda M., Senda T. (2004). Crystal structure of the terminal oxygenase component of biphenyl dioxygenase derived from PubMed DOI
Gaspar J. M., Thomas W. K. (2013). Assessing the consequences of denoising marker-based metagenomic data. PubMed DOI PMC
Ge Y., Eltis L. D. (2003). Characterization of hybrid toluate and benzoate dioxygenases. PubMed DOI PMC
Ge Y., Vaillancourt F. H., Agar N. Y. R., Eltis L. D. (2002). Reactivity of toluate dioxygenase with substituted benzoates and dioxygen. PubMed DOI PMC
Hurtubise Y., Barriault D., Powlowski J., Sylvestre M. (1995). Purification and characterization of the PubMed PMC
Huse S. M., Welch D. M., Morrison H. G., Sogin M. L. (2010). Ironing out the wrinkles in the rare biosphere through improved OTU clustering. PubMed DOI PMC
Iwai S., Chai B., Sul W. J., Cole J. R., Hashsham S. A., Tiedje J. M. (2010). Gene-targeted-metagenomics reveals extensive diversity of aromatic dioxygenase genes in the environment. PubMed DOI PMC
Kumamaru T., Suenaga H., Mitsuoka M., Watanabe T., Furukawa K. (1998). Enhanced degradation of polychlorinated biphenyls by directed evolution of biphenyl dioxygenase. PubMed DOI
Kumar P., Gomez-Gil L., Mohammadi M., Sylvestre M., Eltis L. D., Bolin J. T. (2011). Anaerobic crystallization and initial X-ray diffraction data of biphenyl 2,3-dioxygenase from PubMed DOI PMC
Kurzawová V., Štursa P., Uhlík O., Norková K., Strohalm M., Lipov J., et al. (2012). Plant-microorganism interactions in bioremediation of polychlorinated biphenyl-contaminated soil. PubMed DOI
Masai E., Yamada A., Healy J. M., Hatta T., Kimbara K., Fukuda M., et al. (1995). Characterization of biphenyl catabolic genes of gram-positive polychlorinated biphenyl degrader PubMed PMC
Mohammadi M., Sylvestre M. (2005). Resolving the profile of metabolites generated during oxidation of dibenzofuran and chlorodibenzofurans by the biphenyl catabolic pathway enzymes. PubMed DOI
Mondello F. J., Turcich M. P., Lobos J. H., Erickson B. D. (1997). Identification and modification of biphenyl dioxygenase sequences that determine the specificity of polychlorinated biphenyl degradation. PubMed PMC
Nam J. W., Nojiri H., Yoshida T., Habe H., Yamane H., Omori T. (2001). New classification system for oxygenase components involved in ring-hydroxylating oxygenations. PubMed DOI
Pavlíková D., Macek T., Macková M., Pavlík M. (2007). Monitoring native vegetation on a dumpsite of PCB-contaminated soil. PubMed DOI
Penton C. R., Johnson T. A., Quensen J. F., Iwai S., Cole J. R., Tiedje J. M. (2013). Functional genes to assess nitrogen cycling and aromatic hydrocarbon degradation: primers and processing matter. PubMed DOI PMC
Pham T. T. M., Sylvestre M. (2013). Has the bacterial biphenyl catabolic pathway evolved primarily to degrade biphenyl? The diphenylmethane case. PubMed DOI PMC
Pham T. T. M., Tu Y., Sylvestre M. (2012). Remarkable ability of PubMed DOI PMC
Pieper D. H., Seeger M. (2008). Bacterial metabolism of polychlorinated biphenyls. PubMed DOI
Quince C., Lanzén A., Curtis T. P., Davenport R. J., Hall N., Head I. M., et al. (2009). Accurate determination of microbial diversity from 454 pyrosequencing data. PubMed DOI
Quince C., Lanzen A., Davenport R. J., Turnbaugh P. J. (2011). Removing noise from pyrosequenced amplicons. PubMed DOI PMC
R Development Core Team (2009).
Ryšlavá E., Krejčjk Z., Macek T., Nováková H., Demnerová K., Macková M. (2003). Study of PCB degradation in real contaminated soil.
Schloss P. D., Gevers D., Westcott S. L. (2011). Reducing the effects of PCR amplification and sequencing artifacts on 16S rRNA-based studies. PubMed DOI PMC
Schloss P. D., Westcott S. L., Ryabin T., Hall J. R., Hartmann M., Hollister E. B., et al. (2009). Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. PubMed DOI PMC
Tamura K., Stecher G., Peterson D., Filipski A., Kumar S. (2013). MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. PubMed DOI PMC
Tang S., Antonov I., Borodovsky M. (2013). MetaGeneTack: ab initio detection of frameshifts in metagenomic sequences. PubMed DOI PMC
Uhlík O., Ječná K., Macková M., Vlček C., Hroudová M., Demnerová K., et al. (2009). Biphenyl-metabolizing bacteria in the rhizosphere of horseradish and bulk soil contaminated by polychlorinated biphenyls as revealed by stable isotope probing. PubMed DOI PMC
Uhlík O., Leewis M. C., Strejček M., Musilová L., Macková M., Leigh M. B., et al. (2013). Stable isotope probing in the metagenomics era: A bridge towards improved bioremediation. PubMed DOI PMC
Uhlík O., Wald J., Strejček M., Musilová L., Rídl J., Hroudová M., et al. (2012). Identification of bacteria utilizing biphenyl, benzoate, and naphthalene in long-term contaminated soil. PubMed DOI PMC
Vézina J., Barriault D., Sylvestre M. (2008). Diversity of the C-terminal portion of the biphenyl dioxygenase large subunit. PubMed DOI
Wang Q., Quensen J. F., Fish J. A., Kwon Lee T., Sun Y., Tiedje J. M., et al. (2013). Ecological patterns of nifH genes in four terrestrial climatic zones explored with targeted metagenomics using FrameBot, a new informatics tool. PubMed DOI PMC
Weisman D., Yasuda M., Bowen J. L. (2013). FunFrame: functional gene ecological analysis pipeline. PubMed DOI
Zhang S. W., Zhang Y. L., Pan Q., Cheng Y. M., Chou K. C. (2008). Estimating residue evolutionary conservation by introducing von Neumann entropy and a novel gap-treating approach. PubMed DOI PMC
Zhang Y., Sun Y. N. (2011). HMM-FRAME: accurate protein domain classification for metagenomic sequences containing frameshift errors. PubMed DOI PMC