-
Something wrong with this record ?
The Quantification of Representative Sequences pipeline for amplicon sequencing: case study on within-population ITS1 sequence variation in a microparasite infecting Daphnia
E. González-Tortuero, J. Rusek, A. Petrusek, S. Gießler, D. Lyras, S. Grath, F. Castro-Monzón, J. Wolinska,
Language English Country England, Great Britain
Document type Comparative Study, Evaluation Study, Journal Article, Research Support, Non-U.S. Gov't
- MeSH
- Daphnia parasitology MeSH
- Genetic Variation * MeSH
- Mesomycetozoea classification genetics MeSH
- DNA, Ribosomal Spacer chemistry genetics MeSH
- Sequence Analysis, DNA * MeSH
- Software MeSH
- Computational Biology methods MeSH
- High-Throughput Nucleotide Sequencing MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Evaluation Study MeSH
- Research Support, Non-U.S. Gov't MeSH
- Comparative Study MeSH
Next generation sequencing (NGS) platforms are replacing traditional molecular biology protocols like cloning and Sanger sequencing. However, accuracy of NGS platforms has rarely been measured when quantifying relative frequencies of genotypes or taxa within populations. Here we developed a new bioinformatic pipeline (QRS) that pools similar sequence variants and estimates their frequencies in NGS data sets from populations or communities. We tested whether the estimated frequency of representative sequences, generated by 454 amplicon sequencing, differs significantly from that obtained by Sanger sequencing of cloned PCR products. This was performed by analysing sequence variation of the highly variable first internal transcribed spacer (ITS1) of the ichthyosporean Caullerya mesnili, a microparasite of cladocerans of the genus Daphnia. This analysis also serves as a case example of the usage of this pipeline to study within-population variation. Additionally, a public Illumina data set was used to validate the pipeline on community-level data. Overall, there was a good correspondence in absolute frequencies of C. mesnili ITS1 sequences obtained from Sanger and 454 platforms. Furthermore, analyses of molecular variance (amova) revealed that population structure of C. mesnili differs across lakes and years independently of the sequencing platform. Our results support not only the usefulness of amplicon sequencing data for studies of within-population structure but also the successful application of the QRS pipeline on Illumina-generated data. The QRS pipeline is freely available together with its documentation under GNU Public Licence version 3 at http://code.google.com/p/quantification-representative-sequences.
References provided by Crossref.org
- 000
- 00000naa a2200000 a 4500
- 001
- bmc16028480
- 003
- CZ-PrNML
- 005
- 20161025125234.0
- 007
- ta
- 008
- 161005s2015 enk f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1111/1755-0998.12396 $2 doi
- 024 7_
- $a 10.1111/1755-0998.12396 $2 doi
- 035 __
- $a (PubMed)25728529
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a enk
- 100 1_
- $a González-Tortuero, E $u Department of Ecosystem Research, Leibniz-Institute of Freshwater Ecology and Inland Fisheries (IGB), Müggelseedamm 301, 12587, Berlin, Germany. Berlin Centre for Genomics in Biodiversity Research (BeGenDiv), Königin-Luise-Straße 6-8, 14195, Berlin, Germany. Department of Biology II, Ludwig Maximilians University, Großhaderner Straße 2, 82512, Planegg-Martinsried, Germany.
- 245 14
- $a The Quantification of Representative Sequences pipeline for amplicon sequencing: case study on within-population ITS1 sequence variation in a microparasite infecting Daphnia / $c E. González-Tortuero, J. Rusek, A. Petrusek, S. Gießler, D. Lyras, S. Grath, F. Castro-Monzón, J. Wolinska,
- 520 9_
- $a Next generation sequencing (NGS) platforms are replacing traditional molecular biology protocols like cloning and Sanger sequencing. However, accuracy of NGS platforms has rarely been measured when quantifying relative frequencies of genotypes or taxa within populations. Here we developed a new bioinformatic pipeline (QRS) that pools similar sequence variants and estimates their frequencies in NGS data sets from populations or communities. We tested whether the estimated frequency of representative sequences, generated by 454 amplicon sequencing, differs significantly from that obtained by Sanger sequencing of cloned PCR products. This was performed by analysing sequence variation of the highly variable first internal transcribed spacer (ITS1) of the ichthyosporean Caullerya mesnili, a microparasite of cladocerans of the genus Daphnia. This analysis also serves as a case example of the usage of this pipeline to study within-population variation. Additionally, a public Illumina data set was used to validate the pipeline on community-level data. Overall, there was a good correspondence in absolute frequencies of C. mesnili ITS1 sequences obtained from Sanger and 454 platforms. Furthermore, analyses of molecular variance (amova) revealed that population structure of C. mesnili differs across lakes and years independently of the sequencing platform. Our results support not only the usefulness of amplicon sequencing data for studies of within-population structure but also the successful application of the QRS pipeline on Illumina-generated data. The QRS pipeline is freely available together with its documentation under GNU Public Licence version 3 at http://code.google.com/p/quantification-representative-sequences.
- 650 _2
- $a zvířata $7 D000818
- 650 _2
- $a výpočetní biologie $x metody $7 D019295
- 650 _2
- $a mezerníky ribozomální DNA $x chemie $x genetika $7 D021903
- 650 _2
- $a Daphnia $x parazitologie $7 D003621
- 650 12
- $a genetická variace $7 D014644
- 650 _2
- $a vysoce účinné nukleotidové sekvenování $7 D059014
- 650 _2
- $a Mesomycetozoea $x klasifikace $x genetika $7 D050298
- 650 12
- $a sekvenční analýza DNA $7 D017422
- 650 _2
- $a software $7 D012984
- 655 _2
- $a srovnávací studie $7 D003160
- 655 _2
- $a hodnotící studie $7 D023362
- 655 _2
- $a časopisecké články $7 D016428
- 655 _2
- $a práce podpořená grantem $7 D013485
- 700 1_
- $a Rusek, J $u Department of Biology II, Ludwig Maximilians University, Großhaderner Straße 2, 82512, Planegg-Martinsried, Germany.
- 700 1_
- $a Petrusek, A $u Department of Ecology, Faculty of Science, Charles University in Prague, Viničná 7, 128 44, Prague, Czech Republic.
- 700 1_
- $a Gießler, S $u Department of Biology II, Ludwig Maximilians University, Großhaderner Straße 2, 82512, Planegg-Martinsried, Germany.
- 700 1_
- $a Lyras, D $u Department of Biology II, Ludwig Maximilians University, Großhaderner Straße 2, 82512, Planegg-Martinsried, Germany.
- 700 1_
- $a Grath, S $u Department of Biology II, Ludwig Maximilians University, Großhaderner Straße 2, 82512, Planegg-Martinsried, Germany.
- 700 1_
- $a Castro-Monzón, F $u Department of Ecosystem Research, Leibniz-Institute of Freshwater Ecology and Inland Fisheries (IGB), Müggelseedamm 301, 12587, Berlin, Germany.
- 700 1_
- $a Wolinska, J $u Department of Ecosystem Research, Leibniz-Institute of Freshwater Ecology and Inland Fisheries (IGB), Müggelseedamm 301, 12587, Berlin, Germany.
- 773 0_
- $w MED00180393 $t Molecular ecology resources $x 1755-0998 $g Roč. 15, č. 6 (2015), s. 1385-95
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/25728529 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y a $z 0
- 990 __
- $a 20161005 $b ABA008
- 991 __
- $a 20161025125648 $b ABA008
- 999 __
- $a ok $b bmc $g 1166794 $s 953110
- BAS __
- $a 3
- BAS __
- $a PreBMC
- BMC __
- $a 2015 $b 15 $c 6 $d 1385-95 $e 20150305 $i 1755-0998 $m Molecular ecology resources $n Mol. ecol. resour. $x MED00180393
- LZP __
- $a Pubmed-20161005