• Something wrong with this record ?

The Quantification of Representative Sequences pipeline for amplicon sequencing: case study on within-population ITS1 sequence variation in a microparasite infecting Daphnia

E. González-Tortuero, J. Rusek, A. Petrusek, S. Gießler, D. Lyras, S. Grath, F. Castro-Monzón, J. Wolinska,

. 2015 ; 15 (6) : 1385-95. [pub] 20150305

Language English Country England, Great Britain

Document type Comparative Study, Evaluation Study, Journal Article, Research Support, Non-U.S. Gov't

Next generation sequencing (NGS) platforms are replacing traditional molecular biology protocols like cloning and Sanger sequencing. However, accuracy of NGS platforms has rarely been measured when quantifying relative frequencies of genotypes or taxa within populations. Here we developed a new bioinformatic pipeline (QRS) that pools similar sequence variants and estimates their frequencies in NGS data sets from populations or communities. We tested whether the estimated frequency of representative sequences, generated by 454 amplicon sequencing, differs significantly from that obtained by Sanger sequencing of cloned PCR products. This was performed by analysing sequence variation of the highly variable first internal transcribed spacer (ITS1) of the ichthyosporean Caullerya mesnili, a microparasite of cladocerans of the genus Daphnia. This analysis also serves as a case example of the usage of this pipeline to study within-population variation. Additionally, a public Illumina data set was used to validate the pipeline on community-level data. Overall, there was a good correspondence in absolute frequencies of C. mesnili ITS1 sequences obtained from Sanger and 454 platforms. Furthermore, analyses of molecular variance (amova) revealed that population structure of C. mesnili differs across lakes and years independently of the sequencing platform. Our results support not only the usefulness of amplicon sequencing data for studies of within-population structure but also the successful application of the QRS pipeline on Illumina-generated data. The QRS pipeline is freely available together with its documentation under GNU Public Licence version 3 at http://code.google.com/p/quantification-representative-sequences.

References provided by Crossref.org

000      
00000naa a2200000 a 4500
001      
bmc16028480
003      
CZ-PrNML
005      
20161025125234.0
007      
ta
008      
161005s2015 enk f 000 0|eng||
009      
AR
024    7_
$a 10.1111/1755-0998.12396 $2 doi
024    7_
$a 10.1111/1755-0998.12396 $2 doi
035    __
$a (PubMed)25728529
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a enk
100    1_
$a González-Tortuero, E $u Department of Ecosystem Research, Leibniz-Institute of Freshwater Ecology and Inland Fisheries (IGB), Müggelseedamm 301, 12587, Berlin, Germany. Berlin Centre for Genomics in Biodiversity Research (BeGenDiv), Königin-Luise-Straße 6-8, 14195, Berlin, Germany. Department of Biology II, Ludwig Maximilians University, Großhaderner Straße 2, 82512, Planegg-Martinsried, Germany.
245    14
$a The Quantification of Representative Sequences pipeline for amplicon sequencing: case study on within-population ITS1 sequence variation in a microparasite infecting Daphnia / $c E. González-Tortuero, J. Rusek, A. Petrusek, S. Gießler, D. Lyras, S. Grath, F. Castro-Monzón, J. Wolinska,
520    9_
$a Next generation sequencing (NGS) platforms are replacing traditional molecular biology protocols like cloning and Sanger sequencing. However, accuracy of NGS platforms has rarely been measured when quantifying relative frequencies of genotypes or taxa within populations. Here we developed a new bioinformatic pipeline (QRS) that pools similar sequence variants and estimates their frequencies in NGS data sets from populations or communities. We tested whether the estimated frequency of representative sequences, generated by 454 amplicon sequencing, differs significantly from that obtained by Sanger sequencing of cloned PCR products. This was performed by analysing sequence variation of the highly variable first internal transcribed spacer (ITS1) of the ichthyosporean Caullerya mesnili, a microparasite of cladocerans of the genus Daphnia. This analysis also serves as a case example of the usage of this pipeline to study within-population variation. Additionally, a public Illumina data set was used to validate the pipeline on community-level data. Overall, there was a good correspondence in absolute frequencies of C. mesnili ITS1 sequences obtained from Sanger and 454 platforms. Furthermore, analyses of molecular variance (amova) revealed that population structure of C. mesnili differs across lakes and years independently of the sequencing platform. Our results support not only the usefulness of amplicon sequencing data for studies of within-population structure but also the successful application of the QRS pipeline on Illumina-generated data. The QRS pipeline is freely available together with its documentation under GNU Public Licence version 3 at http://code.google.com/p/quantification-representative-sequences.
650    _2
$a zvířata $7 D000818
650    _2
$a výpočetní biologie $x metody $7 D019295
650    _2
$a mezerníky ribozomální DNA $x chemie $x genetika $7 D021903
650    _2
$a Daphnia $x parazitologie $7 D003621
650    12
$a genetická variace $7 D014644
650    _2
$a vysoce účinné nukleotidové sekvenování $7 D059014
650    _2
$a Mesomycetozoea $x klasifikace $x genetika $7 D050298
650    12
$a sekvenční analýza DNA $7 D017422
650    _2
$a software $7 D012984
655    _2
$a srovnávací studie $7 D003160
655    _2
$a hodnotící studie $7 D023362
655    _2
$a časopisecké články $7 D016428
655    _2
$a práce podpořená grantem $7 D013485
700    1_
$a Rusek, J $u Department of Biology II, Ludwig Maximilians University, Großhaderner Straße 2, 82512, Planegg-Martinsried, Germany.
700    1_
$a Petrusek, A $u Department of Ecology, Faculty of Science, Charles University in Prague, Viničná 7, 128 44, Prague, Czech Republic.
700    1_
$a Gießler, S $u Department of Biology II, Ludwig Maximilians University, Großhaderner Straße 2, 82512, Planegg-Martinsried, Germany.
700    1_
$a Lyras, D $u Department of Biology II, Ludwig Maximilians University, Großhaderner Straße 2, 82512, Planegg-Martinsried, Germany.
700    1_
$a Grath, S $u Department of Biology II, Ludwig Maximilians University, Großhaderner Straße 2, 82512, Planegg-Martinsried, Germany.
700    1_
$a Castro-Monzón, F $u Department of Ecosystem Research, Leibniz-Institute of Freshwater Ecology and Inland Fisheries (IGB), Müggelseedamm 301, 12587, Berlin, Germany.
700    1_
$a Wolinska, J $u Department of Ecosystem Research, Leibniz-Institute of Freshwater Ecology and Inland Fisheries (IGB), Müggelseedamm 301, 12587, Berlin, Germany.
773    0_
$w MED00180393 $t Molecular ecology resources $x 1755-0998 $g Roč. 15, č. 6 (2015), s. 1385-95
856    41
$u https://pubmed.ncbi.nlm.nih.gov/25728529 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y a $z 0
990    __
$a 20161005 $b ABA008
991    __
$a 20161025125648 $b ABA008
999    __
$a ok $b bmc $g 1166794 $s 953110
BAS    __
$a 3
BAS    __
$a PreBMC
BMC    __
$a 2015 $b 15 $c 6 $d 1385-95 $e 20150305 $i 1755-0998 $m Molecular ecology resources $n Mol. ecol. resour. $x MED00180393
LZP    __
$a Pubmed-20161005

Find record

Citation metrics

Loading data ...

Archiving options

Loading data ...