-
Je něco špatně v tomto záznamu ?
Error correction and statistical analyses for intra-host comparisons of feline immunodeficiency virus diversity from high-throughput sequencing data
Y. Liu, F. Chiaromonte, H. Ross, R. Malhotra, D. Elleder, M. Poss,
Jazyk angličtina Země Anglie, Velká Británie
Typ dokumentu časopisecké články, Research Support, N.I.H., Extramural
NLK
BioMedCentral
od 2000-01-12
BioMedCentral Open Access
od 2000
Directory of Open Access Journals
od 2000
Free Medical Journals
od 2000
PubMed Central
od 2000
Europe PubMed Central
od 2000
ProQuest Central
od 2009-01-01
Open Access Digital Library
od 2000-07-01
Open Access Digital Library
od 2000-01-01
Open Access Digital Library
od 2000-01-01
Medline Complete (EBSCOhost)
od 2000-01-01
Health & Medicine (ProQuest)
od 2009-01-01
ROAD: Directory of Open Access Scholarly Resources
od 2000
Springer Nature OA/Free Journals
od 2000-12-01
- MeSH
- algoritmy MeSH
- DNA virů genetika MeSH
- interpretace statistických dat * MeSH
- kočičí AIDS genetika imunologie virologie MeSH
- kočky MeSH
- nemoci koček genetika imunologie přenos virologie MeSH
- statistické modely * MeSH
- virus kočičí imunodeficience klasifikace genetika patogenita MeSH
- vysoce účinné nukleotidové sekvenování metody MeSH
- zvířata MeSH
- Check Tag
- kočky MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- Research Support, N.I.H., Extramural MeSH
BACKGROUND: Infection with feline immunodeficiency virus (FIV) causes an immunosuppressive disease whose consequences are less severe if cats are co-infected with an attenuated FIV strain (PLV). We use virus diversity measurements, which reflect replication ability and the virus response to various conditions, to test whether diversity of virulent FIV in lymphoid tissues is altered in the presence of PLV. Our data consisted of the 3' half of the FIV genome from three tissues of animals infected with FIV alone, or with FIV and PLV, sequenced by 454 technology. RESULTS: Since rare variants dominate virus populations, we had to carefully distinguish sequence variation from errors due to experimental protocols and sequencing. We considered an exponential-normal convolution model used for background correction of microarray data, and modified it to formulate an error correction approach for minor allele frequencies derived from high-throughput sequencing. Similar to accounting for over-dispersion in counts, this accounts for error-inflated variability in frequencies - and quite effectively reproduces empirically observed distributions. After obtaining error-corrected minor allele frequencies, we applied ANalysis Of VAriance (ANOVA) based on a linear mixed model and found that conserved sites and transition frequencies in FIV genes differ among tissues of dual and single infected cats. Furthermore, analysis of minor allele frequencies at individual FIV genome sites revealed 242 sites significantly affected by infection status (dual vs. single) or infection status by tissue interaction. All together, our results demonstrated a decrease in FIV diversity in bone marrow in the presence of PLV. Importantly, these effects were weakened or undetectable when error correction was performed with other approaches (thresholding of minor allele frequencies; probabilistic clustering of reads). We also queried the data for cytidine deaminase activity on the viral genome, which causes an asymmetric increase in G to A substitutions, but found no evidence for this host defense strategy. CONCLUSIONS: Our error correction approach for minor allele frequencies (more sensitive and computationally efficient than other algorithms) and our statistical treatment of variation (ANOVA) were critical for effective use of high-throughput sequencing data in understanding viral diversity. We found that co-infection with PLV shifts FIV diversity from bone marrow to lymph node and spleen.
Citace poskytuje Crossref.org
- 000
- 00000naa a2200000 a 4500
- 001
- bmc15031311
- 003
- CZ-PrNML
- 005
- 20151008115654.0
- 007
- ta
- 008
- 151005s2015 enk f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1186/s12859-015-0607-z $2 doi
- 035 __
- $a (PubMed)26123018
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a enk
- 100 1_
- $a Liu, Yang $u Department of Statistics, The Pennsylvania State University, University Park, PA, 16802, USA. ywl5222@psu.edu. The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, 16802, USA. ywl5222@psu.edu.
- 245 10
- $a Error correction and statistical analyses for intra-host comparisons of feline immunodeficiency virus diversity from high-throughput sequencing data / $c Y. Liu, F. Chiaromonte, H. Ross, R. Malhotra, D. Elleder, M. Poss,
- 520 9_
- $a BACKGROUND: Infection with feline immunodeficiency virus (FIV) causes an immunosuppressive disease whose consequences are less severe if cats are co-infected with an attenuated FIV strain (PLV). We use virus diversity measurements, which reflect replication ability and the virus response to various conditions, to test whether diversity of virulent FIV in lymphoid tissues is altered in the presence of PLV. Our data consisted of the 3' half of the FIV genome from three tissues of animals infected with FIV alone, or with FIV and PLV, sequenced by 454 technology. RESULTS: Since rare variants dominate virus populations, we had to carefully distinguish sequence variation from errors due to experimental protocols and sequencing. We considered an exponential-normal convolution model used for background correction of microarray data, and modified it to formulate an error correction approach for minor allele frequencies derived from high-throughput sequencing. Similar to accounting for over-dispersion in counts, this accounts for error-inflated variability in frequencies - and quite effectively reproduces empirically observed distributions. After obtaining error-corrected minor allele frequencies, we applied ANalysis Of VAriance (ANOVA) based on a linear mixed model and found that conserved sites and transition frequencies in FIV genes differ among tissues of dual and single infected cats. Furthermore, analysis of minor allele frequencies at individual FIV genome sites revealed 242 sites significantly affected by infection status (dual vs. single) or infection status by tissue interaction. All together, our results demonstrated a decrease in FIV diversity in bone marrow in the presence of PLV. Importantly, these effects were weakened or undetectable when error correction was performed with other approaches (thresholding of minor allele frequencies; probabilistic clustering of reads). We also queried the data for cytidine deaminase activity on the viral genome, which causes an asymmetric increase in G to A substitutions, but found no evidence for this host defense strategy. CONCLUSIONS: Our error correction approach for minor allele frequencies (more sensitive and computationally efficient than other algorithms) and our statistical treatment of variation (ANOVA) were critical for effective use of high-throughput sequencing data in understanding viral diversity. We found that co-infection with PLV shifts FIV diversity from bone marrow to lymph node and spleen.
- 650 _2
- $a algoritmy $7 D000465
- 650 _2
- $a zvířata $7 D000818
- 650 _2
- $a nemoci koček $x genetika $x imunologie $x přenos $x virologie $7 D002371
- 650 _2
- $a kočky $7 D002415
- 650 _2
- $a DNA virů $x genetika $7 D004279
- 650 12
- $a interpretace statistických dat $7 D003627
- 650 _2
- $a kočičí AIDS $x genetika $x imunologie $x virologie $7 D016181
- 650 _2
- $a vysoce účinné nukleotidové sekvenování $x metody $7 D059014
- 650 _2
- $a virus kočičí imunodeficience $x klasifikace $x genetika $x patogenita $7 D016089
- 650 12
- $a statistické modely $7 D015233
- 655 _2
- $a časopisecké články $7 D016428
- 655 _2
- $a Research Support, N.I.H., Extramural $7 D052061
- 700 1_
- $a Chiaromonte, Francesca $u Department of Statistics, The Pennsylvania State University, University Park, PA, 16802, USA. chiaro@stat.psu.edu. The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, 16802, USA. chiaro@stat.psu.edu.
- 700 1_
- $a Ross, Howard $u Bioinformatics Institute, School of Biological Sciences, University of Auckland, Auckland, 1142, New Zealand. h.ross@auckland.ac.nz.
- 700 1_
- $a Malhotra, Raunaq $u Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA, 16802, USA. raunaq@psu.edu.
- 700 1_
- $a Elleder, Daniel $u Department of Biology, The Pennsylvania State University, University Park, PA, 16802, USA. Daniel.Elleder@img.cas.cz. The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, 16802, USA. Daniel.Elleder@img.cas.cz. Current address: Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Videnska 1083, Prague, 14000, Czech Republic. Daniel.Elleder@img.cas.cz.
- 700 1_
- $a Poss, Mary $u Department of Biology, The Pennsylvania State University, University Park, PA, 16802, USA. mposs@bx.psu.edu. Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA, 16802, USA. mposs@bx.psu.edu. The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, 16802, USA. mposs@bx.psu.edu.
- 773 0_
- $w MED00008167 $t BMC bioinformatics $x 1471-2105 $g Roč. 16, č. - (2015), s. 202
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/26123018 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y a $z 0
- 990 __
- $a 20151005 $b ABA008
- 991 __
- $a 20151008115840 $b ABA008
- 999 __
- $a ok $b bmc $g 1092187 $s 914437
- BAS __
- $a 3
- BAS __
- $a PreBMC
- BMC __
- $a 2015 $b 16 $c - $d 202 $e 20150630 $i 1471-2105 $m BMC bioinformatics $n BMC Bioinformatics $x MED00008167
- LZP __
- $a Pubmed-20151005