-
Je něco špatně v tomto záznamu ?
Genotype imputation from low-coverage data for medical and population genetic analyses
SA. Biagini, S. Becelaere, M. Aerden, T. Jatsenko, L. Hannes, P. Van Damme, J. Breckpot, K. Devriendt, B. Thienpont, JR. Vermeesch, I. Cleynen, T. Kivisild
Jazyk angličtina Země Spojené státy americké
Typ dokumentu časopisecké články
NLK
Free Medical Journals
od 1991 do Před 6 měsíci
Freely Accessible Science Journals
od 1991-08-01 do Před 1 rokem
PubMed Central
od 1997 do Před 6 měsíci
Europe PubMed Central
od 1997 do Před 6 měsíci
Open Access Digital Library
od 1991-08-01
Open Access Digital Library
od 1991-08-01
PubMed
40695596
DOI
10.1101/gr.280175.124
Knihovny.cz E-zdroje
- MeSH
- celogenomová asociační studie metody MeSH
- genotyp * MeSH
- jednonukleotidový polymorfismus MeSH
- lidé MeSH
- multifaktoriální dědičnost MeSH
- populační genetika * metody MeSH
- těhotenství MeSH
- tělesná výška genetika MeSH
- Check Tag
- lidé MeSH
- těhotenství MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
Genotype imputation from low-pass sequencing data presents unique opportunities for genomic analyses but comes with specific challenges. In this study, we explore the impact of quality filters on genetic ancestry and Polygenic Score (PGS) estimation after imputing 32,769 low-pass genome-wide sequences (LPS) from noninvasive prenatal screening (NIPS) with an average autosomal sequence depth of ∼0.15×. In studies involving ultra-low coverage sequences, conventional approaches to secure genotype accuracy may fail, especially when multiple samples are pooled. To enhance the proportion of high-quality genotypes in large data sets, we introduce a filtering approach called GDI that combines genotype probability (GP), alternate allele dosage (DS), and INFO score filters. We demonstrate that the imputation tools QUILT and GLIMPSE2 achieve similar accuracy, which is high enough for broad-scale ancestry mapping but insufficient for high resolution principal component analysis (PCA), when applied without filters. With the GDI approach, we can achieve quality that is adequate for such purposes. Furthermore, we explored the impact of imputation errors, choice of variants, and filtering methods on PGS prediction for height in 1911 subjects with height data. We show that polygenic scores predict 23.7% of variance in height in our imputed data and that, contrary to the effect on PCA, the GDI filter does not improve the performance of PGS in height prediction. These results highlight that imputed LPS data can be leveraged for further biomedical and population genetic use, but there is a need to consider each downstream analysis tool individually for its imputation quality thresholds and filtering requirements.
Center for Human Genetics University Hospitals Leuven University of Leuven Leuven 3000 Belgium
Department of Archaeology and Museology Masaryk University 662 43 Brno Czech Republic
Department of Human Genetics KU Leuven Leuven 3000 Belgium
Estonian Biocentre Institute of Genomics University of Tartu Tartu 51010 Estonia
Laboratory of Neurobiology Neuroscience Department KU Leuven Leuven 3000 Belgium
Neurology Department University Hospitals Leuven Leuven 3000 Belgium
Citace poskytuje Crossref.org
- 000
- 00000naa a2200000 a 4500
- 001
- bmc25021661
- 003
- CZ-PrNML
- 005
- 20251023075749.0
- 007
- ta
- 008
- 251014s2025 xxu f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1101/gr.280175.124 $2 doi
- 035 __
- $a (PubMed)40695596
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a xxu
- 100 1_
- $a Biagini, Simone Andrea $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $u Department of Archaeology and Museology, Masaryk University, 662 43 Brno, Czech Republic $u Center of Molecular Medicine, Central European Institute of Technology, Masaryk University, 662 43 Brno, Czech Republic $1 https://orcid.org/0000000244883162
- 245 10
- $a Genotype imputation from low-coverage data for medical and population genetic analyses / $c SA. Biagini, S. Becelaere, M. Aerden, T. Jatsenko, L. Hannes, P. Van Damme, J. Breckpot, K. Devriendt, B. Thienpont, JR. Vermeesch, I. Cleynen, T. Kivisild
- 520 9_
- $a Genotype imputation from low-pass sequencing data presents unique opportunities for genomic analyses but comes with specific challenges. In this study, we explore the impact of quality filters on genetic ancestry and Polygenic Score (PGS) estimation after imputing 32,769 low-pass genome-wide sequences (LPS) from noninvasive prenatal screening (NIPS) with an average autosomal sequence depth of ∼0.15×. In studies involving ultra-low coverage sequences, conventional approaches to secure genotype accuracy may fail, especially when multiple samples are pooled. To enhance the proportion of high-quality genotypes in large data sets, we introduce a filtering approach called GDI that combines genotype probability (GP), alternate allele dosage (DS), and INFO score filters. We demonstrate that the imputation tools QUILT and GLIMPSE2 achieve similar accuracy, which is high enough for broad-scale ancestry mapping but insufficient for high resolution principal component analysis (PCA), when applied without filters. With the GDI approach, we can achieve quality that is adequate for such purposes. Furthermore, we explored the impact of imputation errors, choice of variants, and filtering methods on PGS prediction for height in 1911 subjects with height data. We show that polygenic scores predict 23.7% of variance in height in our imputed data and that, contrary to the effect on PCA, the GDI filter does not improve the performance of PGS in height prediction. These results highlight that imputed LPS data can be leveraged for further biomedical and population genetic use, but there is a need to consider each downstream analysis tool individually for its imputation quality thresholds and filtering requirements.
- 650 _2
- $a lidé $7 D006801
- 650 12
- $a genotyp $7 D005838
- 650 _2
- $a ženské pohlaví $7 D005260
- 650 _2
- $a multifaktoriální dědičnost $7 D020412
- 650 12
- $a populační genetika $x metody $7 D005828
- 650 _2
- $a celogenomová asociační studie $x metody $7 D055106
- 650 _2
- $a jednonukleotidový polymorfismus $7 D020641
- 650 _2
- $a těhotenství $7 D011247
- 650 _2
- $a tělesná výška $x genetika $7 D001827
- 655 _2
- $a časopisecké články $7 D016428
- 700 1_
- $a Becelaere, Sara $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium
- 700 1_
- $a Aerden, Mio $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $u Center for Human Genetics, University Hospitals Leuven, University of Leuven, Leuven 3000, Belgium
- 700 1_
- $a Jatsenko, Tatjana $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium
- 700 1_
- $a Hannes, Laurens $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $u Center for Human Genetics, University Hospitals Leuven, University of Leuven, Leuven 3000, Belgium
- 700 1_
- $a Van Damme, Philip $u Laboratory of Neurobiology, Neuroscience Department, KU Leuven, Leuven 3000, Belgium $u Neurology Department, University Hospitals Leuven, Leuven 3000, Belgium $1 https://orcid.org/0000000240102357
- 700 1_
- $a Breckpot, Jeroen $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $u Center for Human Genetics, University Hospitals Leuven, University of Leuven, Leuven 3000, Belgium
- 700 1_
- $a Devriendt, Koenraad $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $u Center for Human Genetics, University Hospitals Leuven, University of Leuven, Leuven 3000, Belgium
- 700 1_
- $a Thienpont, Bernard $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $1 https://orcid.org/0000000287726845
- 700 1_
- $a Vermeesch, Joris Robert $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium
- 700 1_
- $a Cleynen, Isabelle $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium
- 700 1_
- $a Kivisild, Toomas $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium; toomas.kivisild@kuleuven.be $u Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
- 773 0_
- $w MED00001911 $t Genome research $x 1549-5469 $g Roč. 35, č. 9 (2025), s. 1929-1941
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/40695596 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y - $z 0
- 990 __
- $a 20251014 $b ABA008
- 991 __
- $a 20251023075755 $b ABA008
- 999 __
- $a ok $b bmc $g 2416833 $s 1259824
- BAS __
- $a 3
- BAS __
- $a PreBMC-MEDLINE
- BMC __
- $a 2025 $b 35 $c 9 $d 1929-1941 $e 20250902 $i 1549-5469 $m Genome research $n Genome Res $x MED00001911
- LZP __
- $a Pubmed-20251014