Detail
Článek
Článek online
FT
Medvik - BMČ
  • Je něco špatně v tomto záznamu ?

Genotype imputation from low-coverage data for medical and population genetic analyses

SA. Biagini, S. Becelaere, M. Aerden, T. Jatsenko, L. Hannes, P. Van Damme, J. Breckpot, K. Devriendt, B. Thienpont, JR. Vermeesch, I. Cleynen, T. Kivisild

. 2025 ; 35 (9) : 1929-1941. [pub] 20250902

Jazyk angličtina Země Spojené státy americké

Typ dokumentu časopisecké články

Perzistentní odkaz   https://www.medvik.cz/link/bmc25021661
E-zdroje Online Plný text

NLK Free Medical Journals od 1991 do Před 6 měsíci
Freely Accessible Science Journals od 1991-08-01 do Před 1 rokem
PubMed Central od 1997 do Před 6 měsíci
Europe PubMed Central od 1997 do Před 6 měsíci
Open Access Digital Library od 1991-08-01
Open Access Digital Library od 1991-08-01

Genotype imputation from low-pass sequencing data presents unique opportunities for genomic analyses but comes with specific challenges. In this study, we explore the impact of quality filters on genetic ancestry and Polygenic Score (PGS) estimation after imputing 32,769 low-pass genome-wide sequences (LPS) from noninvasive prenatal screening (NIPS) with an average autosomal sequence depth of ∼0.15×. In studies involving ultra-low coverage sequences, conventional approaches to secure genotype accuracy may fail, especially when multiple samples are pooled. To enhance the proportion of high-quality genotypes in large data sets, we introduce a filtering approach called GDI that combines genotype probability (GP), alternate allele dosage (DS), and INFO score filters. We demonstrate that the imputation tools QUILT and GLIMPSE2 achieve similar accuracy, which is high enough for broad-scale ancestry mapping but insufficient for high resolution principal component analysis (PCA), when applied without filters. With the GDI approach, we can achieve quality that is adequate for such purposes. Furthermore, we explored the impact of imputation errors, choice of variants, and filtering methods on PGS prediction for height in 1911 subjects with height data. We show that polygenic scores predict 23.7% of variance in height in our imputed data and that, contrary to the effect on PCA, the GDI filter does not improve the performance of PGS in height prediction. These results highlight that imputed LPS data can be leveraged for further biomedical and population genetic use, but there is a need to consider each downstream analysis tool individually for its imputation quality thresholds and filtering requirements.

Citace poskytuje Crossref.org

000      
00000naa a2200000 a 4500
001      
bmc25021661
003      
CZ-PrNML
005      
20251023075749.0
007      
ta
008      
251014s2025 xxu f 000 0|eng||
009      
AR
024    7_
$a 10.1101/gr.280175.124 $2 doi
035    __
$a (PubMed)40695596
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a xxu
100    1_
$a Biagini, Simone Andrea $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $u Department of Archaeology and Museology, Masaryk University, 662 43 Brno, Czech Republic $u Center of Molecular Medicine, Central European Institute of Technology, Masaryk University, 662 43 Brno, Czech Republic $1 https://orcid.org/0000000244883162
245    10
$a Genotype imputation from low-coverage data for medical and population genetic analyses / $c SA. Biagini, S. Becelaere, M. Aerden, T. Jatsenko, L. Hannes, P. Van Damme, J. Breckpot, K. Devriendt, B. Thienpont, JR. Vermeesch, I. Cleynen, T. Kivisild
520    9_
$a Genotype imputation from low-pass sequencing data presents unique opportunities for genomic analyses but comes with specific challenges. In this study, we explore the impact of quality filters on genetic ancestry and Polygenic Score (PGS) estimation after imputing 32,769 low-pass genome-wide sequences (LPS) from noninvasive prenatal screening (NIPS) with an average autosomal sequence depth of ∼0.15×. In studies involving ultra-low coverage sequences, conventional approaches to secure genotype accuracy may fail, especially when multiple samples are pooled. To enhance the proportion of high-quality genotypes in large data sets, we introduce a filtering approach called GDI that combines genotype probability (GP), alternate allele dosage (DS), and INFO score filters. We demonstrate that the imputation tools QUILT and GLIMPSE2 achieve similar accuracy, which is high enough for broad-scale ancestry mapping but insufficient for high resolution principal component analysis (PCA), when applied without filters. With the GDI approach, we can achieve quality that is adequate for such purposes. Furthermore, we explored the impact of imputation errors, choice of variants, and filtering methods on PGS prediction for height in 1911 subjects with height data. We show that polygenic scores predict 23.7% of variance in height in our imputed data and that, contrary to the effect on PCA, the GDI filter does not improve the performance of PGS in height prediction. These results highlight that imputed LPS data can be leveraged for further biomedical and population genetic use, but there is a need to consider each downstream analysis tool individually for its imputation quality thresholds and filtering requirements.
650    _2
$a lidé $7 D006801
650    12
$a genotyp $7 D005838
650    _2
$a ženské pohlaví $7 D005260
650    _2
$a multifaktoriální dědičnost $7 D020412
650    12
$a populační genetika $x metody $7 D005828
650    _2
$a celogenomová asociační studie $x metody $7 D055106
650    _2
$a jednonukleotidový polymorfismus $7 D020641
650    _2
$a těhotenství $7 D011247
650    _2
$a tělesná výška $x genetika $7 D001827
655    _2
$a časopisecké články $7 D016428
700    1_
$a Becelaere, Sara $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium
700    1_
$a Aerden, Mio $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $u Center for Human Genetics, University Hospitals Leuven, University of Leuven, Leuven 3000, Belgium
700    1_
$a Jatsenko, Tatjana $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium
700    1_
$a Hannes, Laurens $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $u Center for Human Genetics, University Hospitals Leuven, University of Leuven, Leuven 3000, Belgium
700    1_
$a Van Damme, Philip $u Laboratory of Neurobiology, Neuroscience Department, KU Leuven, Leuven 3000, Belgium $u Neurology Department, University Hospitals Leuven, Leuven 3000, Belgium $1 https://orcid.org/0000000240102357
700    1_
$a Breckpot, Jeroen $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $u Center for Human Genetics, University Hospitals Leuven, University of Leuven, Leuven 3000, Belgium
700    1_
$a Devriendt, Koenraad $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $u Center for Human Genetics, University Hospitals Leuven, University of Leuven, Leuven 3000, Belgium
700    1_
$a Thienpont, Bernard $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium $1 https://orcid.org/0000000287726845
700    1_
$a Vermeesch, Joris Robert $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium
700    1_
$a Cleynen, Isabelle $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium
700    1_
$a Kivisild, Toomas $u Department of Human Genetics, KU Leuven, Leuven 3000, Belgium; toomas.kivisild@kuleuven.be $u Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
773    0_
$w MED00001911 $t Genome research $x 1549-5469 $g Roč. 35, č. 9 (2025), s. 1929-1941
856    41
$u https://pubmed.ncbi.nlm.nih.gov/40695596 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y - $z 0
990    __
$a 20251014 $b ABA008
991    __
$a 20251023075755 $b ABA008
999    __
$a ok $b bmc $g 2416833 $s 1259824
BAS    __
$a 3
BAS    __
$a PreBMC-MEDLINE
BMC    __
$a 2025 $b 35 $c 9 $d 1929-1941 $e 20250902 $i 1549-5469 $m Genome research $n Genome Res $x MED00001911
LZP    __
$a Pubmed-20251014

Najít záznam

Citační ukazatele

Pouze přihlášení uživatelé

Možnosti archivace

Nahrávání dat ...