Nejvíce citovaný článek - PubMed ID 25956247
Prediction accuracies for growth and wood attributes of interior spruce in space using genotyping-by-sequencing
BACKGROUND: Genomic selection (GS) or genomic prediction is a promising approach for tree breeding to obtain higher genetic gains by shortening time of progeny testing in breeding programs. As proof-of-concept for Scots pine (Pinus sylvestris L.), a genomic prediction study was conducted with 694 individuals representing 183 full-sib families that were genotyped with genotyping-by-sequencing (GBS) and phenotyped for growth and wood quality traits. 8719 SNPs were used to compare different genomic with pedigree prediction models. Additionally, four prediction efficiency methods were used to evaluate the impact of genomic breeding value estimations by assigning diverse ratios of training and validation sets, as well as several subsets of SNP markers. RESULTS: Genomic Best Linear Unbiased Prediction (GBLUP) and Bayesian Ridge Regression (BRR) combined with expectation maximization (EM) imputation algorithm showed slightly higher prediction efficiencies than Pedigree Best Linear Unbiased Prediction (PBLUP) and Bayesian LASSO, with some exceptions. A subset of approximately 6000 SNP markers, was enough to provide similar prediction efficiencies as the full set of 8719 markers. Additionally, prediction efficiencies of genomic models were enough to achieve a higher selection response, that varied between 50-143% higher than the traditional pedigree-based selection. CONCLUSIONS: Although prediction efficiencies were similar for genomic and pedigree models, the relative selection response was doubled for genomic models by assuming that earlier selections can be done at the seedling stage, reducing the progeny testing time, thus shortening the breeding cycle length roughly by 50%.
- Klíčová slova
- Bayesian, GBLUP, Pinus sylvestris, genotyping-by-sequencing, prediction accuracy, predictive ability, predictive accuracy, theoretical accuracy,
- MeSH
- Bayesova věta MeSH
- borovice lesní * genetika MeSH
- dřevo * genetika MeSH
- fenotyp MeSH
- genomika MeSH
- jednonukleotidový polymorfismus MeSH
- modely genetické MeSH
- rodokmen MeSH
- šlechtění rostlin MeSH
- Publikační typ
- časopisecké články MeSH
Here, we perform cross-generational GS analysis on coastal Douglas-fir (Pseudotsuga menziesii), reflecting trans-generational selective breeding application. A total of 1321 trees, representing 37 full-sib F1 families from 3 environments in British Columbia, Canada, were used as the training population for (1) EBVs (estimated breeding values) of juvenile height (HTJ) in the F1 generation predicting genomic EBVs of HTJ of 136 individuals in the F2 generation, (2) deregressed EBVs of F1 HTJ predicting deregressed genomic EBVs of F2 HTJ, (3) F1 mature height (HT35) predicting HTJ EBVs in F2, and (4) deregressed F1 HT35 predicting genomic deregressed HTJ EBVs in F2. Ridge regression best linear unbiased predictor (RR-BLUP), generalized ridge regression (GRR), and Bayes-B GS methods were used and compared to pedigree-based (ABLUP) predictions. GS accuracies for scenarios 1 (0.92, 0.91, and 0.91) and 3 (0.57, 0.56, and 0.58) were similar to their ABLUP counterparts (0.92 and 0.60, respectively) (using RR-BLUP, GRR, and Bayes-B). Results using deregressed values fell dramatically for both scenarios 2 and 4 which approached zero in many cases. Cross-generational GS validation of juvenile height in Douglas-fir produced predictive accuracies almost as high as that of ABLUP. Without capturing LD, GS cannot surpass the prediction of ABLUP. Here we tracked pedigree relatedness between training and validation sets. More markers or improved distribution of markers are required to capture LD in Douglas-fir. This is essential for accurate forward selection among siblings as markers that track pedigree are of little use for forward selection of individuals within controlled pollinated families.
- MeSH
- genomika MeSH
- lineární modely MeSH
- modely genetické MeSH
- Pseudotsuga genetika růst a vývoj MeSH
- šlechtění rostlin MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Geografické názvy
- Britská Kolumbie MeSH
The advantages of open-pollinated (OP) family testing over controlled crossing (i.e., structured pedigree) are the potential to screen and rank a large number of parents and offspring with minimal cost and efforts; however, the method produces inflated genetic parameters as the actual sibling relatedness within OP families rarely meets the half-sib relatedness assumption. Here, we demonstrate the unsurpassed utility of OP testing after shifting the analytical mode from pedigree- (ABLUP) to genomic-based (GBLUP) relationship using phenotypic tree height (HT) and wood density (WD) and genotypic (30k SNPs) data for 1126 38-year-old Interior spruce (Picea glauca (Moench) Voss x P. engelmannii Parry ex Engelm.) trees, representing 25 OP families, growing on three sites in Interior British Columbia, Canada. The use of the genomic realized relationship permitted genetic variance decomposition to additive, dominance, and epistatic genetic variances, and their interactions with the environment, producing more accurate narrow-sense heritability and breeding value estimates as compared to the pedigree-based counterpart. The impact of retaining (random folding) vs. removing (family folding) genetic similarity between the training and validation populations on the predictive accuracy of genomic selection was illustrated and highlighted the former caveats and latter advantages. Moreover, GBLUP models allowed breeding value prediction for individuals from families that were not included in the developed models, which was not possible with the ABLUP. Response to selection differences between the ABLUP and GBLUP models indicated the presence of systematic genetic gain overestimation of 35 and 63% for HT and WD, respectively, mainly caused by the inflated estimates of additive genetic variance and individuals' breeding values given by the ABLUP models. Extending the OP genomic-based models from single to multisite made the analysis applicable to existing OP testing programs.
- Klíčová slova
- Genetic variance decomposition, Interior spruce, Multienvironment, Open-pollinated families, Pedigree- and marker-based relationships,
- Publikační typ
- časopisecké články MeSH
BACKGROUND: Genomic selection (GS) can offer unprecedented gains, in terms of cost efficiency and generation turnover, to forest tree selective breeding; especially for late expressing and low heritability traits. Here, we used: 1) exome capture as a genotyping platform for 1372 Douglas-fir trees representing 37 full-sib families growing on three sites in British Columbia, Canada and 2) height growth and wood density (EBVs), and deregressed estimated breeding values (DEBVs) as phenotypes. Representing models with (EBVs) and without (DEBVs) pedigree structure. Ridge regression best linear unbiased predictor (RR-BLUP) and generalized ridge regression (GRR) were used to assess their predictive accuracies over space (within site, cross-sites, multi-site, and multi-site to single site) and time (age-age/ trait-trait). RESULTS: The RR-BLUP and GRR models produced similar predictive accuracies across the studied traits. Within-site GS prediction accuracies with models trained on EBVs were high (RR-BLUP: 0.79-0.91 and GRR: 0.80-0.91), and were generally similar to the multi-site (RR-BLUP: 0.83-0.91, GRR: 0.83-0.91) and multi-site to single-site predictive accuracies (RR-BLUP: 0.79-0.92, GRR: 0.79-0.92). Cross-site predictions were surprisingly high, with predictive accuracies within a similar range (RR-BLUP: 0.79-0.92, GRR: 0.78-0.91). Height at 12 years was deemed the earliest acceptable age at which accurate predictions can be made concerning future height (age-age) and wood density (trait-trait). Using DEBVs reduced the accuracies of all cross-validation procedures dramatically, indicating that the models were tracking pedigree (family means), rather than marker-QTL LD. CONCLUSIONS: While GS models' prediction accuracies were high, the main driving force was the pedigree tracking rather than LD. It is likely that many more markers are needed to increase the chance of capturing the LD between causal genes and markers.
- Klíčová slova
- Douglas-fir, Exome capture, Full-sib families, Genomic selection, Genotype x environment interaction, Predictive model,
- MeSH
- dřevo chemie genetika MeSH
- exom * MeSH
- genomika MeSH
- genotyp MeSH
- lineární modely MeSH
- lokus kvantitativního znaku MeSH
- modely genetické * MeSH
- Pseudotsuga genetika růst a vývoj MeSH
- selekce (genetika) * MeSH
- šlechtění rostlin * MeSH
- vysoce účinné nukleotidové sekvenování MeSH
- Publikační typ
- časopisecké články MeSH
The open-pollinated (OP) family testing combines the simplest known progeny evaluation and quantitative genetics analyses as candidates' offspring are assumed to represent independent half-sib families. The accuracy of genetic parameter estimates is often questioned as the assumption of "half-sibling" in OP families may often be violated. We compared the pedigree- vs. marker-based genetic models by analysing 22-yr height and 30-yr wood density for 214 white spruce [Picea glauca (Moench) Voss] OP families represented by 1694 individuals growing on one site in Quebec, Canada. Assuming half-sibling, the pedigree-based model was limited to estimating the additive genetic variances which, in turn, were grossly overestimated as they were confounded by very minor dominance and major additive-by-additive epistatic genetic variances. In contrast, the implemented genomic pairwise realized relationship models allowed the disentanglement of additive from all nonadditive factors through genetic variance decomposition. The marker-based models produced more realistic narrow-sense heritability estimates and, for the first time, allowed estimating the dominance and epistatic genetic variances from OP testing. In addition, the genomic models showed better prediction accuracies compared to pedigree models and were able to predict individual breeding values for new individuals from untested families, which was not possible using the pedigree-based model. Clearly, the use of marker-based relationship approach is effective in estimating the quantitative genetic parameters of complex traits even under simple and shallow pedigree structure.
- Klíčová slova
- GenPred, Mendelian sampling term, genetic variance decomposition, genomic selection, open-pollinated families, pedigree- and marker-based relationships, shared data resource,
- MeSH
- algoritmy MeSH
- fenotyp MeSH
- genetická variace MeSH
- genom rostlinný * MeSH
- genomika * metody MeSH
- genotyp MeSH
- genotypizační techniky MeSH
- kvantitativní znak dědičný MeSH
- modely genetické MeSH
- opylení genetika MeSH
- smrk klasifikace genetika MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Genomic selection (GS) potentially offers an unparalleled advantage over traditional pedigree-based selection (TS) methods by reducing the time commitment required to carry out a single cycle of tree improvement. This quality is particularly appealing to tree breeders, where lengthy improvement cycles are the norm. We explored the prospect of implementing GS for interior spruce (Picea engelmannii × glauca) utilizing a genotyped population of 769 trees belonging to 25 open-pollinated families. A series of repeated tree height measurements through ages 3-40 years permitted the testing of GS methods temporally. The genotyping-by-sequencing (GBS) platform was used for single nucleotide polymorphism (SNP) discovery in conjunction with three unordered imputation methods applied to a data set with 60% missing information. Further, three diverse GS models were evaluated based on predictive accuracy (PA), and their marker effects. Moderate levels of PA (0.31-0.55) were observed and were of sufficient capacity to deliver improved selection response over TS. Additionally, PA varied substantially through time accordingly with spatial competition among trees. As expected, temporal PA was well correlated with age-age genetic correlation (r=0.99), and decreased substantially with increasing difference in age between the training and validation populations (0.04-0.47). Moreover, our imputation comparisons indicate that k-nearest neighbor and singular value decomposition yielded a greater number of SNPs and gave higher predictive accuracies than imputing with the mean. Furthermore, the ridge regression (rrBLUP) and BayesCπ (BCπ) models both yielded equal, and better PA than the generalized ridge regression heteroscedastic effect model for the traits evaluated.
- MeSH
- genotyp MeSH
- genotypizační techniky metody MeSH
- jednonukleotidový polymorfismus MeSH
- modely genetické * MeSH
- populační genetika MeSH
- selekce (genetika) * MeSH
- smrk genetika MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- srovnávací studie MeSH
- Geografické názvy
- Britská Kolumbie MeSH