Genome-wide association studies (GWAS) have identified more than 200 common genetic variants independently associated with colorectal cancer (CRC) risk, but the causal variants and target genes are mostly unknown. We sought to fine-map all known CRC risk loci using GWAS data from 100,204 cases and 154,587 controls of East Asian and European ancestry. Our stepwise conditional analyses revealed 238 independent association signals of CRC risk, each with a set of credible causal variants (CCVs), of which 28 signals had a single CCV. Our cis-eQTL/mQTL and colocalization analyses using colorectal tissue-specific transcriptome and methylome data separately from 1299 and 321 individuals, along with functional genomic investigation, uncovered 136 putative CRC susceptibility genes, including 56 genes not previously reported. Analyses of single-cell RNA-seq data from colorectal tissues revealed 17 putative CRC susceptibility genes with distinct expression patterns in specific cell types. Analyses of whole exome sequencing data provided additional support for several target genes identified in this study as CRC susceptibility genes. Enrichment analyses of the 136 genes uncover pathways not previously linked to CRC risk. Our study substantially expanded association signals for CRC and provided additional insight into the biological mechanisms underlying CRC development.
- MeSH
- Asijci * genetika MeSH
- běloši * genetika MeSH
- celogenomová asociační studie * MeSH
- genetická predispozice k nemoci * MeSH
- jednonukleotidový polymorfismus * MeSH
- kolorektální nádory * genetika MeSH
- lidé MeSH
- lokus kvantitativního znaku * MeSH
- mapování chromozomů MeSH
- sekvenování exomu MeSH
- studie případů a kontrol MeSH
- transkriptom MeSH
- východní Asiaté MeSH
- Check Tag
- lidé MeSH
- mužské pohlaví MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Research Support, N.I.H., Extramural MeSH
BACKGROUND: Transcriptome-wide association studies have been successful in identifying candidate susceptibility genes for colorectal cancer (CRC). To strengthen susceptibility gene discovery, we conducted a large transcriptome-wide association study and an alternative splicing transcriptome-wide association study in CRC using improved genetic prediction models and performed in-depth functional investigations. METHODS: We analyzed RNA-sequencing data from normal colon tissues and genotype data from 423 European descendants to build genetic prediction models of gene expression and alternative splicing and evaluated model performance using independent RNA-sequencing data from normal colon tissues of the Genotype-Tissue Expression Project. We applied the verified models to genome-wide association studies (GWAS) summary statistics among 58 131 CRC cases and 67 347 controls of European ancestry to evaluate associations of genetically predicted gene expression and alternative splicing with CRC risk. We performed in vitro functional assays for 3 selected genes in multiple CRC cell lines. RESULTS: We identified 57 putative CRC susceptibility genes, which included the 48 genes from transcriptome-wide association studies and 15 genes from splicing transcriptome-wide association studies, at a Bonferroni-corrected P value less than .05. Of these, 16 genes were not previously implicated in CRC susceptibility, including a gene PDE7B (6q23.3) at locus previously not reported by CRC GWAS. Gene knockdown experiments confirmed the oncogenic roles for 2 unreported genes, TRPS1 and METRNL, and a recently reported gene, C14orf166. CONCLUSION: This study discovered new putative susceptibility genes of CRC and provided novel insights into the biological mechanisms underlying CRC development.
- MeSH
- celogenomová asociační studie MeSH
- genetická predispozice k nemoci MeSH
- jednonukleotidový polymorfismus MeSH
- kolorektální nádory * genetika MeSH
- lidé MeSH
- represorové proteiny genetika MeSH
- RNA MeSH
- transkriptom * MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Research Support, N.I.H., Extramural MeSH
Colorectal cancer (CRC) is a leading cause of mortality worldwide. We conducted a genome-wide association study meta-analysis of 100,204 CRC cases and 154,587 controls of European and east Asian ancestry, identifying 205 independent risk associations, of which 50 were unreported. We performed integrative genomic, transcriptomic and methylomic analyses across large bowel mucosa and other tissues. Transcriptome- and methylome-wide association studies revealed an additional 53 risk associations. We identified 155 high-confidence effector genes functionally linked to CRC risk, many of which had no previously established role in CRC. These have multiple different functions and specifically indicate that variation in normal colorectal homeostasis, proliferation, cell adhesion, migration, immunity and microbial interactions determines CRC risk. Crosstissue analyses indicated that over a third of effector genes most probably act outside the colonic mucosa. Our findings provide insights into colorectal oncogenesis and highlight potential targets across tissues for new CRC treatment and chemoprevention strategies.
- MeSH
- celogenomová asociační studie MeSH
- Evropané * genetika MeSH
- genetická predispozice k nemoci MeSH
- jednonukleotidový polymorfismus genetika MeSH
- kolorektální nádory * genetika MeSH
- lidé MeSH
- multiomika MeSH
- východní Asiaté * genetika MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- metaanalýza MeSH
- práce podpořená grantem MeSH
- Research Support, N.I.H., Extramural MeSH
Polygenic risk scores (PRS) have great potential to guide precision colorectal cancer (CRC) prevention by identifying those at higher risk to undertake targeted screening. However, current PRS using European ancestry data have sub-optimal performance in non-European ancestry populations, limiting their utility among these populations. Towards addressing this deficiency, we expand PRS development for CRC by incorporating Asian ancestry data (21,731 cases; 47,444 controls) into European ancestry training datasets (78,473 cases; 107,143 controls). The AUC estimates (95% CI) of PRS are 0.63(0.62-0.64), 0.59(0.57-0.61), 0.62(0.60-0.63), and 0.65(0.63-0.66) in independent datasets including 1681-3651 cases and 8696-115,105 controls of Asian, Black/African American, Latinx/Hispanic, and non-Hispanic White, respectively. They are significantly better than the European-centric PRS in all four major US racial and ethnic groups (p-values < 0.05). Further inclusion of non-European ancestry populations, especially Black/African American and Latinx/Hispanic, is needed to improve the risk prediction and enhance equity in applying PRS in clinical practice.
- MeSH
- celogenomová asociační studie MeSH
- etnicita * genetika MeSH
- genetická predispozice k nemoci MeSH
- jednonukleotidový polymorfismus MeSH
- kolorektální nádory * diagnóza genetika MeSH
- lidé MeSH
- multifaktoriální dědičnost MeSH
- rizikové faktory MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- Research Support, N.I.H., Extramural MeSH
- MeSH
- dědičné nepolypózní kolorektální nádory * genetika MeSH
- kolorektální nádory * genetika MeSH
- lidé MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Research Support, N.I.H., Extramural MeSH
- Research Support, N.I.H., Intramural MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH
- Research Support, U.S. Gov't, P.H.S. MeSH
BACKGROUND: Polygenic risk scores (PRS) which summarize individuals' genetic risk profile may enhance targeted colorectal cancer screening. A critical step towards clinical implementation is rigorous external validations in large community-based cohorts. This study externally validated a PRS-enhanced colorectal cancer risk model comprising 140 known colorectal cancer loci to provide a comprehensive assessment on prediction performance. METHODS: The model was developed using 20,338 individuals and externally validated in a community-based cohort (n = 85,221). We validated predicted 5-year absolute colorectal cancer risk, including calibration using expected-to-observed case ratios (E/O) and calibration plots, and discriminatory accuracy using time-dependent AUC. The PRS-related improvement in AUC, sensitivity and specificity were assessed in individuals of age 45 to 74 years (screening-eligible age group) and 40 to 49 years with no endoscopy history (younger-age group). RESULTS: In European-ancestral individuals, the predicted 5-year risk calibrated well [E/O = 1.01; 95% confidence interval (CI), 0.91-1.13] and had high discriminatory accuracy (AUC = 0.73; 95% CI, 0.71-0.76). Adding the PRS to a model with age, sex, family and endoscopy history improved the 5-year AUC by 0.06 (P < 0.001) and 0.14 (P = 0.05) in the screening-eligible age and younger-age groups, respectively. Using a risk-threshold of 5-year SEER colorectal cancer incidence rate at age 50 years, adding the PRS had a similar sensitivity but improved the specificity by 11% (P < 0.001) in the screening-eligible age group. In the younger-age group it improved the sensitivity by 27% (P = 0.04) with similar specificity. CONCLUSIONS: The proposed PRS-enhanced model provides a well-calibrated 5-year colorectal cancer risk prediction and improves discriminatory accuracy in the external cohort. IMPACT: The proposed model has potential utility in risk-stratified colorectal cancer prevention.
BACKGROUND & AIMS: Early-onset colorectal cancer (CRC, in persons younger than 50 years old) is increasing in incidence; yet, in the absence of a family history of CRC, this population lacks harmonized recommendations for prevention. We aimed to determine whether a polygenic risk score (PRS) developed from 95 CRC-associated common genetic risk variants was associated with risk for early-onset CRC. METHODS: We studied risk for CRC associated with a weighted PRS in 12,197 participants younger than 50 years old vs 95,865 participants 50 years or older. PRS was calculated based on single nucleotide polymorphisms associated with CRC in a large-scale genome-wide association study as of January 2019. Participants were pooled from 3 large consortia that provided clinical and genotyping data: the Colon Cancer Family Registry, the Colorectal Transdisciplinary Study, and the Genetics and Epidemiology of Colorectal Cancer Consortium and were all of genetically defined European descent. Findings were replicated in an independent cohort of 72,573 participants. RESULTS: Overall associations with CRC per standard deviation of PRS were significant for early-onset cancer, and were stronger compared with late-onset cancer (P for interaction = .01); when we compared the highest PRS quartile with the lowest, risk increased 3.7-fold for early-onset CRC (95% CI 3.28-4.24) vs 2.9-fold for late-onset CRC (95% CI 2.80-3.04). This association was strongest for participants without a first-degree family history of CRC (P for interaction = 5.61 × 10-5). When we compared the highest with the lowest quartiles in this group, risk increased 4.3-fold for early-onset CRC (95% CI 3.61-5.01) vs 2.9-fold for late-onset CRC (95% CI 2.70-3.00). Sensitivity analyses were consistent with these findings. CONCLUSIONS: In an analysis of associations with CRC per standard deviation of PRS, we found the cumulative burden of CRC-associated common genetic variants to associate with early-onset cancer, and to be more strongly associated with early-onset than late-onset cancer, particularly in the absence of CRC family history. Analyses of PRS, along with environmental and lifestyle risk factors, might identify younger individuals who would benefit from preventive measures.
- MeSH
- anamnéza MeSH
- celogenomová asociační studie MeSH
- datové soubory jako téma MeSH
- genetická predispozice k nemoci * MeSH
- genotypizační techniky MeSH
- jednonukleotidový polymorfismus MeSH
- kohortové studie MeSH
- kolorektální nádory genetika MeSH
- lidé středního věku MeSH
- lidé MeSH
- mutační analýza DNA MeSH
- mutační rychlost MeSH
- rizikové faktory MeSH
- sekvenování celého genomu MeSH
- studie případů a kontrol MeSH
- věk při počátku nemoci MeSH
- životní styl MeSH
- Check Tag
- lidé středního věku MeSH
- lidé MeSH
- mužské pohlaví MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
- multicentrická studie MeSH
- pozorovací studie MeSH
- práce podpořená grantem MeSH
- Research Support, N.I.H., Extramural MeSH
- Research Support, N.I.H., Intramural MeSH
- Research Support, U.S. Gov't, P.H.S. MeSH
Accurate colorectal cancer (CRC) risk prediction models are critical for identifying individuals at low and high risk of developing CRC, as they can then be offered targeted screening and interventions to address their risks of developing disease (if they are in a high-risk group) and avoid unnecessary screening and interventions (if they are in a low-risk group). As it is likely that thousands of genetic variants contribute to CRC risk, it is clinically important to investigate whether these genetic variants can be used jointly for CRC risk prediction. In this paper, we derived and compared different approaches to generating predictive polygenic risk scores (PRS) from genome-wide association studies (GWASs) including 55,105 CRC-affected case subjects and 65,079 control subjects of European ancestry. We built the PRS in three ways, using (1) 140 previously identified and validated CRC loci; (2) SNP selection based on linkage disequilibrium (LD) clumping followed by machine-learning approaches; and (3) LDpred, a Bayesian approach for genome-wide risk prediction. We tested the PRS in an independent cohort of 101,987 individuals with 1,699 CRC-affected case subjects. The discriminatory accuracy, calculated by the age- and sex-adjusted area under the receiver operating characteristics curve (AUC), was highest for the LDpred-derived PRS (AUC = 0.654) including nearly 1.2 M genetic variants (the proportion of causal genetic variants for CRC assumed to be 0.003), whereas the PRS of the 140 known variants identified from GWASs had the lowest AUC (AUC = 0.629). Based on the LDpred-derived PRS, we are able to identify 30% of individuals without a family history as having risk for CRC similar to those with a family history of CRC, whereas the PRS based on known GWAS variants identified only top 10% as having a similar relative risk. About 90% of these individuals have no family history and would have been considered average risk under current screening guidelines, but might benefit from earlier screening. The developed PRS offers a way for risk-stratified CRC screening and other targeted interventions.
- MeSH
- Asijci genetika MeSH
- Bayesova věta MeSH
- celogenomová asociační studie MeSH
- genetická predispozice k nemoci * MeSH
- genom lidský genetika MeSH
- hodnocení rizik * MeSH
- jednonukleotidový polymorfismus genetika MeSH
- kolorektální nádory epidemiologie genetika patologie MeSH
- lidé středního věku MeSH
- lidé MeSH
- multifaktoriální dědičnost genetika MeSH
- rizikové faktory MeSH
- senioři MeSH
- Check Tag
- lidé středního věku MeSH
- lidé MeSH
- mužské pohlaví MeSH
- senioři MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH