Most cited article - PubMed ID 28604730
Large-scale association analysis identifies new lung cancer susceptibility loci and heterogeneity in genetic susceptibility across histological subtypes
Alternative polyadenylation (APA) modulates mRNA processing in the 3'-untranslated regions (3' UTR), affecting mRNA stability and translation efficiency. Research into genetically regulated APA has the potential to provide insights into cancer risk. In this study, we conducted large APA-wide association studies to investigate associations between APA levels and cancer risk. Genetic models were built to predict APA levels in multiple tissues using genotype and RNA sequencing data from 1,337 samples from the Genotype-Tissue Expression project. Associations of genetically predicted APA levels with cancer risk were assessed by applying the prediction models to data from large genome-wide association studies of six common cancers among European ancestry populations: breast, ovarian, prostate, colorectal, lung, and pancreatic cancers. A total of 58 risk genes (corresponding to 76 APA sites) were associated with at least one type of cancer, including 25 genes previously not linked to cancer susceptibility. Of the identified risk APAs, 97.4% and 26.3% were supported by 3'-UTR APA quantitative trait loci and colocalization analyses, respectively. Luciferase reporter assays for four selected putative regulatory 3'-UTR variants demonstrated that the risk alleles of 3'-UTR variants, rs324015 (STAT6), rs2280503 (DIP2B), rs1128450 (FBXO38), and rs145220637 (LDHA), significantly increased the posttranscriptional activities of their target genes compared with reference alleles. Furthermore, knockdown of the target genes confirmed their ability to promote proliferation and migration. Overall, this study provides insights into the role of APA in the genetic susceptibility to common cancers. Significance: Systematic evaluation of associations of alternative polyadenylation with cancer risk reveals 58 putative susceptibility genes, highlighting the contribution of genetically regulated alternative polyadenylation of 3'UTRs to genetic susceptibility to cancer.
- MeSH
- 3' Untranslated Regions * genetics MeSH
- Genome-Wide Association Study * MeSH
- Genetic Predisposition to Disease * MeSH
- Polymorphism, Single Nucleotide MeSH
- Humans MeSH
- Quantitative Trait Loci MeSH
- RNA, Messenger genetics metabolism MeSH
- Cell Line, Tumor MeSH
- Neoplasms * genetics MeSH
- Polyadenylation * MeSH
- Gene Expression Regulation, Neoplastic MeSH
- Check Tag
- Humans MeSH
- Male MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- 3' Untranslated Regions * MeSH
- RNA, Messenger MeSH
BACKGROUND: Germline genetic variation contributes to lung cancer (LC) susceptibility. Previous genome-wide association studies (GWAS) have implicated susceptibility loci involved in smoking behaviors and DNA repair genes, but further work is required to identify susceptibility variants. METHODS: To identify LC susceptibility loci, a family history-based genome-wide association by proxy (GWAx) of LC (48 843 European proxy LC patients, 195 387 controls) was combined with a previous LC GWAS (29 266 patients, 56 450 controls) by meta-analysis. Colocalization was used to explore candidate genes and overlap with existing traits at discovered susceptibility loci. Polygenic risk scores (PRS) were tested within an independent validation cohort (1 666 LC patients vs 6 664 controls) using variants selected from the LC susceptibility loci and a novel selection approach using published GWAS summary statistics. Finally, the effects of the LC PRS on somatic mutational burden were explored in patients whose tumor resections have been profiled by exome (n = 685) and genome sequencing (n = 61). Statistical tests were 2-sided. RESULTS: The GWAx-GWAS meta-analysis identified 8 novel LC loci. Colocalization implicated DNA repair genes (CHEK1), metabolic genes (CYP1A1), and smoking propensity genes (CHRNA4 and CHRNB2). PRS analysis demonstrated that these variants, as well as subgenome-wide significant variants related to expression quantitative trait loci and/or smoking propensity, assisted in LC genetic risk prediction (odds ratio = 1.37, 95% confidence interval = 1.29 to 1.45; P < .001). Patients with higher genetic PRS loads of smoking-related variants tended to have higher mutation burdens in their lung tumors. CONCLUSIONS: This study has expanded the number of LC susceptibility loci and provided insights into the molecular mechanisms by which these susceptibility variants contribute to LC development.
- MeSH
- Genome-Wide Association Study * MeSH
- Genetic Predisposition to Disease MeSH
- Polymorphism, Single Nucleotide MeSH
- Humans MeSH
- Mutation MeSH
- Lung Neoplasms * epidemiology genetics pathology MeSH
- Germ Cells pathology MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Meta-Analysis MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH
BACKGROUND: Observational studies indicate that periodontal disease may increase the risk of colorectal, lung, and pancreatic cancers. Using a 2-sample Mendelian randomization (MR) analysis, we assessed whether a genetic predisposition index for periodontal disease was associated with colorectal, lung, or pancreatic cancer risks. METHODS: Our primary instrument included single nucleotide polymorphisms with strong genome-wide association study evidence for associations with chronic, aggressive, and/or severe periodontal disease (rs729876, rs1537415, rs2738058, rs12461706, rs16870060, rs2521634, rs3826782, and rs7762544). We used summary-level genetic data for colorectal cancer (n = 58 131 cases; Genetics and Epidemiology of Colorectal Cancer Consortium, Colon Cancer Family Registry, and Colorectal Transdisciplinary Study), lung cancer (n = 18 082 cases; International Lung Cancer Consortium), and pancreatic cancer (n = 9254 cases; Pancreatic Cancer Consortia). Four MR approaches were employed for this analysis: random-effects inverse-variance weighted (primary analyses), Mendelian Randomization-Pleiotropy RESidual Sum and Outlier, simple median, and weighted median. We conducted secondary analyses to determine if associations varied by cancer subtype (colorectal cancer location, lung cancer histology), sex (colorectal and pancreatic cancers), or smoking history (lung and pancreatic cancer). All statistical tests were 2-sided. RESULTS: The genetic predisposition index for chronic or aggressive periodontitis was statistically significantly associated with a 3% increased risk of colorectal cancer (per unit increase in genetic index of periodontal disease; P = .03), 3% increased risk of colon cancer (P = .02), 4% increased risk of proximal colon cancer (P = .01), and 3% increased risk of colorectal cancer among females (P = .04); however, it was not statistically significantly associated with the risk of lung cancer or pancreatic cancer, overall or within most subgroups. CONCLUSIONS: Genetic predisposition to periodontitis may be associated with colorectal cancer risk. Further research should determine whether increased periodontitis prevention and increased cancer surveillance of patients with periodontitis is warranted.
- MeSH
- Genome-Wide Association Study MeSH
- Chronic Disease MeSH
- Genetic Predisposition to Disease * MeSH
- Polymorphism, Single Nucleotide MeSH
- Colorectal Neoplasms epidemiology genetics MeSH
- Smoking MeSH
- Humans MeSH
- Mendelian Randomization Analysis methods MeSH
- Lung Neoplasms epidemiology genetics pathology MeSH
- Rectal Neoplasms epidemiology genetics MeSH
- Pancreatic Neoplasms epidemiology genetics MeSH
- Colonic Neoplasms epidemiology genetics MeSH
- Periodontal Diseases genetics MeSH
- Risk Factors MeSH
- Sex Factors MeSH
- Check Tag
- Humans MeSH
- Male MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH
Recent studies suggest that rare variants exhibit stronger effect sizes and might play a crucial role in the etiology of lung cancers (LC). Whole exome plus targeted sequencing of germline DNA was performed on 1045 LC cases and 885 controls in the discovery set. To unveil the inherited causal variants, we focused on rare and predicted deleterious variants and small indels enriched in cases or controls. Promising candidates were further validated in a series of 26,803 LCs and 555,107 controls. During discovery, we identified 25 rare deleterious variants associated with LC susceptibility, including 13 reported in ClinVar. Of the five validated candidates, we discovered two pathogenic variants in known LC susceptibility loci, ATM p.V2716A (Odds Ratio [OR] 19.55, 95%CI 5.04-75.6) and MPZL2 p.I24M frameshift deletion (OR 3.88, 95%CI 1.71-8.8); and three in novel LC susceptibility genes, POMC c.*28delT at 3' UTR (OR 4.33, 95%CI 2.03-9.24), STAU2 p.N364M frameshift deletion (OR 4.48, 95%CI 1.73-11.55), and MLNR p.Q334V frameshift deletion (OR 2.69, 95%CI 1.33-5.43). The potential cancer-promoting role of selected candidate genes and variants was further supported by endogenous DNA damage assays. Our analyses led to the identification of new rare deleterious variants with LC susceptibility. However, in-depth mechanistic studies are still needed to evaluate the pathogenic effects of these specific alleles.
- Publication type
- Journal Article MeSH
Few germline mutations are known to affect lung cancer risk. We performed analyses of rare variants from 39,146 individuals of European ancestry and investigated gene expression levels in 7,773 samples. We find a large-effect association with an ATM L2307F (rs56009889) mutation in adenocarcinoma for discovery (adjusted Odds Ratio = 8.82, P = 1.18 × 10-15) and replication (adjusted OR = 2.93, P = 2.22 × 10-3) that is more pronounced in females (adjusted OR = 6.81 and 3.19 and for discovery and replication). We observe an excess loss of heterozygosity in lung tumors among ATM L2307F allele carriers. L2307F is more frequent (4%) among Ashkenazi Jewish populations. We also observe an association in discovery (adjusted OR = 2.61, P = 7.98 × 10-22) and replication datasets (adjusted OR = 1.55, P = 0.06) with a loss-of-function mutation, Q4X (rs150665432) of an uncharacterized gene, KIAA0930. Our findings implicate germline genetic variants in ATM with lung cancer susceptibility and suggest KIAA0930 as a novel candidate gene for lung cancer risk.
- MeSH
- Adenocarcinoma genetics MeSH
- Alleles MeSH
- Ataxia Telangiectasia Mutated Proteins genetics MeSH
- White People genetics MeSH
- Databases, Genetic MeSH
- Genetic Predisposition to Disease MeSH
- Genotyping Techniques MeSH
- Heterozygote MeSH
- Middle Aged MeSH
- Humans MeSH
- Mutation, Missense MeSH
- Lung Neoplasms genetics MeSH
- Odds Ratio MeSH
- Risk Factors MeSH
- Pedigree MeSH
- Oligonucleotide Array Sequence Analysis MeSH
- RNA-Seq MeSH
- Aged MeSH
- Germ-Line Mutation MeSH
- Jews genetics MeSH
- Check Tag
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH
- Names of Substances
- ATM protein, human MeSH Browser
- Ataxia Telangiectasia Mutated Proteins MeSH
BACKGROUND: Evidence from observational studies of telomere length (TL) has been conflicting regarding its direction of association with cancer risk. We investigated the causal relevance of TL for lung and head and neck cancers using Mendelian Randomization (MR) and mediation analyses. METHODS: We developed a novel genetic instrument for TL in chromosome 5p15.33, using variants identified through deep-sequencing, that were genotyped in 2051 cancer-free subjects. Next, we conducted an MR analysis of lung (16 396 cases, 13 013 controls) and head and neck cancer (4415 cases, 5013 controls) using eight genetic instruments for TL. Lastly, the 5p15.33 instrument and distinct 5p15.33 lung cancer risk loci were evaluated using two-sample mediation analysis, to quantify their direct and indirect, telomere-mediated, effects. RESULTS: The multi-allelic 5p15.33 instrument explained 1.49-2.00% of TL variation in our data (p = 2.6 × 10-9). The MR analysis estimated that a 1000 base-pair increase in TL increases risk of lung cancer [odds ratio (OR) = 1.41, 95% confidence interval (CI): 1.20-1.65] and lung adenocarcinoma (OR = 1.92, 95% CI: 1.51-2.22), but not squamous lung carcinoma (OR = 1.04, 95% CI: 0.83-1.29) or head and neck cancers (OR = 0.90, 95% CI: 0.70-1.05). Mediation analysis of the 5p15.33 instrument indicated an absence of direct effects on lung cancer risk (OR = 1.00, 95% CI: 0.95-1.04). Analysis of distinct 5p15.33 susceptibility variants estimated that TL mediates up to 40% of the observed associations with lung cancer risk. CONCLUSIONS: Our findings support a causal role for long telomeres in lung cancer aetiology, particularly for adenocarcinoma, and demonstrate that telomere maintenance partially mediates the lung cancer susceptibility conferred by 5p15.33 loci.
- Keywords
- Mendelian Randomization, TERT, chromosome 5p15.33, lung cancer, mediation analysis, telomere length,
- MeSH
- Adenocarcinoma of Lung epidemiology MeSH
- Squamous Cell Carcinoma of Head and Neck epidemiology MeSH
- Telomere Homeostasis genetics MeSH
- Leukocytes metabolism MeSH
- Middle Aged MeSH
- Humans MeSH
- Chromosomes, Human, Pair 5 genetics MeSH
- Mendelian Randomization Analysis MeSH
- Head and Neck Neoplasms epidemiology MeSH
- Lung Neoplasms epidemiology MeSH
- Aged, 80 and over MeSH
- Aged MeSH
- Carcinoma, Squamous Cell epidemiology MeSH
- Telomere metabolism MeSH
- Check Tag
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Aged, 80 and over MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH
DNase I hypersensitive sites (DHS) are abundant in regulatory elements, such as promoter, enhancer and transcription factor binding sites. Many studies have revealed that disease-associated variants were concentrated in DHS-related regions. However, limited studies are available on the roles of DHS-related variants in lung cancer. In this study, we performed a large-scale case-control study with 20 871 lung cancer cases and 15 971 controls to evaluate the associations between regulatory genetic variants in DHS and lung cancer susceptibility. The expression quantitative trait loci (eQTL) analysis and pathway-enrichment analysis were performed to identify the possible target genes and pathways. In addition, we performed motif-based analysis to explore the lung-cancer-related motifs using sequence kernel association test. Two novel variants, rs186332 in 20q13.3 (C>T, odds ratio [OR] = 1.17, 95% confidence interval [95% CI]: 1.10-1.24, P = 8.45 × 10-7) and rs4839323 in 1p13.2 (T>C, OR = 0.92, 95% CI: 0.89-0.95, P = 1.02 × 10-6) showed significant association with lung cancer risk. The eQTL analysis suggested that these two SNPs might regulate the expression of MRGBP and SLC16A1, respectively. What's more, the expression of both MRGBP and SLC16A1 was aberrantly elevated in lung tumor tissues. The motif-based analysis identified 10 motifs related to the risk of lung cancer (P < 1.71 × 10-4). Our findings suggested that variants in DHS might modify lung cancer susceptibility through regulating the expression of surrounding genes. This study provided us a deeper insight into the roles of DHS-related genetic variants for lung cancer.
- MeSH
- Deoxyribonuclease I metabolism MeSH
- Genetic Predisposition to Disease * MeSH
- Polymorphism, Single Nucleotide MeSH
- Middle Aged MeSH
- Humans MeSH
- Quantitative Trait Loci MeSH
- Lung Neoplasms genetics MeSH
- Aged MeSH
- Case-Control Studies MeSH
- Check Tag
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Deoxyribonuclease I MeSH
The development of cancer is driven by the accumulation of many oncogenesis-related genetic alterations and tumorigenesis is triggered by complex networks of involved genes rather than independent actions. To explore the epistasis existing among oncogenesis-related genes in lung cancer development, we conducted pairwise genetic interaction analyses among 35,031 SNPs from 2027 oncogenesis-related genes. The genotypes from three independent genome-wide association studies including a total of 24,037 lung cancer patients and 20,401 healthy controls with Caucasian ancestry were analyzed in the study. Using a two-stage study design including discovery and replication studies, and stringent Bonferroni correction for multiple statistical analysis, we identified significant genetic interactions between SNPs in RGL1:RAD51B (OR=0.44, p value=3.27x10-11 in overall lung cancer and OR=0.41, p value=9.71x10-11 in non-small cell lung cancer), SYNE1:RNF43 (OR=0.73, p value=1.01x10-12 in adenocarcinoma) and FHIT:TSPAN8 (OR=1.82, p value=7.62x10-11 in squamous cell carcinoma) in our analysis. None of these genes have been identified from previous main effect association studies in lung cancer. Further eQTL gene expression analysis in lung tissues provided information supporting the functional role of the identified epistasis in lung tumorigenesis. Gene set enrichment analysis revealed potential pathways and gene networks underlying molecular mechanisms in overall lung cancer as well as histology subtypes development. Our results provide evidence that genetic interactions between oncogenesis-related genes play an important role in lung tumorigenesis and epistasis analysis, combined with functional annotation, provides a valuable tool for uncovering functional novel susceptibility genes that contribute to lung cancer development by interacting with other modifier genes.
- Keywords
- epistasis, functional annotation, lung cancer, oncogenesis,
- Publication type
- Journal Article MeSH
Lung cancer has several genetic associations identified within the major histocompatibility complex (MHC); although the basis for these associations remains elusive. Here, we analyze MHC genetic variation among 26,044 lung cancer patients and 20,836 controls densely genotyped across the MHC, using the Illumina Illumina OncoArray or Illumina 660W SNP microarray. We impute sequence variation in classical HLA genes, fine-map MHC associations for lung cancer risk with major histologies and compare results between ethnicities. Independent and novel associations within HLA genes are identified in Europeans including amino acids in the HLA-B*0801 peptide binding groove and an independent HLA-DQB1*06 loci group. In Asians, associations are driven by two independent HLA allele sets that both increase risk in HLA-DQB1*0401 and HLA-DRB1*0701; the latter better represented by the amino acid Ala-104. These results implicate several HLA-tumor peptide interactions as the major MHC factor modulating lung cancer susceptibility.
- MeSH
- Asian People genetics MeSH
- White People genetics MeSH
- Gene Frequency MeSH
- Genetic Predisposition to Disease ethnology genetics MeSH
- Genotype MeSH
- HLA Antigens genetics MeSH
- Major Histocompatibility Complex genetics MeSH
- Polymorphism, Single Nucleotide MeSH
- Middle Aged MeSH
- Humans MeSH
- Chromosome Mapping * MeSH
- Lung Neoplasms ethnology genetics MeSH
- Peptides genetics MeSH
- Check Tag
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH
- Names of Substances
- HLA Antigens MeSH
- Peptides MeSH
Genome-wide association studies (GWAS) identified the chromosome 15q25.1 locus as a leading susceptibility region for lung cancer. However, the pathogenic pathways, through which susceptibility SNPs within chromosome 15q25.1 affects lung cancer risk, have not been explored. We analyzed three cohorts with GWAS data consisting 42,901 individuals and lung expression quantitative trait loci (eQTL) data on 409 individuals to identify and validate the underlying pathways and to investigate the combined effect of genes from the identified susceptibility pathways. The KEGG neuroactive ligand receptor interaction pathway, two Reactome pathways, and 22 Gene Ontology terms were identified and replicated to be significantly associated with lung cancer risk, with P values less than 0.05 and FDR less than 0.1. Functional annotation of eQTL analysis results showed that the neuroactive ligand receptor interaction pathway and gated channel activity were involved in lung cancer risk. These pathways provide important insights for the etiology of lung cancer.
- MeSH
- Child MeSH
- Adult MeSH
- Genetic Predisposition to Disease * MeSH
- Gene Ontology MeSH
- Gene Regulatory Networks MeSH
- Polymorphism, Single Nucleotide MeSH
- Cohort Studies MeSH
- Infant MeSH
- Smoking adverse effects MeSH
- Middle Aged MeSH
- Humans MeSH
- Chromosomes, Human, Pair 15 genetics MeSH
- Quantitative Trait Loci genetics MeSH
- Adolescent MeSH
- Young Adult MeSH
- Lung Neoplasms genetics MeSH
- Infant, Newborn MeSH
- Child, Preschool MeSH
- Reproducibility of Results MeSH
- Risk Factors MeSH
- Aged MeSH
- Check Tag
- Child MeSH
- Adult MeSH
- Infant MeSH
- Middle Aged MeSH
- Humans MeSH
- Adolescent MeSH
- Young Adult MeSH
- Male MeSH
- Infant, Newborn MeSH
- Child, Preschool MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH