Transcriptome sequencing (RNA-seq) is widely used to detect gene rearrangements and quantitate gene expression in acute lymphoblastic leukemia (ALL), but its utility and accuracy in identifying copy number variations (CNVs) have not been well described. CNV information inferred from RNA-seq can guide disease classification and risk stratification in ALL because aneuploid subtypes are common in this disease. Here we describe RNAseqCNV, a method to detect large-scale CNVs from RNA-seq data. We used models based on normalized gene expression and minor allele frequency to classify arm-level CNVs with high accuracy in ALL (99.1% overall and 98.3% for non-diploid chromosome arms), and the models were further validated with excellent performance in acute myeloid leukemia (accuracy 99.8% overall and 99.4% for non-diploid chromosome arms). RNAseqCNV outperforms alternative RNA-seq-based algorithms in calling CNVs in the ALL dataset, especially in samples with a high proportion of CNVs. The CNV calls were highly concordant with DNA-based CNV results and more reliable than conventional cytogenetic-based karyotypes. RNAseqCNV provides a method to robustly identify copy number alterations in the absence of DNA-based analyses, further enhancing the utility of RNA-seq for classifying ALL subtypes.
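The abstract reports two accuracy figures, one over all chromosome arms and one restricted to non-diploid arms. The following is a minimal sketch of how such a pair of metrics could be computed, assuming the non-diploid subset is defined by the reference (DNA-based) calls. The function name, the labels, and the toy data are hypothetical illustrations and are not part of RNAseqCNV or taken from the study.

```python
import math


def arm_accuracy(reference, calls):
    """Return (overall accuracy, accuracy on non-diploid arms).

    reference, calls: dicts mapping a chromosome arm (e.g. "21q") to a
    label such as "diploid", "gain" or "loss". Labels are hypothetical.
    """
    arms = list(reference)
    # Overall accuracy: fraction of arms where the call matches the reference.
    n_correct = sum(calls.get(arm) == reference[arm] for arm in arms)
    overall = n_correct / len(arms)

    # Non-diploid accuracy: same fraction, restricted to arms that the
    # reference labels as anything other than "diploid" (an assumption).
    altered = [arm for arm in arms if reference[arm] != "diploid"]
    if not altered:
        return overall, math.nan
    n_correct_altered = sum(calls.get(arm) == reference[arm] for arm in altered)
    return overall, n_correct_altered / len(altered)


# Toy example with made-up labels for four arms.
reference = {"1p": "diploid", "1q": "gain", "21q": "gain", "9p": "loss"}
calls = {"1p": "diploid", "1q": "gain", "21q": "diploid", "9p": "loss"}
overall, non_diploid = arm_accuracy(reference, calls)
```

Reporting the second figure separately matters because most arms in a typical sample are diploid, so a caller that labels everything diploid could still score a high overall accuracy.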
- MeSH
- Algorithms MeSH
- Karyotyping MeSH
- Humans MeSH
- Transcriptome sequencing MeSH
- DNA Copy Number Variations * genetics MeSH
- High-Throughput Nucleotide Sequencing * methods MeSH
- Check Tag
- Humans MeSH
- Publication Type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH
BACKGROUND: Genome-wide association studies are widely used to map genomic regions contributing to lung cancer (LC) susceptibility, but they typically do not identify the precise disease-causing genes or variants. To identify the inherited genetic variants that cause LC, we performed focused exome-sequencing analyses on genes located in 121 genome-wide association study-identified loci previously implicated in the risk of LC, chronic obstructive pulmonary disease, pulmonary function level, and smoking behavior.
METHODS: Germline DNA from 260 case patients with LC and 318 controls was sequenced using VCRome 2.1 exome capture. Filtering was based on enrichment of rare and potentially deleterious variants in cases (risk alleles) or controls (protective alleles). Allelic association analyses of single variants and gene-based burden tests of multiple variants were performed. Promising candidates were tested in two independent validation studies with a total of 1773 case patients and 1123 controls.
RESULTS: We identified 48 rare variants with deleterious effects in the discovery analysis and validated 12 of the 43 candidates that were covered in the validation platforms. The top validated candidates included one well-established truncating variant, namely, BRCA2 DNA repair associated gene (BRCA2) K3326X (OR = 2.36, 95% confidence interval [CI]: 1.38-3.99), and three newly identified variants, namely, lymphotoxin beta gene (LTB) p.Leu87Phe (OR = 7.52, 95% CI: 1.01-16.56), prolyl 3-hydroxylase 2 gene (P3H2) p.Gln185His (OR = 5.39, 95% CI: 0.75-15.43), and dishevelled associated activator of morphogenesis 2 gene (DAAM2) p.Asp762Gly (OR = 0.25, 95% CI: 0.10-0.79). Burden tests revealed strong associations between LC susceptibility and zinc finger protein 93 gene (ZNF93), DAAM2, bromodomain containing 9 gene (BRD9), and LTB.
CONCLUSION: Our results extend the catalogue of regions associated with LC and highlight the importance of germline rare coding variants in LC susceptibility.
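The results above are reported as odds ratios (OR) with 95% confidence intervals. As a minimal sketch of what such a figure expresses, the example below computes an OR and a Wald (Woolf) interval from a 2x2 carrier table. The abstract does not state which estimation method the authors used, so this is an assumption for illustration only; the function name and the counts are hypothetical and are not data from the study.

```python
import math


def odds_ratio_ci(case_carriers, case_noncarriers,
                  control_carriers, control_noncarriers, z=1.96):
    """Odds ratio with a Wald (Woolf) confidence interval.

    All four counts must be positive. z=1.96 gives an approximate 95% CI.
    """
    a, b = case_carriers, case_noncarriers
    c, d = control_carriers, control_noncarriers
    # Odds of carrying the variant in cases divided by the odds in controls.
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the Woolf approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts: 20 of 100 cases and 10 of 100 controls carry a variant.
or_value, ci_low, ci_high = odds_ratio_ci(20, 80, 10, 90)
```

An OR above 1 with an interval that excludes 1 points to a risk allele, while an OR below 1, as reported for DAAM2 p.Asp762Gly, points to a protective allele. For rare variants with very small counts, exact or regression-based methods are generally preferred over this approximation.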
- MeSH
- Genome-Wide Association Study methods MeSH
- Adult MeSH
- Genetic Variation genetics MeSH
- Middle Aged MeSH
- Humans MeSH
- Lung Neoplasms genetics pathology MeSH
- Risk Factors MeSH
- Aged, 80 and over MeSH
- Aged MeSH
- Check Tag
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Aged, 80 and over MeSH
- Aged MeSH
- Female MeSH
- Publication Type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH
- Research Support, N.I.H., Intramural MeSH
- MeSH
- Phenylalanine genetics MeSH
- Janus Kinase 2 genetics MeSH
- Polymorphism, Single Nucleotide MeSH
- Cohort Studies MeSH
- Cells, Cultured MeSH
- Humans MeSH
- Mutation, Missense * MeSH
- DNA Mutational Analysis MeSH
- Polycythemia Vera genetics pathology MeSH
- Amino Acid Substitution MeSH
- Valine genetics MeSH
- Germ-Line Mutation * MeSH
- Check Tag
- Humans MeSH
- Publication Type
- Letter MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH