Among alternative splicing events in the human transcriptome, tandem NAGNAG acceptor splice sites represent an appreciable proportion. Both proximal and distal NAG can be used to produce two splicing isoforms differing by three nucleotides. In some cases, the upstream exon can be alternatively spliced as well, which further increases the number of possible transcripts. In this study, we showed that NAG choice in tandem splice site depends considerably not only on the concerned acceptor, but also on the upstream donor splice site sequence. Using an extensive set of experiments with systematically modified two-exonic minigene systems of AFAP1L2 or CSTD gene, we recognized the third and fifth intronic upstream donor splice site position and the tandem acceptor splice site region spanning from -10 to +2, including NAGNAG itself, as the main drivers. In addition, competition between different branch points and their composition were also shown to play a significant role in NAG choice. All these nucleotide effects appeared almost additive, which explained the high variability in proximal versus distal NAG usage.
Acceptor splice site recognition (3' splice site: 3'ss) is a fundamental step in precursor messenger RNA (pre-mRNA) splicing. Generally, the U2 small nuclear ribonucleoprotein (snRNP) auxiliary factor (U2AF) heterodimer recognizes the 3'ss, of which U2AF35 has a dual function: (i) It binds to the intron-exon border of some 3'ss and (ii) mediates enhancer-binding splicing activators' interactions with the spliceosome. Alternative mechanisms for 3'ss recognition have been suggested, yet they are still not thoroughly understood. Here, we analyzed 3'ss recognition where the intron-exon border is bound by a ubiquitous splicing regulator SRSF1. Using the minigene analysis of two model exons and their mutants, BRCA2 exon 12 and VARS2 exon 17, we showed that the exon inclusion correlated much better with the predicted SRSF1 affinity than 3'ss quality, which were assessed using the Catalog of Inferred Sequence Binding Preferences of RNA binding proteins (CISBP-RNA) database and maximum entropy algorithm (MaxEnt) predictor and the U2AF35 consensus matrix, respectively. RNA affinity purification proved SRSF1 binding to the model 3'ss. On the other hand, knockdown experiments revealed that U2AF35 also plays a role in these exons' inclusion. Most probably, both factors stochastically bind the 3'ss, supporting exon recognition, more apparently in VARS2 exon 17. Identifying splicing activators as 3'ss recognition factors is crucial for both a basic understanding of splicing regulation and human genetic diagnostics when assessing variants' effects on splicing.
- MeSH
- alternativní sestřih genetika MeSH
- exony genetika MeSH
- HeLa buňky MeSH
- introny genetika MeSH
- lidé MeSH
- místa sestřihu RNA genetika fyziologie MeSH
- proteiny vázající RNA metabolismus MeSH
- regulační oblasti nukleových kyselin genetika MeSH
- sekvence nukleotidů genetika MeSH
- serin-arginin sestřihové faktory metabolismus MeSH
- sestřih RNA fyziologie MeSH
- sestřihové faktory metabolismus fyziologie MeSH
- sestřihový faktor U2AF metabolismus MeSH
- spliceozomy metabolismus MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
Splicing-affecting mutations can disrupt gene function by altering the transcript assembly. To ascertain splicing dysregulation principles, we modified a minigene assay for the parallel high-throughput evaluation of different mutations by next-generation sequencing. In our model system, all exonic and six intronic positions of the SMN1 gene's exon 7 were mutated to all possible nucleotide variants, which amounted to 180 unique single-nucleotide mutants and 470 double mutants. The mutations resulted in a wide range of splicing aberrations. Exonic splicing-affecting mutations resulted either in substantial exon skipping, supposedly driven by predicted exonic splicing silencer or cryptic donor splice site (5'ss) and de novo 5'ss strengthening and use. On the other hand, a single disruption of exonic splicing enhancer was not sufficient to cause major exon skipping, suggesting these elements can be substituted during exon recognition. While disrupting the acceptor splice site led only to exon skipping, some 5'ss mutations potentiated the use of three different cryptic 5'ss. Generally, single mutations supporting cryptic 5'ss use displayed better pre-mRNA/U1 snRNA duplex stability and increased splicing regulatory element strength across the original 5'ss. Analyzing double mutants supported the predominating splicing regulatory elements' effect, but U1 snRNA binding could contribute to the global balance of splicing isoforms. Based on these findings, we suggest that creating a new splicing enhancer across the mutated 5'ss can be one of the main factors driving cryptic 5'ss use.
- MeSH
- alternativní sestřih * MeSH
- buněčné linie MeSH
- exony * MeSH
- konformace nukleové kyseliny MeSH
- lidé MeSH
- místa sestřihu RNA MeSH
- mutace * MeSH
- mutageneze MeSH
- protein přežití motorických neuronů 1 chemie genetika metabolismus MeSH
- RNA malá jaderná chemie genetika metabolismus MeSH
- simulace molekulární dynamiky MeSH
- vazba proteinů MeSH
- výpočetní biologie metody MeSH
- vysoce účinné nukleotidové sekvenování MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Both variants affecting splice sites and those in splicing regulatory elements (SREs) can impair pre-mRNA splicing, eventually leading to severe diseases. Despite the availability of many prediction tools, prognosis of splicing affection is not trivial, especially when SREs are involved. Here, we present data on 92 in silico-/55 minigene-analysed variants detected in genes responsible for the primary immunodeficiencies development (namely BTK, CD40LG, IL2RG, SERPING1, STAT3, and WAS). Of 20 splicing-affecting variants, 16 affected splice site while 4 disrupted potential SRE. The presence or absence of splicing defects was confirmed in 30 of 32 blood-derived patients' RNAs. Testing prediction tools performance, splice site disruptions and creations were reliably predicted in contrast to SRE-affecting variants for which just ESRseq, ΔHZEI-scores and EX-SKIP predictions showed promising results. Next, we found an interesting pattern in cryptic splice site predictions. These results might help PID-diagnosticians and geneticists cope with potential splicing-affecting variants.
- MeSH
- buňky Hep G2 MeSH
- dítě MeSH
- exony MeSH
- HeLa buňky MeSH
- kojenec MeSH
- komplement C1 - inaktivátory genetika MeSH
- lidé MeSH
- messenger RNA genetika MeSH
- mutace MeSH
- předškolní dítě MeSH
- protein Wiskottova-Aldrichova syndromu genetika MeSH
- receptory interleukinů - společná gama-podjednotka genetika MeSH
- rekombinantní fúzní proteiny genetika MeSH
- sestřih RNA * MeSH
- syndromy imunologické nedostatečnosti genetika MeSH
- transkripční faktor STAT3 genetika MeSH
- tyrosinkinasy genetika MeSH
- U937 buňky MeSH
- Check Tag
- dítě MeSH
- kojenec MeSH
- lidé MeSH
- předškolní dítě MeSH
- Publikační typ
- časopisecké články MeSH
- MeSH
- exony MeSH
- heterozygot MeSH
- jednonukleotidový polymorfismus * MeSH
- kadheriny genetika MeSH
- lidé MeSH
- nádory žaludku genetika MeSH
- rodokmen MeSH
- sestřih RNA * MeSH
- Check Tag
- lidé MeSH
- mužské pohlaví MeSH
- Publikační typ
- dopisy MeSH
- kazuistiky MeSH
- práce podpořená grantem MeSH