Non-coding region
Pervasive transcription is a widespread phenomenon leading to the production of a plethora of non-coding RNAs (ncRNAs) without apparent function. Pervasive transcription poses a threat to proper gene expression that needs to be controlled. In yeast, the highly conserved helicase Sen1 restricts pervasive transcription by inducing termination of non-coding transcription. However, the mechanisms underlying the specific function of Sen1 at ncRNAs are poorly understood. Here, we identify a motif in an intrinsically disordered region of Sen1 that mimics the phosphorylated carboxy-terminal domain (CTD) of RNA polymerase II, and structurally characterize its recognition by the CTD-interacting domain of Nrd1, an RNA-binding protein that binds specific sequences in ncRNAs. In addition, we show that Sen1-dependent termination strictly requires CTD recognition by the N-terminal domain of Sen1. We provide evidence that the Sen1-CTD interaction does not promote initial Sen1 recruitment, but rather enhances Sen1 capacity to induce the release of paused RNAPII from the DNA. Our results shed light on the network of protein-protein interactions that control termination of non-coding transcription by Sen1.
- Keywords
- RNA polymerase II CTD, Sen1 helicase, non-coding transcription, pervasive transcription, transcription termination
- MeSH
- DNA Helicases chemistry metabolism MeSH
- RNA, Fungal metabolism MeSH
- Protein Conformation MeSH
- Models, Molecular MeSH
- RNA, Untranslated metabolism MeSH
- Protein Domains MeSH
- RNA-Binding Proteins chemistry metabolism MeSH
- Gene Expression Regulation, Fungal MeSH
- RNA Helicases chemistry metabolism MeSH
- RNA Polymerase II chemistry MeSH
- Saccharomyces cerevisiae Proteins chemistry metabolism MeSH
- Saccharomyces cerevisiae genetics metabolism MeSH
- Transcription Termination, Genetic MeSH
- Protein Binding MeSH
- Binding Sites MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- DNA Helicases MeSH
- RNA, Fungal MeSH
- RNA, Untranslated MeSH
- NRD1 protein, S cerevisiae MeSH Browser
- RNA-Binding Proteins MeSH
- RNA Helicases MeSH
- RNA Polymerase II MeSH
- Saccharomyces cerevisiae Proteins MeSH
- SEN1 protein, S cerevisiae MeSH Browser
BACKGROUND: The first systematic study of small non-coding RNAs (sRNA, ncRNA) in Streptomyces is presented. With a few exceptions, the Streptomyces sRNAs, as well as the sRNAs in other genera of the Actinomyces group, have remained unstudied. This study was based on sequence conservation in intergenic regions of Streptomyces, localization of transcription termination factors, and genomic arrangement of genes flanking the predicted sRNAs. RESULTS: Thirty-two potential sRNAs in Streptomyces were predicted. Of these, expression of 20 was detected by microarrays and RT-PCR. The prediction was validated by a structure-based computational approach. Two predicted sRNAs were found to be terminated by transcription termination factors different from the Rho-independent terminators. One predicted sRNA was identified computationally with high probability as a Streptomyces 6S RNA. Out of the 32 predicted sRNAs, 24 were found to be structurally dissimilar from known sRNAs. CONCLUSION: Streptomyces is the largest genus of Actinomyces, whose sRNAs have not been studied. The Actinomyces is a group of bacterial species with unique genomes and phenotypes. Therefore, in Actinomyces, new unique bacterial sRNAs may be identified. The sequence and structural dissimilarity of the predicted Streptomyces sRNAs demonstrated by this study serve as the first evidence of the uniqueness of Actinomyces sRNAs.
- MeSH
- Algorithms MeSH
- RNA, Bacterial chemistry genetics MeSH
- Species Specificity MeSH
- Genome, Bacterial MeSH
- DNA, Intergenic MeSH
- Nucleic Acid Conformation MeSH
- Models, Molecular MeSH
- RNA, Untranslated chemistry genetics MeSH
- Reverse Transcriptase Polymerase Chain Reaction MeSH
- Base Sequence MeSH
- Oligonucleotide Array Sequence Analysis MeSH
- Streptomyces coelicolor genetics MeSH
- Streptomyces genetics MeSH
- Terminator Regions, Genetic MeSH
- Computational Biology MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Validation Study MeSH
- Substance names
- RNA, Bacterial MeSH
- DNA, Intergenic MeSH
- RNA, Untranslated MeSH
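The Streptomyces record above describes a prediction that starts from intergenic regions. As a minimal sketch of that first step only, the following Python function lists the gaps between annotated genes. The coordinates, the `min_len` cutoff, and the function name are illustrative assumptions and are not taken from the study; conservation scoring and terminator detection are not modelled here.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # 1-based, inclusive (start, end)


def intergenic_regions(genes: List[Interval], min_len: int = 50) -> List[Interval]:
    """Return gaps between annotated genes that are at least min_len long.

    Overlapping genes are handled by tracking the furthest gene end seen so far.
    The region before the first gene is ignored.
    """
    regions: List[Interval] = []
    furthest_end = 0  # 0 means "no gene seen yet"
    for start, end in sorted(genes):
        gap_start, gap_end = furthest_end + 1, start - 1
        if furthest_end > 0 and gap_end - gap_start + 1 >= min_len:
            regions.append((gap_start, gap_end))
        furthest_end = max(furthest_end, end)
    return regions


# Hypothetical gene coordinates, for illustration only.
example_genes = [(100, 400), (350, 600), (900, 1200), (1230, 1500)]
candidates = intergenic_regions(example_genes, min_len=50)
print(candidates)
```

In a real analysis, each candidate region would then be checked for sequence conservation and terminator signals, as the abstract outlines.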
The short stature homeobox-containing (SHOX) is the most frequently analysed gene in patients with idiopathic short stature (ISS) or diagnosed with Leri-Weill dyschondrosteosis (LWD), Langer mesomelic dysplasia (LMD), or Madelung deformity (MD). However, clinical testing of this gene focuses primarily on single nucleotide variants (SNV) in its coding sequences and copy number variants (CNV) overlapping the SHOX gene. This review summarizes the clinical impact of variants in noncoding regions of SHOX. RECENT FINDINGS: CNV extending exclusively into the regulatory elements (i.e., not interrupting the coding sequence) are found more frequently in downstream regulatory elements of SHOX. Further, duplications are more frequent than deletions. Interestingly, downstream duplications are more common than deletions in patients with ISS or LWD but no such differences exist for upstream CNV. Moreover, the presence of specific CNVs in the patient population suggests the involvement of additional unknown factors. Some of its intronic variants, notably NM_000451.3(SHOX):c.-9delG and c.-65C>A in the 5'UTR, have unclear clinical roles. However, these intronic SNV may increase the probability that other CNV will arise de novo in the SHOX gene based on homologous recombination or incorrect splicing of mRNA. SUMMARY: This review highlights the clinical impact of noncoding changes in the SHOX gene and the need to apply new technologies and genotype-phenotype correlation in their analysis.
- Keywords
- CNV, ISS, LWD, Non-coding region, SHOX, SNV
- MeSH
- Phenotype MeSH
- Genetic Variation * MeSH
- Haploinsufficiency genetics MeSH
- DNA, Intergenic genetics MeSH
- Humans MeSH
- Short Stature Homeobox Protein genetics MeSH
- Gene Expression Regulation MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Review MeSH
- Substance names
- DNA, Intergenic MeSH
- Short Stature Homeobox Protein MeSH
Cytoplasmic male sterility (CMS) is a widespread phenomenon in flowering plants caused by mitochondrial (mt) genes. CMS genes typically encode novel proteins that interfere with mt functions and can be silenced by nuclear fertility-restorer genes. Although the molecular basis of CMS is well established in a number of crop systems, our understanding of it in natural populations is far more limited. To identify CMS genes in a gynodioecious plant, Silene vulgaris, we constructed mt transcriptomes and compared transcript levels and RNA editing patterns in floral bud tissue from female and hermaphrodite full siblings. The transcriptomes from female and hermaphrodite individuals were very similar overall with respect to variation in levels of transcript abundance across the genome, the extent of RNA editing, and the order in which RNA editing and intron splicing events occurred. We found only a single genomic region that was highly overexpressed and differentially edited in females relative to hermaphrodites. This region is not located near any other transcribed elements and lacks an open-reading frame (ORF) of even moderate size. To our knowledge, this transcript would represent the first non-coding mt RNA associated with CMS in plants and is, therefore, an important target for future functional validation studies.
- Keywords
- Cytoplasmic male sterility, Silene vulgaris, editing, mitochondrion, non-coding RNA, splicing, transcriptome
- MeSH
- RNA Editing MeSH
- Flowers genetics growth & development MeSH
- Genes, Mitochondrial * MeSH
- RNA, Untranslated * MeSH
- Plant Infertility * MeSH
- Plant Proteins genetics metabolism MeSH
- Silene genetics physiology MeSH
- Transcriptome * MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH
- Substance names
- RNA, Untranslated * MeSH
- Plant Proteins MeSH
Multiple myeloma (MM) is a plasma cell malignancy whereby a single clone of plasma cells over-propagates in the bone marrow, resulting in the increased production of monoclonal immunoglobulin. While the complex genetic architecture of MM is well characterized, much less is known about germline variants predisposing to MM. Genome-wide sequencing approaches in MM families have started to identify rare high-penetrance coding risk alleles. In addition, genome-wide association studies have discovered several common low-penetrance risk alleles, which are mainly located in the non-coding genome. Here, we further explored the genetic basis in familial MM within the non-coding genome in whole-genome sequencing data. We prioritized and characterized 150 upstream, 5' untranslated region (UTR) and 3' UTR variants from 14 MM families, including 20 top-scoring variants. These variants confirmed previously implicated biological pathways in MM development. Most importantly, protein network and pathway enrichment analyses also identified 10 genes involved in mitogen-activated protein kinase (MAPK) signaling pathways, which have previously been established as important MM pathways.
- Keywords
- MAPK pathway, familial multiple myeloma, non-coding genome, whole-genome sequencing
- MeSH
- Genome-Wide Association Study * MeSH
- Humans MeSH
- MAP Kinase Signaling System MeSH
- Multiple Myeloma * genetics MeSH
- Whole Genome Sequencing MeSH
- Germ-Line Mutation MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Colorectal cancer (CRC) is a complex disease that develops as a consequence of both genetic and environmental risk factors in interplay with epigenetic mechanisms, such as microRNAs (miRNAs). CRC cases are predominantly sporadic in which the disease develops with no apparent hereditary syndrome. The last decade has seen the progress of genome-wide association studies (GWAS) that allowed the discovery of several genetic regions and variants associated with weak effects on sporadic CRC. Collectively these variants may enable a more accurate prediction of an individual's risk of the disease and its prognosis. However, the number of variants contributing to CRC is still not fully explored. SNPs in genes encoding the miRNA sequence or in 3'UTR regions of the corresponding binding sites may affect miRNA transcription, miRNA processing, and/or the fidelity of the miRNA-mRNA interaction. These variants could plausibly impact miRNA expression and target mRNA translation into proteins critical for cellular integrity, differentiation, and proliferation. In the present chapter, we describe the different aspects of variations related to miRNAs and other non-coding RNAs (ncRNAs) and evidence from studies investigating these candidate genetic alterations in support of their role in CRC development and progression.
- Keywords
- Colorectal cancer, Polymorphism, Risk factors, SNP, miRNA target site, miRSNP
- MeSH
- 3' Untranslated Regions genetics MeSH
- Genome-Wide Association Study MeSH
- Epigenesis, Genetic MeSH
- Genetic Predisposition to Disease MeSH
- Gene-Environment Interaction MeSH
- Polymorphism, Single Nucleotide * MeSH
- Cocarcinogenesis MeSH
- Colorectal Neoplasms diagnosis epidemiology genetics MeSH
- Humans MeSH
- MicroRNAs genetics MeSH
- RNA, Untranslated genetics MeSH
- Polyadenylation MeSH
- Prognosis MeSH
- Disease Progression MeSH
- Gene Expression Regulation, Neoplastic genetics MeSH
- Risk Factors MeSH
- RNA, Neoplasm genetics MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Review MeSH
- Substance names
- 3' Untranslated Regions MeSH
- MicroRNAs MeSH
- RNA, Untranslated MeSH
- RNA, Neoplasm MeSH
Ixodes ricinus ticks are distributed across Europe and are a vector of tick-borne diseases. Although I. ricinus transcriptome studies have focused exclusively on protein coding genes, the last decade witnessed a strong increase in long non-coding RNA (lncRNA) research and characterization. Here, we report for the first time an exhaustive analysis of these non-coding molecules in I. ricinus based on 131 RNA-seq datasets from three different BioProjects. Using these data, we obtained a consensus set of lncRNAs and showed that lncRNA expression is stable among different studies. While the length distribution of lncRNAs from the individual data sets is biased toward short length values, implying the existence of technical artefacts, the consensus lncRNAs show a more homogeneous distribution, emphasizing the importance of incorporating data from different sources to generate a solid reference set of lncRNAs. KEGG enrichment analysis of host miRNAs putatively targeting lncRNAs upregulated upon feeding showed that these miRNAs are involved in several relevant functions for the tick-host interaction. The possibility that at least some tick lncRNAs act as host miRNA sponges was further explored by identifying lncRNAs with many target regions for a given host miRNA or sets of host miRNAs that consistently target lncRNAs together. Overall, our findings suggest that lncRNAs that may act as sponges have diverse biological roles related to the tick-host interaction in different tissues.
- Keywords
- Ixodes ricinus, RNA-sequencing, ectoparasite-host interactions, host immunity, lncRNA
- MeSH
- Ixodes * genetics MeSH
- MicroRNAs * genetics MeSH
- Tick-Borne Diseases * MeSH
- RNA, Long Noncoding * genetics MeSH
- Computational Biology MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Substance names
- MicroRNAs * MeSH
- RNA, Long Noncoding * MeSH
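The Ixodes ricinus record above mentions identifying lncRNAs with many target regions for a given host miRNA. The abstract does not state how target regions were predicted, so the Python sketch below substitutes a deliberately simple stand-in: counting perfect matches to the reverse complement of miRNA nucleotides 2-8 (the seed). The sequences, the `min_sites` threshold, and all function names are illustrative assumptions.

```python
from typing import Dict

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Reverse complement of an RNA string over the alphabet A, C, G, U."""
    return "".join(COMPLEMENT[base] for base in reversed(rna.upper()))


def count_seed_sites(lncrna: str, mirna: str) -> int:
    """Count (possibly overlapping) perfect seed-match sites in a transcript.

    The seed is taken as miRNA nucleotides 2-8; this is a simplification,
    not the prediction method used in the study.
    """
    site = reverse_complement(mirna[1:8])
    transcript = lncrna.upper()
    count, pos = 0, transcript.find(site)
    while pos != -1:
        count += 1
        pos = transcript.find(site, pos + 1)
    return count


def sponge_candidates(lncrnas: Dict[str, str], mirna: str, min_sites: int = 2) -> Dict[str, int]:
    """Return lncRNAs carrying at least min_sites seed matches for one miRNA."""
    counts = {name: count_seed_sites(seq, mirna) for name, seq in lncrnas.items()}
    return {name: n for name, n in counts.items() if n >= min_sites}


# Toy inputs, for illustration only.
toy_mirna = "UGAGGUAGUAGGUUGUAUAGUU"
toy_lncrnas = {"lnc_A": "AAACUACCUCGGGCUACCUCUUU", "lnc_B": "AAAGGGCCC"}
print(sponge_candidates(toy_lncrnas, toy_mirna))
```

Dedicated target-prediction tools also weigh binding energy and site context, so a count like this is only a first filter.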
Non-coding RNAs (ncRNAs) are regulatory molecules encoded in the intergenic or intragenic regions of the genome. In prokaryotes, biocomputational identification of homologs of known ncRNAs in other species often fails due to weakly evolutionarily conserved sequences, structures, synteny and genome localization, except in the case of evolutionarily closely related species. To eliminate results from weak conservation, we focused on RNA structure, which is the most conserved ncRNA property. Analysis of the structure of one of the few well-studied bacterial ncRNAs, 6S RNA, demonstrated that unlike optimal and consensus structures, suboptimal structures are capable of capturing RNA homology even in divergent bacterial species. A computational procedure for the identification of homologous ncRNAs using suboptimal structures was created. The suggested procedure was applied to strongly divergent bacterial species and was capable of identifying homologous ncRNAs.
- MeSH
- RNA, Bacterial chemistry MeSH
- Nucleic Acid Conformation MeSH
- Molecular Sequence Data MeSH
- Mycobacterium genetics MeSH
- RNA, Untranslated chemistry MeSH
- Base Sequence MeSH
- Sequence Homology, Nucleic Acid MeSH
- Streptomyces genetics MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- 6S RNA MeSH Browser
- RNA, Bacterial MeSH
- RNA, Untranslated MeSH
Genomic regions that encode small RNA genes exhibit characteristic patterns in their sequence, secondary structure, and evolutionary conservation. Convolutional Neural Networks are a family of algorithms that can classify data based on learned patterns. Here we present MuStARD, an application of Convolutional Neural Networks that can learn patterns associated with user-defined sets of genomic regions, and scan large genomic areas for novel regions exhibiting similar characteristics. We demonstrate that MuStARD is a generic method that can be trained on different classes of human small RNA genomic loci, without need for domain specific knowledge, due to the automated feature and background selection processes built into the model. We also demonstrate the ability of MuStARD for inter-species identification of functional elements by predicting mouse small RNAs (pre-miRNAs and snoRNAs) using models trained on the human genome. MuStARD can be used to filter small RNA-Seq datasets for identification of novel small RNA loci, intra- and inter-species, as demonstrated in three use cases of human, mouse, and fly pre-miRNA prediction. MuStARD is easy to deploy and extend to a variety of genomic classification questions. Code and trained models are freely available at gitlab.com/RBP_Bioinformatics/mustard.
- MeSH
- Algorithms MeSH
- Genomics methods MeSH
- Humans MeSH
- RNA, Small Nucleolar genetics MeSH
- MicroRNAs genetics MeSH
- Mice MeSH
- RNA, Untranslated genetics MeSH
- Neural Networks, Computer MeSH
- Software MeSH
- Computational Biology methods MeSH
- Animals MeSH
- Check Tag
- Humans MeSH
- Mice MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- RNA, Small Nucleolar MeSH
- MicroRNAs MeSH
- RNA, Untranslated MeSH
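The MuStARD record above describes scanning large genomic areas with a trained classifier. The Python sketch below shows only the generic scanning pattern: one-hot encode each fixed-size window and keep the windows whose score passes a threshold. It does not use MuStARD's code or interface. The `gc_fraction` scorer is a placeholder standing in for a trained network, and the window size, step, and threshold are arbitrary assumptions.

```python
from typing import Callable, List, Tuple

ALPHABET = "ACGT"
Encoded = List[List[float]]


def one_hot(seq: str) -> Encoded:
    """Encode a DNA string as one row per base with columns A, C, G, T."""
    return [[1.0 if base == letter else 0.0 for letter in ALPHABET] for base in seq.upper()]


def scan(sequence: str, window: int, step: int,
         score_fn: Callable[[Encoded], float], threshold: float) -> List[Tuple[int, int, float]]:
    """Slide a window over the sequence and keep windows scoring >= threshold.

    Returns (start, end, score) tuples with 0-based, end-exclusive coordinates.
    """
    hits = []
    for start in range(0, len(sequence) - window + 1, step):
        score = score_fn(one_hot(sequence[start:start + window]))
        if score >= threshold:
            hits.append((start, start + window, score))
    return hits


def gc_fraction(encoded: Encoded) -> float:
    """Placeholder scorer (fraction of C or G); a trained model would go here."""
    return sum(row[1] + row[2] for row in encoded) / len(encoded)


# Toy sequence, for illustration only.
genome = "ATATATATGCGCGCGCATATATAT"
hits = scan(genome, window=8, step=8, score_fn=gc_fraction, threshold=0.9)
print(hits)
```

Passing the scorer as a function keeps the scanning loop independent of the model, so a real classifier could be swapped in without changing `scan`.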
In a large family of Czech origin, we mapped a locus for an autosomal-dominant corneal endothelial dystrophy, posterior polymorphous corneal dystrophy 4 (PPCD4), to 8q22.3-q24.12. Whole-genome sequencing identified a unique variant (c.20+544G>T) in this locus, within an intronic regulatory region of GRHL2. Targeted sequencing identified the same variant in three additional previously unsolved PPCD-affected families, including a de novo occurrence that suggests this is a recurrent mutation. Two further unique variants were identified in intron 1 of GRHL2 (c.20+257delT and c.20+133delA) in unrelated PPCD-affected families. GRHL2 is a transcription factor that suppresses epithelial-to-mesenchymal transition (EMT) and is a direct transcriptional repressor of ZEB1. ZEB1 mutations leading to haploinsufficiency cause PPCD3. We previously identified promoter mutations in OVOL2, a gene not normally expressed in the corneal endothelium, as the cause of PPCD1. OVOL2 drives mesenchymal-to-epithelial transition (MET) by directly inhibiting EMT-inducing transcription factors, such as ZEB1. Here, we demonstrate that the GRHL2 regulatory variants identified in PPCD4-affected individuals induce increased transcriptional activity in vitro. Furthermore, although GRHL2 is not expressed in corneal endothelial cells in control tissue, we detected GRHL2 in the corneal "endothelium" in PPCD4 tissue. These cells were also positive for epithelial markers E-Cadherin and Cytokeratin 7, indicating they have transitioned to an epithelial-like cell type. We suggest that mutations inducing MET within the corneal endothelium are a convergent pathogenic mechanism leading to dysfunction of the endothelial barrier and disease.
- Keywords
- GRHL2, PPCD, corneal dystrophy, corneal edema, corneal endothelium, ectopic expression, epithelial-to-mesenchymal transition, mesenchymal-to-epithelial transition, non-coding mutation, regulatory region
- MeSH
- Corneal Dystrophies, Hereditary genetics MeSH
- DNA-Binding Proteins genetics MeSH
- Transcription, Genetic MeSH
- Genetic Loci MeSH
- HEK293 Cells MeSH
- DNA, Intergenic genetics MeSH
- Introns genetics MeSH
- Humans MeSH
- Models, Genetic MeSH
- Mutation genetics MeSH
- Promoter Regions, Genetic genetics MeSH
- Family MeSH
- Pedigree MeSH
- Endothelium, Corneal pathology MeSH
- Base Sequence MeSH
- Whole Genome Sequencing MeSH
- Transcription Factors genetics MeSH
- Check Tag
- Humans MeSH
- Male MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- DNA-Binding Proteins MeSH
- GRHL2 protein, human MeSH Browser
- DNA, Intergenic MeSH
- Transcription Factors MeSH