Nejvíce citovaný článek - PubMed ID 31706022
Divergent distributions of inverted repeats and G-quadruplex forming sequences in Saccharomyces cerevisiae
Non-canonical secondary structures in DNA are increasingly being revealed as critical players in DNA metabolism, including modulating the accessibility and activity of promoters. These structures comprise the so-called G-quadruplexes (G4s) that are formed from sequences rich in guanine bases. Using a well-defined transcriptional reporter system, we sought to systematically investigate the impact of the presence of G4 structures on transcription in yeast Saccharomyces cerevisiae. To this aim, different G4 prone sequences were modeled to vary the chance of intramolecular G4 formation, analyzed in vitro by Thioflavin T binding test and circular dichroism and then placed at the yeast ADE2 locus on chromosome XV, downstream and adjacent to a P53 response element (RE) and upstream from a minimal CYC1 promoter and Luciferase 1 (LUC1) reporter gene in isogenic strains. While the minimal CYC1 promoter provides basal reporter activity, the P53 RE enables LUC1 transactivation under the control of P53 family proteins expressed under the inducible GAL1 promoter. Thus, the impact of the different G4 prone sequences on both basal and P53 family protein-dependent expression was measured after shifting cells onto galactose containing medium. The results showed that the presence of G4 prone sequences upstream of a yeast minimal promoter increased its basal activity proportionally to their potential to form intramolecular G4 structures; consequently, this feature, when present near the target binding site of P53 family transcription factors, can be exploited to regulate the transcriptional activity of P53, P63 and P73 proteins.
- Klíčová slova
- G-quadruplex, p53, transcriptional activity, yeast,
- MeSH
- DNA metabolismus MeSH
- G-kvadruplexy * MeSH
- nádorový supresorový protein p53 genetika MeSH
- promotorové oblasti (genetika) MeSH
- Saccharomyces cerevisiae * genetika metabolismus MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- DNA MeSH
- nádorový supresorový protein p53 MeSH
Noncanonical secondary structures in nucleic acids have been studied intensively in recent years. Important biological roles of cruciform structures formed by inverted repeats (IRs) have been demonstrated in diverse organisms, including humans. Using Palindrome analyser, we analyzed IRs in all accessible bacterial genome sequences to determine their frequencies, lengths, and localizations. IR sequences were identified in all species, but their frequencies differed significantly across various evolutionary groups. We detected 242,373,717 IRs in all 1,565 bacterial genomes. The highest mean IR frequency was detected in the Tenericutes (61.89 IRs/kbp) and the lowest mean frequency was found in the Alphaproteobacteria (27.08 IRs/kbp). IRs were abundant near genes and around regulatory, tRNA, transfer-messenger RNA (tmRNA), and rRNA regions, pointing to the importance of IRs in such basic cellular processes as genome maintenance, DNA replication, and transcription. Moreover, we found that organisms with high IR frequencies were more likely to be endosymbiotic, antibiotic producing, or pathogenic. On the other hand, those with low IR frequencies were far more likely to be thermophilic. This first comprehensive analysis of IRs in all available bacterial genomes demonstrates their genomic ubiquity, nonrandom distribution, and enrichment in genomic regulatory regions. IMPORTANCE Our manuscript reports for the first time a complete analysis of inverted repeats in all fully sequenced bacterial genomes. Thanks to the availability of unique computational resources, we were able to statistically evaluate the presence and localization of these important regulatory sequences in bacterial genomes. This work revealed a strong abundance of these sequences in regulatory regions and provides researchers with a valuable tool for their manipulation.
- Klíčová slova
- Palindrome analyser, bacteria domain, bacterial genome analysis, inverted repeats,
- MeSH
- Bacteria genetika MeSH
- fylogeneze MeSH
- genomika * MeSH
- lidé MeSH
- replikace DNA * MeSH
- sekvence nukleotidů MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Epigenetics deals with changes in gene expression that are not caused by modifications in the primary sequence of nucleic acids. These changes beyond primary structures of nucleic acids not only include DNA/RNA methylation, but also other reversible conversions, together with histone modifications or RNA interference. In addition, under particular conditions (such as specific ion concentrations or protein-induced stabilization), the right-handed double-stranded DNA helix (B-DNA) can form noncanonical structures commonly described as "non-B DNA" structures. These structures comprise, for example, cruciforms, i-motifs, triplexes, and G-quadruplexes. Their formation often leads to significant differences in replication and transcription rates. Noncanonical RNA structures have also been documented to play important roles in translation regulation and the biology of noncoding RNAs. In human and animal studies, the frequency and dynamics of noncanonical DNA and RNA structures are intensively investigated, especially in the field of cancer research and neurodegenerative diseases. In contrast, noncanonical DNA and RNA structures in plants have been on the fringes of interest for a long time and only a few studies deal with their formation, regulation, and physiological importance for plant stress responses. Herein, we present a review focused on the main fields of epigenetics in plants and their possible roles in stress responses and signaling, with special attention dedicated to noncanonical DNA and RNA structures.
- Klíčová slova
- Acetylation, Chromatin, Epigenetics, G-quadruplex, Gene expression, Histone, Methylation, Non-B DNA, Stress signaling,
- MeSH
- DNA genetika chemie MeSH
- epigeneze genetická MeSH
- G-kvadruplexy * MeSH
- lidé MeSH
- nukleové kyseliny * MeSH
- RNA genetika chemie MeSH
- rostliny genetika MeSH
- zvířata MeSH
- Check Tag
- lidé MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- DNA MeSH
- nukleové kyseliny * MeSH
- RNA MeSH
G-quadruplexes (G4s) have been long considered rare and physiologically unimportant in vitro curiosities, but recent methodological advances have proved their presence and functions in vivo. Moreover, in addition to their functional relevance in bacteria and animals, including humans, their importance has been recently demonstrated in evolutionarily distinct plant species. In this study, we analyzed the genome of Pisum sativum (garden pea, or the so-called green pea), a unique member of the Fabaceae family. Our results showed that this genome contained putative G4 sequences (PQSs). Interestingly, these PQSs were located nonrandomly in the nuclear genome. We also found PQSs in mitochondrial (mt) and chloroplast (cp) DNA, and we experimentally confirmed G4 formation for sequences found in these two organelles. The frequency of PQSs for nuclear DNA was 0.42 PQSs per thousand base pairs (kbp), in the same range as for cpDNA (0.53/kbp), but significantly lower than what was found for mitochondrial DNA (1.58/kbp). In the nuclear genome, PQSs were mainly associated with regulatory regions, including 5'UTRs, and upstream of the rRNA region. In contrast to genomic DNA, PQSs were located around RNA genes in cpDNA and mtDNA. Interestingly, PQSs were also associated with specific transposable elements such as TIR and LTR and around them, pointing to their role in their spreading in nuclear DNA. The nonrandom localization of PQSs uncovered their evolutionary and functional significance in the Pisum sativum genome.
- Klíčová slova
- G-quadruplex, G4 propensity, chloroplast DNA, sequence prediction,
- MeSH
- 5' nepřekládaná oblast MeSH
- G-kvadruplexy * MeSH
- genom rostlinný MeSH
- hrách setý genetika MeSH
- lidé MeSH
- sekvence nukleotidů MeSH
- transpozibilní elementy DNA genetika MeSH
- zvířata MeSH
- Check Tag
- lidé MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- 5' nepřekládaná oblast MeSH
- transpozibilní elementy DNA MeSH
Cruciforms occur when inverted repeat sequences in double-stranded DNA adopt intra-strand hairpins on opposing strands. Biophysical and molecular studies of these structures confirm their characterization as four-way junctions and have demonstrated that several factors influence their stability, including overall chromatin structure and DNA supercoiling. Here, we review our understanding of processes that influence the formation and stability of cruciforms in genomes, covering the range of sequences shown to have biological significance. It is challenging to accurately sequence repetitive DNA sequences, but recent advances in sequencing methods have deepened understanding about the amounts of inverted repeats in genomes from all forms of life. We highlight that, in the majority of genomes, inverted repeats are present in higher numbers than is expected from a random occurrence. It is, therefore, becoming clear that inverted repeats play important roles in regulating many aspects of DNA metabolism, including replication, gene expression, and recombination. Cruciforms are targets for many architectural and regulatory proteins, including topoisomerases, p53, Rif1, and others. Notably, some of these proteins can induce the formation of cruciform structures when they bind to DNA. Inverted repeat sequences also influence the evolution of genomes, and growing evidence highlights their significance in several human diseases, suggesting that the inverted repeat sequences and/or DNA cruciforms could be useful therapeutic targets in some cases.
- Klíčová slova
- DNA base sequence, DNA structure, DNA supercoiling, cruciform, epigenetics, genome stability, inverted repeat, replication, transcription,
- MeSH
- DNA genetika MeSH
- konformace nukleové kyseliny MeSH
- křížová struktura DNA MeSH
- lidé MeSH
- nukleové kyseliny * MeSH
- obrácené repetice MeSH
- repetitivní sekvence nukleových kyselin genetika MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- přehledy MeSH
- Názvy látek
- DNA MeSH
- křížová struktura DNA MeSH
- nukleové kyseliny * MeSH
Telomerase RNA (TR) carries the template for synthesis of telomere DNA and provides a scaffold for telomerase assembly. Fungal TRs are long and have been compared to higher eukaryotes, where they show considerable diversity within phylogenetically close groups. TRs of several Saccharomycetaceae were recently identified, however, many of these remained uncharacterised in the template region. Here we show that this is mainly due to high variability in telomere sequence. We predicted the telomere sequences using Tandem Repeats Finder and then we identified corresponding putative template regions in TR candidates. Remarkably long telomere units and the corresponding putative TRs were found in Tetrapisispora species. Notably, variable lengths of the annealing sequence of the template region (1-10 nt) were found. Consequently, species with the same telomere sequence may not harbour identical TR templates. Thus, TR sequence alone can be used to predict a template region and telomere sequence, but not to determine these exactly. A conserved feature of telomere sequences, tracts of adjacent Gs, led us to test the propensity of individual telomere sequences to form G4. The results show highly diverse values of G4-propensity, indicating the lack of ubiquitous conservation of this feature across Saccharomycetaceae.
- MeSH
- benzothiazoly metabolismus MeSH
- fluorescence MeSH
- G-kvadruplexy MeSH
- genetická variace * MeSH
- genetické matrice * MeSH
- reprodukovatelnost výsledků MeSH
- RNA genetika MeSH
- Saccharomycetales genetika MeSH
- sekvence nukleotidů MeSH
- telomerasa genetika MeSH
- telomery genetika MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- benzothiazoly MeSH
- RNA MeSH
- telomerasa MeSH
- telomerase RNA MeSH Prohlížeč
- thioflavin T MeSH Prohlížeč
Fungal infections cause >1 million deaths annually and the emergence of antifungal resistance has prompted the exploration for novel antifungal targets. Quadruplexes are four-stranded nucleic acid secondary structures, which can regulate processes such as transcription, translation, replication and recombination. They are also found in genes linked to virulence in microbes, and ligands that bind to quadruplexes can eliminate drug-resistant pathogens. Using a computational approach, we quantified putative quadruplex-forming sequences (PQS) in 1359 genomes across the fungal kingdom and explored their presence in genes related to virulence, drug resistance and biological processes associated with pathogenicity in Aspergillus fumigatus. Here we present the largest analysis of PQS in fungi and identify significant heterogeneity of these sequences throughout phyla, genera and species. PQS were genetically conserved in Aspergillus spp. and frequently pathogenic species appeared to contain fewer PQS than their lesser/non-pathogenic counterparts. GO-term analysis identified that PQS-containing genes were involved in processes linked with virulence such as zinc ion binding, the biosynthesis of secondary metabolites and regulation of transcription in A. fumigatus. Although the genome frequency of PQS was lower in A. fumigatus, PQS could be found enriched in genes involved in virulence, and genes upregulated during germination and hypoxia. Moreover, PQS were found in genes involved in drug resistance. Quadruplexes could have important roles within fungal biology and virulence, but their roles require further elucidation.
- Klíčová slova
- Aspergillus fumigatus, Fungi, G-quadruplexes, drug resistance, i-motifs, in-silico, virulence,
- MeSH
- algoritmy MeSH
- antifungální látky farmakologie MeSH
- Ascomycota MeSH
- Aspergillus fumigatus genetika MeSH
- Aspergillus MeSH
- fungální léková rezistence účinky léků MeSH
- genom fungální účinky léků MeSH
- genom virový MeSH
- transkriptom MeSH
- virulence MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH
- Názvy látek
- antifungální látky MeSH
BACKGROUND: Influenza viruses are dangerous pathogens. Seventy-Seven genomes of recently emerged genotype 4 reassortant Eurasian avian-like H1N1 virus (G4-EA-H1N1) are currently available. We investigated the presence and variation of potential G-quadruplex forming sequences (PQS), which can serve as targets for antiviral treatment. RESULTS: PQS were identified in all 77 genomes. The total number of PQS in G4-EA-H1N1 genomes was 571. Interestingly, the number of PQS per genome in individual close relative viruses varied from 4 to 12. PQS were not randomly distributed in the 8 segments of the G4-EA-H1N1 genome, the highest frequency of PQS being found in the NP segment (1.39 per 1000 nt), which is considered a potential target for antiviral therapy. In contrast, no PQS was found in the NS segment. Analyses of variability pointed the importance of some PQS; even if genome variation of influenza virus is extreme, the PQS with the highest G4Hunter score is the most conserved in all tested genomes. G-quadruplex formation in vitro was experimentally confirmed using spectroscopic methods. CONCLUSIONS: The results presented here hint several G-quadruplex-forming sequences in G4-EA-H1N1 genomes, that could provide good therapeutic targets.
- Klíčová slova
- G-quadruplex, G4Hunter, Influenza virus,
- MeSH
- chřipka lidská * MeSH
- G-kvadruplexy * MeSH
- genom virový MeSH
- genotyp MeSH
- lidé MeSH
- reassortantní viry genetika MeSH
- virus chřipky A, podtyp H1N1 * genetika MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
The importance of unusual DNA structures in the regulation of basic cellular processes is an emerging field of research. Amongst local non-B DNA structures, G-quadruplexes (G4s) have gained in popularity during the last decade, and their presence and functional relevance at the DNA and RNA level has been demonstrated in a number of viral, bacterial, and eukaryotic genomes, including humans. Here, we performed the first systematic search of G4-forming sequences in all archaeal genomes available in the NCBI database. In this article, we investigate the presence and locations of G-quadruplex forming sequences using the G4Hunter algorithm. G-quadruplex-prone sequences were identified in all archaeal species, with highly significant differences in frequency, from 0.037 to 15.31 potential quadruplex sequences per kb. While G4 forming sequences were extremely abundant in Hadesarchaea archeon (strikingly, more than 50% of the Hadesarchaea archaeon isolate WYZ-LMO6 genome is a potential part of a G4-motif), they were very rare in the Parvarchaeota phylum. The presence of G-quadruplex forming sequences does not follow a random distribution with an over-representation in non-coding RNA, suggesting possible roles for ncRNA regulation. These data illustrate the unique and non-random localization of G-quadruplexes in Archaea.
- Klíčová slova
- Archaea, G4-forming motif, genome analysis, sequence prediction, unusual nucleic acid structures,
- MeSH
- Archaea klasifikace genetika metabolismus MeSH
- archeální proteiny genetika metabolismus MeSH
- cirkulární dichroismus MeSH
- DNA vazebné proteiny genetika metabolismus MeSH
- DNA chemie genetika metabolismus MeSH
- druhová specificita MeSH
- fylogeneze MeSH
- G-kvadruplexy * MeSH
- genom archeí genetika MeSH
- genomika metody MeSH
- konformace nukleové kyseliny MeSH
- RNA chemie genetika metabolismus MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- archeální proteiny MeSH
- DNA vazebné proteiny MeSH
- DNA MeSH
- RNA MeSH