Most cited article - PubMed ID 31052562
The Presence and Localization of G-Quadruplex Forming Sequences in the Domain of Bacteria
Retroviruses are among the most extensively studied viral families, both historically and in contemporary research. They are primarily investigated in the fields of viral oncogenesis, reverse transcription mechanisms, and other infection-specific aspects. These include the integration of endogenous retroviruses (ERVs) into host genomes, a process widely utilized in genetic engineering, and the ongoing search for HIV/AIDS treatment. G-quadruplexes (G4) have emerged as potential therapeutic targets in antiviral therapy and have been identified in important regulatory regions of viral genomes. In this study, we examine the presence of potential G-quadruplex-forming sequences (PQS) across all currently available unique retroviral genomes. Given that these retroviral genomes typically consist of single-stranded RNA (ssRNA) molecules, we also investigated whether the localization of PQSs is strand-dependent. This is particularly relevant since antisense transcripts have been detected in HIV, and ERV integration into the host genome involves reverse transcription from genomic positive strand ssRNA to double-stranded DNA (dsDNA), implicating both strands in this process. We show that in most mammalian retroviruses, including human retroviruses, PQSs are significantly more prevalent on the negative (antisense) strand, with some notable exceptions such as HIV-1. In sharp contrast, avian retroviruses exhibit a higher prevalence of PQSs on the positive (sense) strand.
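The strand comparison described in this abstract can be sketched with a G4Hunter-style score. This is a minimal illustration, not the authors' pipeline: the scoring rule follows the published G4Hunter idea (G runs score positive, C runs negative, capped at 4), while the window width of 25 and the threshold of 1.2 are assumed settings that the abstract does not state. The helper names are hypothetical. The sketch counts scoring windows rather than merged PQS regions.

```python
def g4hunter_scores(seq):
    """Per-base scores: each base in a G run gets +min(run, 4), in a C run -min(run, 4)."""
    seq = seq.upper().replace("U", "T")  # treat RNA input like DNA
    scores = [0] * len(seq)
    i = 0
    while i < len(seq):
        j = i
        while j < len(seq) and seq[j] == seq[i]:
            j += 1
        run = min(j - i, 4)
        if seq[i] == "G":
            value = run
        elif seq[i] == "C":
            value = -run
        else:
            value = 0
        for k in range(i, j):
            scores[k] = value
        i = j
    return scores


def window_means(scores, window=25):
    """Mean score of every full sliding window of the given width."""
    if len(scores) < window:
        return []
    total = sum(scores[:window])
    means = [total / window]
    for k in range(window, len(scores)):
        total += scores[k] - scores[k - window]
        means.append(total / window)
    return means


def count_pqs_windows_by_strand(seq, window=25, threshold=1.2):
    """Count windows passing the threshold on the given (plus) and complementary (minus) strand."""
    means = window_means(g4hunter_scores(seq), window)
    plus = sum(1 for m in means if m >= threshold)    # G-rich on the given strand
    minus = sum(1 for m in means if m <= -threshold)  # C-rich here, so G-rich on the complement
    return {"plus": plus, "minus": minus}
```

A negative window mean marks a C-rich stretch, which corresponds to a G-rich stretch on the complementary strand, so a single pass over the genomic sequence is enough to compare the two strands.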
- Keywords
- Bioinformatics, G-quadruplex, G4Hunter, Persistent infection, Retroviral genome
- MeSH
- Endogenous Retroviruses genetics MeSH
- G-Quadruplexes * MeSH
- Genome, Viral * MeSH
- Humans MeSH
- Retroviridae * genetics MeSH
- Animals MeSH
- Check Tag
- Humans MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Current methods of processing archaeological samples, combined with advances in sequencing methods, have disclosed a large part of the genetic information of H. neanderthalensis and Denisovans. It is hardly surprising that the genome variability between modern humans, Denisovans and H. neanderthalensis is relatively limited. Genomic studies may provide insight into the metabolism of extinct human species or lineages. Detailed analysis of G-quadruplex sequences in H. neanderthalensis and Denisovan mitochondrial DNA revealed interesting features. The patterns in their mitochondrial DNA are relatively similar to those of modern humans, with one notable exception for H. neanderthalensis. An interesting difference between H. neanderthalensis and H. sapiens corresponds to a motif found in the D-loop region of mtDNA, which is responsible for mitochondrial DNA replication. This area is directly responsible for the number of mitochondria and consequently for the efficient energy metabolism of the cell. H. neanderthalensis harbors a long uninterrupted run of guanines in this region, which may cause problems for replication, in contrast with H. sapiens, for which this run is generally shorter and interrupted. One may propose that the predominant H. sapiens motif provided a selective advantage for modern humans regarding mtDNA replication and function.
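The contrast between a long uninterrupted guanine run and a shorter, interrupted one can be checked with a few lines of code. The sketch below is illustrative only: `longest_run` is a hypothetical helper, and the sequences in the usage note are made-up toy strings, not real D-loop sequences.

```python
import re


def longest_run(seq, base="G"):
    """Return (start, length) of the longest uninterrupted run of `base`; (-1, 0) if absent."""
    best_start, best_len = -1, 0
    for match in re.finditer(re.escape(base.upper()) + "+", seq.upper()):
        length = match.end() - match.start()
        if length > best_len:  # strict comparison keeps the first of equally long runs
            best_start, best_len = match.start(), length
    return best_start, best_len
```

For a toy uninterrupted tract such as `"AC" + "G" * 10 + "TA"` the function reports a single run of length 10, whereas an interrupted tract such as `"ACGGGGATGGGGTA"` yields a longest run of only 4. Passing `base="C"` scans for the same tract as it appears on the complementary strand.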
- Publication type
- Journal Article MeSH
Hepatitis delta virus (HDV) is a highly unusual RNA satellite virus that depends on the presence of hepatitis B virus (HBV) to be infectious. Its compact and variable single-stranded RNA genome consists of eight major genotypes distributed unevenly across different continents. The significance of noncanonical secondary structures such as G-quadruplexes (G4s) is increasingly recognized at the DNA and RNA levels, particularly for transcription, replication, and translation. G4s are formed from guanine-rich sequences and have been identified in the vast majority of viral, eukaryotic, and prokaryotic genomes. In this study, we analyzed the G4 propensity of HDV genomes by using G4Hunter. Unlike HBV, which has a G4 density similar to that of the human genome, HDV displays a significantly higher number of potential quadruplex-forming sequences (PQS), with a density more than four times greater than that of the human genome. This finding suggests a critical role for G4s in HDV, especially given that the PQS regions are conserved across HDV genotypes. Furthermore, the prevalence of G4-forming sequences may represent a promising target for therapeutic interventions to control HDV replication.
- Publication type
- Journal Article MeSH
Noncanonical secondary structures in nucleic acids have been studied intensively in recent years. Important biological roles of cruciform structures formed by inverted repeats (IRs) have been demonstrated in diverse organisms, including humans. Using Palindrome analyser, we analyzed IRs in all accessible bacterial genome sequences to determine their frequencies, lengths, and localizations. IR sequences were identified in all species, but their frequencies differed significantly across various evolutionary groups. We detected 242,373,717 IRs in all 1,565 bacterial genomes. The highest mean IR frequency was detected in the Tenericutes (61.89 IRs/kbp) and the lowest mean frequency was found in the Alphaproteobacteria (27.08 IRs/kbp). IRs were abundant near genes and around regulatory, tRNA, transfer-messenger RNA (tmRNA), and rRNA regions, pointing to the importance of IRs in such basic cellular processes as genome maintenance, DNA replication, and transcription. Moreover, we found that organisms with high IR frequencies were more likely to be endosymbiotic, antibiotic producing, or pathogenic. On the other hand, those with low IR frequencies were far more likely to be thermophilic. This first comprehensive analysis of IRs in all available bacterial genomes demonstrates their genomic ubiquity, nonrandom distribution, and enrichment in genomic regulatory regions. IMPORTANCE Our manuscript reports for the first time a complete analysis of inverted repeats in all fully sequenced bacterial genomes. Thanks to the availability of unique computational resources, we were able to statistically evaluate the presence and localization of these important regulatory sequences in bacterial genomes. This work revealed a strong abundance of these sequences in regulatory regions and provides researchers with a valuable tool for their manipulation.
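The IR frequencies quoted in this abstract (IRs per kbp) rest on two steps: locating inverted repeats and normalizing the count by sequence length. The sketch below shows a naive exact-match scan under assumed settings. It is not the Palindrome analyser algorithm, whose parameters are not given in the abstract; the stem length, the maximum spacer, and all function names are assumptions chosen for illustration.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_inverted_repeats(seq, stem_len=6, max_spacer=10):
    """Return (start, spacer) for each stem whose exact reverse complement
    follows it after 0..max_spacer intervening bases."""
    seq = seq.upper()
    hits = []
    for start in range(len(seq) - 2 * stem_len + 1):
        left = seq[start:start + stem_len]
        if set(left) - set("ACGT"):
            continue  # skip stems containing ambiguous bases such as N
        target = reverse_complement(left)
        for spacer in range(max_spacer + 1):
            right_start = start + stem_len + spacer
            if right_start + stem_len > len(seq):
                break
            if seq[right_start:right_start + stem_len] == target:
                hits.append((start, spacer))
    return hits


def ir_frequency_per_kbp(seq, **kwargs):
    """Number of inverted repeats per 1000 bp of sequence."""
    return 1000 * len(find_inverted_repeats(seq, **kwargs)) / len(seq)
```

Because the scan allows no mismatches and uses a single stem length, its counts are not comparable with the figures reported in the abstract; it only illustrates how a per-kbp frequency is derived from raw hits.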
- Keywords
- Palindrome analyser, bacteria domain, bacterial genome analysis, inverted repeats
- MeSH
- Bacteria genetics MeSH
- Phylogeny MeSH
- Genomics * MeSH
- Humans MeSH
- DNA Replication * MeSH
- Base Sequence MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Epigenetics deals with changes in gene expression that are not caused by modifications in the primary sequence of nucleic acids. These changes beyond primary structures of nucleic acids not only include DNA/RNA methylation, but also other reversible conversions, together with histone modifications or RNA interference. In addition, under particular conditions (such as specific ion concentrations or protein-induced stabilization), the right-handed double-stranded DNA helix (B-DNA) can form noncanonical structures commonly described as "non-B DNA" structures. These structures comprise, for example, cruciforms, i-motifs, triplexes, and G-quadruplexes. Their formation often leads to significant differences in replication and transcription rates. Noncanonical RNA structures have also been documented to play important roles in translation regulation and the biology of noncoding RNAs. In human and animal studies, the frequency and dynamics of noncanonical DNA and RNA structures are intensively investigated, especially in the field of cancer research and neurodegenerative diseases. In contrast, noncanonical DNA and RNA structures in plants have been on the fringes of interest for a long time and only a few studies deal with their formation, regulation, and physiological importance for plant stress responses. Herein, we present a review focused on the main fields of epigenetics in plants and their possible roles in stress responses and signaling, with special attention dedicated to noncanonical DNA and RNA structures.
- Keywords
- Acetylation, Chromatin, Epigenetics, G-quadruplex, Gene expression, Histone, Methylation, Non-B DNA, Stress signaling
- MeSH
- DNA genetics chemistry MeSH
- Epigenesis, Genetic MeSH
- G-Quadruplexes * MeSH
- Humans MeSH
- Nucleic Acids * MeSH
- RNA genetics chemistry MeSH
- Plants genetics MeSH
- Animals MeSH
- Check Tag
- Humans MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substances
- DNA MeSH
- Nucleic Acids * MeSH
- RNA MeSH
Sequences of nucleic acids with the potential to form four-stranded G-quadruplex structures are intensively studied, mainly in the context of human diseases, pathogens, or extremophile organisms; nonetheless, knowledge about their occurrence and putative role in plants is still limited. This work focuses on G-quadruplex-forming sites in two gene sets of interest in the model plant organism Arabidopsis thaliana: drought stress-responsive genes, and genes related to the production/biosynthesis of phenolic compounds. In addition, 20 housekeeping genes were analyzed, for which constitutive gene expression was expected (with no need for precise regulation depending on internal or external factors). The results showed that none of the tested gene sets differed significantly in the content of G-quadruplex-forming sites. However, the highest frequency of G-quadruplex-forming sites was found in the 5'-UTR regions of phenolic compound biosynthesis genes, which indicates the possibility of their regulation at the mRNA level. In addition, G-quadruplex-forming sites were highly underrepresented, mainly within introns and the 1000 bp flanking regions downstream of genes. Finally, cluster analysis allowed us to observe similarities between particular genes in terms of their PQS characteristics. We believe that the original approach used in this study may become useful for further and more comprehensive bioinformatic studies in the field of G-quadruplex genomics.
- Keywords
- Arabidopsis thaliana, G-quadruplex, PQS, drought stress, phenolic compounds
- Publication type
- Journal Article MeSH
G-quadruplexes (G4s) have long been considered rare and physiologically unimportant in vitro curiosities, but recent methodological advances have proved their presence and functions in vivo. Moreover, in addition to their functional relevance in bacteria and animals, including humans, their importance has recently been demonstrated in evolutionarily distinct plant species. In this study, we analyzed the genome of Pisum sativum (garden pea, or the so-called green pea), a unique member of the Fabaceae family. Our results showed that this genome contained putative G4 sequences (PQSs). Interestingly, these PQSs were located nonrandomly in the nuclear genome. We also found PQSs in mitochondrial (mt) and chloroplast (cp) DNA, and we experimentally confirmed G4 formation for sequences found in these two organelles. The frequency of PQSs for nuclear DNA was 0.42 PQSs per thousand base pairs (kbp), in the same range as for cpDNA (0.53/kbp), but significantly lower than what was found for mitochondrial DNA (1.58/kbp). In the nuclear genome, PQSs were mainly associated with regulatory regions, including 5'UTRs, and with the region upstream of the rRNA genes. In contrast to nuclear DNA, PQSs were located around RNA genes in cpDNA and mtDNA. Interestingly, PQSs were also found within and around specific transposable elements such as TIR and LTR, pointing to a role of PQSs in the spreading of these elements in nuclear DNA. The nonrandom localization of PQSs points to their evolutionary and functional significance in the Pisum sativum genome.
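The per-kbp frequencies reported in this abstract can be compared directly once they are expressed on the same scale. The short sketch below shows the normalization and the fold differences relative to nuclear DNA. The three frequencies are taken from the abstract; the function name and the example count and length passed to `per_kbp` are hypothetical.

```python
def per_kbp(count, length_bp):
    """Convert a raw PQS count into a frequency per 1000 bp."""
    return 1000 * count / length_bp


# Frequencies reported in the abstract (PQSs per kbp)
reported = {"nuclear": 0.42, "chloroplast": 0.53, "mitochondrial": 1.58}

# Fold difference of each compartment relative to nuclear DNA
fold_vs_nuclear = {name: value / reported["nuclear"] for name, value in reported.items()}
```

With these values, mitochondrial DNA carries roughly 3.8 times as many PQSs per kbp as nuclear DNA, while chloroplast DNA is about 1.3 times the nuclear frequency.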
- Keywords
- G-quadruplex, G4 propensity, chloroplast DNA, sequence prediction
- MeSH
- 5' Untranslated Regions MeSH
- G-Quadruplexes * MeSH
- Genome, Plant MeSH
- Pisum sativum genetics MeSH
- Humans MeSH
- Base Sequence MeSH
- DNA Transposable Elements genetics MeSH
- Animals MeSH
- Check Tag
- Humans MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Substances
- 5' Untranslated Regions MeSH
- DNA Transposable Elements MeSH
Parasitic helminths are highly prevalent in humans, infecting ∼2 billion people worldwide and causing inflammatory responses, malnutrition and anemia, which are the primary causes of morbidity. In addition, helminth infections of cattle have a significant economic impact on livestock production, milk yield and fertility. The etiological agents of helminth infections are mainly Nematodes (roundworms) and Platyhelminths (flatworms). G-quadruplexes (G4) are unusual nucleic acid structures formed by G-rich sequences that can be recognized by specific G4 ligands. Here we used the G4Hunter Web Tool to identify and compare potential G4 sequences (PQS) in the nuclear and mitochondrial genomes of various helminths to identify G4 ligand targets. PQS are nonrandomly distributed in these genomes and often located in the proximity of genes. Unexpectedly, a Nematode, Ascaris lumbricoides, was found to be highly enriched in stable PQS. This species can tolerate high-stability G4 structures, which are not counterselected at all, in stark contrast to most other species. We experimentally confirmed G4 formation for sequences found in four different parasitic helminths. Small molecules able to selectively recognize G4 were found to bind to Schistosoma mansoni G4 motifs. Two of these ligands demonstrated potent activity against both larval and adult stages of this parasite.
- MeSH
- Helminths genetics MeSH
- G-Quadruplexes * MeSH
- Genome MeSH
- Nematoda * genetics MeSH
- Humans MeSH
- Ligands MeSH
- Parasites genetics MeSH
- Platyhelminths * genetics MeSH
- Cattle MeSH
- Animals MeSH
- Check Tag
- Humans MeSH
- Cattle MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substances
- Ligands MeSH
R-loops are common non-B nucleic acid structures formed by a three-stranded nucleic acid composed of an RNA-DNA hybrid and a displaced single-stranded DNA (ssDNA) loop. Because aberrant R-loop formation leads to increased mutagenesis, hyper-recombination, rearrangements, and transcription-replication collisions, it is regarded as important in human diseases. Therefore, the prevalence and distribution of R-loops in genomes are studied intensively. However, in silico tools for R-loop prediction are limited, and we have therefore developed the R-loop tracker tool, which was implemented as a part of the DNA Analyser web server. This new tool focuses on (1) prediction of R-loops in genomic DNA without length and sequence limitations; (2) integration of R-loop tracker results with other tools for nucleic acid analyses, including Genome Browser; (3) internal cross-evaluation of in silico results with experimental data, where available; (4) easy export and correlation analyses with other genome features and markers; and (5) enhanced visualization outputs. Our new R-loop tracker tool is freely accessible on the web pages of DNA Analyser tools, and its implementation on the web-based server allows effective analyses not only of DNA segments but also of full chromosomes and genomes.
- Keywords
- RNA–DNA hybrid, non-B structure, sequence analysis
- MeSH
- Algorithms * MeSH
- DNA chemistry genetics MeSH
- Genomics methods MeSH
- Internet statistics & numerical data MeSH
- Humans MeSH
- Genomic Instability * MeSH
- R-Loop Structures * MeSH
- Software MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Substances
- DNA MeSH
G-quadruplexes have long been perceived as rare and physiologically unimportant nucleic acid structures. However, several studies have revealed their importance in molecular processes, suggesting their possible role in replication and gene expression regulation. Pathways involving G-quadruplexes are intensively studied, especially in the context of human diseases, while their involvement in gene expression regulation in plants remains largely unexplored. Here, we conducted a bioinformatic study and performed a complex circular dichroism measurement to identify a stable G-quadruplex in the gene RPB1, coding for the RNA polymerase II large subunit. We found that this G-quadruplex-forming locus is highly evolutionarily conserved amongst plants sensu lato (Archaeplastida) that share a common ancestor more than one billion years old. Finally, we discussed a new hypothesis regarding G-quadruplexes interacting with UV light in plants to potentially form an additional layer of the regulatory network.
- Keywords
- UV light, circular dichroism, evolution, nucleic acids, plant science
- MeSH
- Arabidopsis chemistry genetics radiation effects MeSH
- Circular Dichroism MeSH
- Phylogeny MeSH
- G-Quadruplexes * radiation effects MeSH
- Glaucophyta chemistry genetics radiation effects MeSH
- Evolution, Molecular MeSH
- Gene Expression Regulation, Plant genetics MeSH
- Rhodophyta chemistry genetics radiation effects MeSH
- RNA Polymerase II chemistry genetics MeSH
- Plant Proteins chemistry genetics radiation effects MeSH
- Plants chemistry genetics radiation effects MeSH
- Amino Acid Sequence MeSH
- Sequence Alignment MeSH
- Ultraviolet Rays MeSH
- Computational Biology MeSH
- Publication type
- Journal Article MeSH
- Substances
- RNA Polymerase II MeSH
- Plant Proteins MeSH