Retroviruses are among the most extensively studied viral families, both historically and in contemporary research. They are primarily investigated in the fields of viral oncogenesis, reverse transcription mechanisms, and other infection-specific aspects. These include the integration of endogenous retroviruses (ERVs) into host genomes, a process widely utilized in genetic engineering, and the ongoing search for HIV/AIDS treatment. G-quadruplexes (G4) have emerged as potential therapeutic targets in antiviral therapy and have been identified in important regulatory regions of viral genomes. In this study, we examine the presence of potential G-quadruplex-forming sequences (PQS) across all currently available unique retroviral genomes. Given that these retroviral genomes typically consist of single-stranded RNA (ssRNA) molecules, we also investigated whether the localization of PQSs is strand-dependent. This is particularly relevant since antisense transcripts have been detected in HIV, and ERV integration into the host genome involves reverse transcription from genomic positive strand ssRNA to double-stranded DNA (dsDNA), implicating both strands in this process. We show that in most mammalian retroviruses, including human retroviruses, PQSs are significantly more prevalent on the negative (antisense) strand, with some notable exceptions such as HIV-1. In sharp contrast, avian retroviruses exhibit a higher prevalence of PQSs on the positive (sense) strand.
- Klíčová slova
- Bioinformatics, G-quadruplex, G4Hunter, Persistent infection, Retroviral genome,
- MeSH
- endogenní retroviry genetika MeSH
- G-kvadruplexy * MeSH
- genom virový * MeSH
- lidé MeSH
- Retroviridae * genetika MeSH
- zvířata MeSH
- Check Tag
- lidé MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Current methods of processing archaeological samples combined with advances in sequencing methods lead to disclosure of a large part of H. neanderthalensis and Denisovans genetic information. It is hardly surprising that the genome variability between modern humans, Denisovans and H. neanderthalensis is relatively limited. Genomic studies may provide insight on the metabolism of extinct human species or lineages. Detailed analysis of G-quadruplex sequences in H. neanderthalensis and Denisovans mitochondrial DNA showed us interesting features. Relatively similar patterns in mitochondrial DNA are found compared to modern humans, with one notable exception for H. neanderthalensis. An interesting difference between H. neanderthalensis and H. sapiens corresponds to a motif found in the D-loop region of mtDNA, which is responsible for mitochondrial DNA replication. This area is directly responsible for the number of mitochondria and consequently for the efficient energy metabolism of cell. H. neanderthalensis harbor a long uninterrupted run of guanines in this region, which may cause problems for replication, in contrast with H. sapiens, for which this run is generally shorter and interrupted. One may propose that the predominant H. sapiens motif provided a selective advantage for modern humans regarding mtDNA replication and function.
- Publikační typ
- časopisecké články MeSH
Hepatitis delta virus (HDV) is a highly unusual RNA satellite virus that depends on the presence of hepatitis B virus (HBV) to be infectious. Its compact and variable single-stranded RNA genome consists of eight major genotypes distributed unevenly across different continents. The significance of noncanonical secondary structures such as G-quadruplexes (G4s) is increasingly recognized at the DNA and RNA levels, particularly for transcription, replication, and translation. G4s are formed from guanine-rich sequences and have been identified in the vast majority of viral, eukaryotic, and prokaryotic genomes. In this study, we analyzed the G4 propensity of HDV genomes by using G4Hunter. Unlike HBV, which has a G4 density similar to that of the human genome, HDV displays a significantly higher number of potential quadruplex-forming sequences (PQS), with a density more than four times greater than that of the human genome. This finding suggests a critical role for G4s in HDV, especially given that the PQS regions are conserved across HDV genotypes. Furthermore, the prevalence of G4-forming sequences may represent a promising target for therapeutic interventions to control HDV replication.
- Publikační typ
- časopisecké články MeSH
Metal ions are essential components for the survival of living organisms. For most species, intracellular and extracellular ionic conditions differ significantly. As G-quadruplexes (G4s) are ion-dependent structures, changes in the [Na+]/[K+] ratio may affect the folding of genomic G4s. More than 11000 putative G4 sequences in the human genome (hg19) contain at least two runs of three continuous cytosines, and these mixed G/C-rich sequences may form a quadruplex or a competing hairpin structure based on G-C base pairing. In this study, we examine how the [Na+]/[K+] ratio influences the structures of G/C-rich sequences. The natural G4 structure with a 9-nt long central loop, CEBwt, was chosen as a model sequence, and the loop bases were gradually replaced by cytosines. The series of CEB mutations revealed that the presence of cytosines in G4 loops does not prevent G4 folding or decrease G4 stability but increases the probability of forming a competing structure, either a hairpin or an intermolecular duplex. Slow conversion to the quadruplex in vitro (in a potassium-rich buffer) and cells was demonstrated by NMR. 'Shape-shifting' sequences may respond to [Na+]/[K+] changes with delayed kinetics.
Epigenetics deals with changes in gene expression that are not caused by modifications in the primary sequence of nucleic acids. These changes beyond primary structures of nucleic acids not only include DNA/RNA methylation, but also other reversible conversions, together with histone modifications or RNA interference. In addition, under particular conditions (such as specific ion concentrations or protein-induced stabilization), the right-handed double-stranded DNA helix (B-DNA) can form noncanonical structures commonly described as "non-B DNA" structures. These structures comprise, for example, cruciforms, i-motifs, triplexes, and G-quadruplexes. Their formation often leads to significant differences in replication and transcription rates. Noncanonical RNA structures have also been documented to play important roles in translation regulation and the biology of noncoding RNAs. In human and animal studies, the frequency and dynamics of noncanonical DNA and RNA structures are intensively investigated, especially in the field of cancer research and neurodegenerative diseases. In contrast, noncanonical DNA and RNA structures in plants have been on the fringes of interest for a long time and only a few studies deal with their formation, regulation, and physiological importance for plant stress responses. Herein, we present a review focused on the main fields of epigenetics in plants and their possible roles in stress responses and signaling, with special attention dedicated to noncanonical DNA and RNA structures.
- Klíčová slova
- Acetylation, Chromatin, Epigenetics, G-quadruplex, Gene expression, Histone, Methylation, Non-B DNA, Stress signaling,
- MeSH
- DNA genetika chemie MeSH
- epigeneze genetická MeSH
- G-kvadruplexy * MeSH
- lidé MeSH
- nukleové kyseliny * MeSH
- RNA genetika chemie MeSH
- rostliny genetika MeSH
- zvířata MeSH
- Check Tag
- lidé MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- DNA MeSH
- nukleové kyseliny * MeSH
- RNA MeSH
Sequences of nucleic acids with the potential to form four-stranded G-quadruplex structures are intensively studied mainly in the context of human diseases, pathogens, or extremophile organisms; nonetheless, the knowledge about their occurrence and putative role in plants is still limited. This work is focused on G-quadruplex-forming sites in two gene sets of interest: drought stress-responsive genes, and genes related to the production/biosynthesis of phenolic compounds in the model plant organism Arabidopsis thaliana. In addition, 20 housekeeping genes were analyzed as well, where the constitutive gene expression was expected (with no need for precise regulation depending on internal or external factors). The results have shown that none of the tested gene sets differed significantly in the content of G-quadruplex-forming sites, however, the highest frequency of G-quadruplex-forming sites was found in the 5'-UTR regions of phenolic compounds' biosynthesis genes, which indicates the possibility of their regulation at the mRNA level. In addition, mainly within the introns and 1000 bp flanks downstream gene regions, G-quadruplex-forming sites were highly underrepresented. Finally, cluster analysis allowed us to observe similarities between particular genes in terms of their PQS characteristics. We believe that the original approach used in this study may become useful for further and more comprehensive bioinformatic studies in the field of G-quadruplex genomics.
- Klíčová slova
- Arabidopsis thaliana, G-quadruplex, PQS, drought stress, phenolic compounds,
- Publikační typ
- časopisecké články MeSH
G-quadruplexes (G4s) have been long considered rare and physiologically unimportant in vitro curiosities, but recent methodological advances have proved their presence and functions in vivo. Moreover, in addition to their functional relevance in bacteria and animals, including humans, their importance has been recently demonstrated in evolutionarily distinct plant species. In this study, we analyzed the genome of Pisum sativum (garden pea, or the so-called green pea), a unique member of the Fabaceae family. Our results showed that this genome contained putative G4 sequences (PQSs). Interestingly, these PQSs were located nonrandomly in the nuclear genome. We also found PQSs in mitochondrial (mt) and chloroplast (cp) DNA, and we experimentally confirmed G4 formation for sequences found in these two organelles. The frequency of PQSs for nuclear DNA was 0.42 PQSs per thousand base pairs (kbp), in the same range as for cpDNA (0.53/kbp), but significantly lower than what was found for mitochondrial DNA (1.58/kbp). In the nuclear genome, PQSs were mainly associated with regulatory regions, including 5'UTRs, and upstream of the rRNA region. In contrast to genomic DNA, PQSs were located around RNA genes in cpDNA and mtDNA. Interestingly, PQSs were also associated with specific transposable elements such as TIR and LTR and around them, pointing to their role in their spreading in nuclear DNA. The nonrandom localization of PQSs uncovered their evolutionary and functional significance in the Pisum sativum genome.
- Klíčová slova
- G-quadruplex, G4 propensity, chloroplast DNA, sequence prediction,
- MeSH
- 5' nepřekládaná oblast MeSH
- G-kvadruplexy * MeSH
- genom rostlinný MeSH
- hrách setý genetika MeSH
- lidé MeSH
- sekvence nukleotidů MeSH
- transpozibilní elementy DNA genetika MeSH
- zvířata MeSH
- Check Tag
- lidé MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- 5' nepřekládaná oblast MeSH
- transpozibilní elementy DNA MeSH
Parasitic helminths infecting humans are highly prevalent infecting ∼2 billion people worldwide, causing inflammatory responses, malnutrition and anemia that are the primary cause of morbidity. In addition, helminth infections of cattle have a significant economic impact on livestock production, milk yield and fertility. The etiological agents of helminth infections are mainly Nematodes (roundworms) and Platyhelminths (flatworms). G-quadruplexes (G4) are unusual nucleic acid structures formed by G-rich sequences that can be recognized by specific G4 ligands. Here we used the G4Hunter Web Tool to identify and compare potential G4 sequences (PQS) in the nuclear and mitochondrial genomes of various helminths to identify G4 ligand targets. PQS are nonrandomly distributed in these genomes and often located in the proximity of genes. Unexpectedly, a Nematode, Ascaris lumbricoides, was found to be highly enriched in stable PQS. This species can tolerate high-stability G4 structures, which are not counter selected at all, in stark contrast to most other species. We experimentally confirmed G4 formation for sequences found in four different parasitic helminths. Small molecules able to selectively recognize G4 were found to bind to Schistosoma mansoni G4 motifs. Two of these ligands demonstrated potent activity both against larval and adult stages of this parasite.
- MeSH
- cizopasní červi genetika MeSH
- G-kvadruplexy * MeSH
- genom MeSH
- hlístice * genetika MeSH
- lidé MeSH
- ligandy MeSH
- paraziti genetika MeSH
- ploštěnci * genetika MeSH
- skot MeSH
- zvířata MeSH
- Check Tag
- lidé MeSH
- skot MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- ligandy MeSH
G-quadruplexes have long been perceived as rare and physiologically unimportant nucleic acid structures. However, several studies have revealed their importance in molecular processes, suggesting their possible role in replication and gene expression regulation. Pathways involving G-quadruplexes are intensively studied, especially in the context of human diseases, while their involvement in gene expression regulation in plants remains largely unexplored. Here, we conducted a bioinformatic study and performed a complex circular dichroism measurement to identify a stable G-quadruplex in the gene RPB1, coding for the RNA polymerase II large subunit. We found that this G-quadruplex-forming locus is highly evolutionarily conserved amongst plants sensu lato (Archaeplastida) that share a common ancestor more than one billion years old. Finally, we discussed a new hypothesis regarding G-quadruplexes interacting with UV light in plants to potentially form an additional layer of the regulatory network.
- Klíčová slova
- UV light, circular dichroism, evolution, nucleic acids, plant science,
- MeSH
- Arabidopsis chemie genetika účinky záření MeSH
- cirkulární dichroismus MeSH
- fylogeneze MeSH
- G-kvadruplexy * účinky záření MeSH
- Glaucophyta chemie genetika účinky záření MeSH
- molekulární evoluce MeSH
- regulace genové exprese u rostlin genetika MeSH
- Rhodophyta chemie genetika účinky záření MeSH
- RNA-polymerasa II chemie genetika MeSH
- rostlinné proteiny chemie genetika účinky záření MeSH
- rostliny chemie genetika účinky záření MeSH
- sekvence aminokyselin MeSH
- sekvenční seřazení MeSH
- ultrafialové záření MeSH
- výpočetní biologie MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- RNA-polymerasa II MeSH
- rostlinné proteiny MeSH
The importance of gene expression regulation in viruses based upon G-quadruplex may point to its potential utilization in therapeutic targeting. Here, we present analyses as to the occurrence of putative G-quadruplex-forming sequences (PQS) in all reference viral dsDNA genomes and evaluate their dependence on PQS occurrence in host organisms using the G4Hunter tool. PQS frequencies differ across host taxa without regard to GC content. The overlay of PQS with annotated regions reveals the localization of PQS in specific regions. While abundance in some, such as repeat regions, is shared by all groups, others are unique. There is abundance within introns of Eukaryota-infecting viruses, but depletion of PQS in introns of bacteria-infecting viruses. We reveal a significant positive correlation between PQS frequencies in dsDNA viruses and corresponding hosts from archaea, bacteria, and eukaryotes. A strong relationship between PQS in a virus and its host indicates their close coevolution and evolutionarily reciprocal mimicking of genome organization.
- Klíčová slova
- G-quadruplex, G4Hunter, bioinformatics, coevolution, dsDNA, host, virus,
- MeSH
- Archaea virologie MeSH
- Bacteria virologie MeSH
- DNA genetika MeSH
- G-kvadruplexy * MeSH
- genom virový * MeSH
- genom MeSH
- lidé MeSH
- regulace genové exprese MeSH
- virové proteiny genetika MeSH
- viry genetika MeSH
- výpočetní biologie metody MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- DNA MeSH
- virové proteiny MeSH