Most cited article - PubMed ID 24616836
Quadruplex-forming DNA sequences spread by retrotransposons may serve as genome regulators
Non-canonical (non-B) DNA structures-e.g. bent DNA, hairpins, G-quadruplexes (G4s), Z-DNA, etc.-which form at certain sequence motifs (e.g. A-phased repeats, inverted repeats, etc.), have emerged as important regulators of cellular processes and drivers of genome evolution. Yet, they have been understudied due to their repetitive nature and potentially inaccurate sequences generated with short-read technologies. Here we comprehensively characterize such motifs in the long-read telomere-to-telomere (T2T) genomes of human, bonobo, chimpanzee, gorilla, Bornean orangutan, Sumatran orangutan, and siamang. Non-B DNA motifs are enriched at the genomic regions added to T2T assemblies and occupy 9%-15%, 9%-11%, and 12%-38% of autosomes and chromosomes X and Y, respectively. G4s and Z-DNA are enriched at promoters and enhancers, as well as at origins of replication. Repetitive sequences harbor more non-B DNA motifs than non-repetitive sequences, especially in the short arms of acrocentric chromosomes. Most centromeres and/or their flanking regions are enriched in at least one non-B DNA motif type, consistent with a potential role of non-B structures in determining centromeres. Our results highlight the uneven distribution of predicted non-B DNA structures across ape genomes and suggest their novel functions in previously inaccessible genomic regions.
- MeSH
- DNA * chemistry genetics MeSH
- G-Quadruplexes MeSH
- Genome, Human MeSH
- Genome * MeSH
- Hominidae * genetics MeSH
- Humans MeSH
- Nucleotide Motifs MeSH
- Pan troglodytes genetics MeSH
- Repetitive Sequences, Nucleic Acid MeSH
- Telomere * genetics MeSH
- Animals MeSH
- Check Tag
- Humans MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- DNA * MeSH
Non-canonical (non-B) DNA structures-e.g., bent DNA, hairpins, G-quadruplexes (G4s), Z-DNA, etc.-which form at certain sequence motifs (e.g., A-phased repeats, inverted repeats, etc.), have emerged as important regulators of cellular processes and drivers of genome evolution. Yet, they have been understudied due to their repetitive nature and potentially inaccurate sequences generated with short-read technologies. Here we comprehensively characterize such motifs in the long-read telomere-to-telomere (T2T) genomes of human, bonobo, chimpanzee, gorilla, Bornean orangutan, Sumatran orangutan, and siamang. Non-B DNA motifs are enriched at the genomic regions added to T2T assemblies, and occupy 9-15%, 9-11%, and 12-38% of autosomes, and chromosomes X and Y, respectively. G4s and Z-DNA are enriched at promoters and enhancers, as well as at origins of replication. Repetitive sequences harbor more non-B DNA motifs than non-repetitive sequences, especially in the short arms of acrocentric chromosomes. Most centromeres and/or their flanking regions are enriched in at least one non-B DNA motif type, consistent with a potential role of non-B structures in determining centromeres. Our results highlight the uneven distribution of predicted non-B DNA structures across ape genomes and suggest their novel functions in previously inaccessible genomic regions.
- Publication type
- Journal Article MeSH
- Preprint MeSH
BACKGROUND: Canonical telomeres (telomerase-synthetised) are readily forming G-quadruplexes (G4) on the G-rich strand. However, there are examples of non-canonical telomeres among eukaryotes where telomeric tandem repeats are invaded by specific retrotransposons. Drosophila melanogaster represents an extreme example with telomeres composed solely by three retrotransposons-Het-A, TAHRE and TART (HTT). Even though non-canonical telomeres often show strand biased G-distribution, the evidence for the G4-forming potential is limited. RESULTS: Using circular dichroism spectroscopy and UV absorption melting assay we have verified in vitro G4-formation in the HTT elements of D. melanogaster. Namely 3 in Het-A, 8 in TART and 2 in TAHRE. All the G4s are asymmetrically distributed as in canonical telomeres. Bioinformatic analysis showed that asymmetric distribution of potential quadruplex sequences (PQS) is common in telomeric retrotransposons in other Drosophila species. Most of the PQS are located in the gag gene where PQS density correlates with higher DNA sequence conservation and codon selection favoring G4-forming potential. The importance of G4s in non-canonical telomeres is further supported by analysis of telomere-associated retrotransposons from various eukaryotic species including green algae, Diplomonadida, fungi, insects and vertebrates. Virtually all analyzed telomere-associated retrotransposons contained PQS, frequently with asymmetric strand distribution. Comparison with non-telomeric elements showed independent selection of PQS-rich elements from four distinct LINE clades. CONCLUSION: Our findings of strand-biased G4-forming motifs in telomere-associated retrotransposons from various eukaryotic species support the G4-formation as one of the prerequisites for the recruitment of specific retrotransposons to chromosome ends and call for further experimental studies.
- Keywords
- Drosophila, G-quadruplex, Het-A, Retrotransposon, TAHRE, TART, Telomere,
- Publication type
- Journal Article MeSH
Guanine quadruplexes (G4s) serve as regulators of replication, recombination and gene expression. G4 motifs have been recently identified in LTR retrotransposons, but their role in the retrotransposon life-cycle is yet to be understood. Therefore, we inserted G4s into the 3'UTR of Ty1his3-AI retrotransposon and measured the frequency of retrotransposition in yeast strains BY4741, Y00509 (without Pif1 helicase) and with G4-stabilization by N-methyl mesoporphyrin IX (NMM) treatment. We evaluated the impact of G4s on mRNA levels by RT-qPCR and products of reverse transcription by Southern blot analysis. We found that the presence of G4 inhibited Ty1his3-AI retrotransposition. The effect was stronger when G4s were on a transcription template strand which leads to reverse transcription interruption. Both NMM and Pif1p deficiency reduced the retrotransposition irrespective of the presence of a G4 motif in the Ty1his3-AI element. Quantity of mRNA and products of reverse transcription did not fully explain the impact of G4s on Ty1his3-AI retrotransposition indicating that G4s probably affect some other steps of the retrotransposon life-cycle (e.g., translation, VLP formation, integration). Our results suggest that G4 DNA conformation can tune the activity of mobile genetic elements that in turn contribute to shaping the eukaryotic genomes.
- Keywords
- G-quadruplex, N-methyl mesoporphyrin (NMM), Pif1 helicase, Ty1 LTR retrotransposon, retrotransposition, yeast,
- Publication type
- Journal Article MeSH
BACKGROUND: Many studies have shown that guanine-rich DNA sequences form quadruplex structures (G4) in vitro but there is scarce evidence of guanine quadruplexes in vivo. The majority of potential quadruplex-forming sequences (PQS) are located in transposable elements (TEs), especially close to promoters within long terminal repeats of plant LTR retrotransposons. RESULTS: In order to test the potential effect of G4s on retrotransposon expression, we cloned the long terminal repeats of selected maize LTR retrotransposons upstream of the lacZ reporter gene and measured its transcription and translation in yeast. We found that G4s had an inhibitory effect on translation in vivo since "mutants" (where guanines were replaced by adenines in PQS) showed higher expression levels than wild-types. In parallel, we confirmed by circular dichroism measurements that the selected sequences can indeed adopt G4 conformation in vitro. Analysis of RNA-Seq of polyA RNA in maize seedlings grown in the presence of a G4-stabilizing ligand (NMM) showed both inhibitory as well as stimulatory effects on the transcription of LTR retrotransposons. CONCLUSIONS: Our results demonstrate that quadruplex DNA located within long terminal repeats of LTR retrotransposons can be formed in vivo and that it plays a regulatory role in the LTR retrotransposon life-cycle, thus also affecting genome dynamics.
- Keywords
- Circular dichroism, G4 motifs, Maize LTR retrotransposons, NMM ligand, Quadruplex DNA, Transposable elements,
- MeSH
- G-Quadruplexes * MeSH
- Transcription, Genetic MeSH
- Genome, Plant * MeSH
- Terminal Repeat Sequences * MeSH
- Zea mays genetics growth & development metabolism MeSH
- Genes, Reporter * MeSH
- Retroelements * MeSH
- Saccharomyces cerevisiae genetics growth & development MeSH
- High-Throughput Nucleotide Sequencing MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Retroelements * MeSH
A significant part of eukaryotic genomes is formed by transposable elements (TEs) containing not only genes but also regulatory sequences. Some of the regulatory sequences located within TEs can form secondary structures like hairpins or three-stranded (triplex DNA) and four-stranded (quadruplex DNA) conformations. This review focuses on recent evidence showing that G-quadruplex-forming sequences in particular are often present in specific parts of TEs in plants and humans. We discuss the potential role of these structures in the TE life cycle as well as the impact of G-quadruplexes on replication, transcription, translation, chromatin status, and recombination. The aim of this review is to emphasize that TEs may serve as vehicles for the genomic spread of G-quadruplexes. These non-canonical DNA structures and their conformational switches may constitute another regulatory system that, together with small and long non-coding RNA molecules and proteins, contribute to the complex cellular network resulting in the large diversity of eukaryotes.
- Keywords
- DNA and RNA quadruplexes, G-quadruplexes, LTR retrotransposons, recombination, replication, transcription, transposable elements,
- MeSH
- DNA-Binding Proteins metabolism MeSH
- G-Quadruplexes * MeSH
- Genomics MeSH
- Humans MeSH
- Open Reading Frames MeSH
- Gene Expression Regulation MeSH
- Regulatory Sequences, Nucleic Acid MeSH
- Repetitive Sequences, Nucleic Acid MeSH
- DNA Replication MeSH
- Retroelements genetics MeSH
- RNA chemistry genetics MeSH
- Plants genetics MeSH
- DNA Transposable Elements genetics MeSH
- Protein Binding MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Review MeSH
- Names of Substances
- DNA-Binding Proteins MeSH
- Retroelements MeSH
- RNA MeSH
- DNA Transposable Elements MeSH
BACKGROUND: Transposable elements form a significant proportion of eukaryotic genomes. Recently, Lexa et al. (Nucleic Acids Res 42:968-978, 2014) reported that plant long terminal repeat (LTR) retrotransposons often contain potential quadruplex sequences (PQSs) in their LTRs and experimentally confirmed their ability to adopt four-stranded DNA conformations. RESULTS: Here, we searched for PQSs in human retrotransposons and found that PQSs are specifically localized in the 3'-UTR of LINE-1 elements, in LTRs of HERV elements and are strongly accumulated in specific regions of SVA elements. Circular dichroism spectroscopy confirmed that most PQSs had adopted monomolecular or bimolecular guanine quadruplex structures. Evolutionarily young SVA elements contained more PQSs than older elements and their propensity to form quadruplex DNA was higher. Full-length L1 elements contained more PQSs than truncated elements; the highest proportion of PQSs was found inside transpositionally active L1 elements (PA2 and HS families). CONCLUSIONS: Conservation of quadruplexes at specific positions of transposable elements implies their importance in their life cycle. The increasing quadruplex presence in evolutionarily young LINE-1 and SVA families makes these elements important contributors toward present genome-wide quadruplex distribution.
- MeSH
- Long Interspersed Nucleotide Elements MeSH
- Alu Elements MeSH
- Endogenous Retroviruses MeSH
- G-Quadruplexes * MeSH
- Genomics MeSH
- Humans MeSH
- Chromosome Mapping MeSH
- Repetitive Sequences, Nucleic Acid MeSH
- DNA Transposable Elements * MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- DNA Transposable Elements * MeSH