Nejvíce citovaný článek - PubMed ID 33077876
Repeat-sequence turnover shifts fundamentally in species with large genomes
Repetitive DNA contributes significantly to plant genome size, adaptation, and evolution. However, little is understood about the transcription of repeats. This is addressed here in the plant green foxtail millet (Setaria viridis). First, we used RepeatExplorer2 to calculate the genome proportion (GP) of all repeat types and compared the GP of long terminal repeat (LTR) retroelements against annotated complete and incomplete LTR retroelements (Ty1/copia and Ty3/gypsy) identified by DANTE in a whole genome assembly. We show that DANTE-identified LTR retroelements can comprise ∼0.75% of the inflorescence poly-A transcriptome and ∼0.24% of the stem ribo-depleted transcriptome. In the RNA libraries from inflorescence tissue, both LTR retroelements and DNA transposons identified by RepeatExplorer2 were highly abundant, where they may be taking advantage of the reduced epigenetic silencing in the germ line to amplify. Typically, there was a higher representation of DANTE-identified LTR retroelements in the transcriptome than RepeatExplorer2-identified LTR retroelements, potentially reflecting the transcription of elements that have insufficient genomic copy numbers to be detected by RepeatExplorer2. In contrast, for ribo-depleted libraries of stem tissues, the reverse was observed, with a higher transcriptome representation of RepeatExplorer2-identified LTR retroelements. For RepeatExplorer2-identified repeats, we show that the GP of most Ty1/copia and Ty3/gypsy families were positively correlated with their transcript proportion. In addition, guanine- and cytosine-rich repeats with high sequence similarity were also the most abundant in the transcriptome, and these likely represent young elements that are most capable of amplification due to their ability to evade epigenetic silencing.
Long terminal repeat (LTR) retrotransposons constitute a predominant class of repetitive DNA elements in most plant genomes. With the increasing number of sequenced plant genomes, there is an ongoing demand for computational tools facilitating efficient annotation and classification of LTR retrotransposons in plant genome assemblies. Herein, we introduce DANTE, a computational pipeline for Domain-based ANnotation of Transposable Elements, designed for sensitive detection of these elements via their conserved protein domain sequences. The identified protein domains are subsequently inputted into the DANTE_LTR pipeline to annotate complete element sequences by detecting their structural features, such as LTRs, in adjacent genomic regions. Leveraging domain sequences allows for precise classification of elements into phylogenetic lineages, offering a more granular annotation compared with coarser conventional superfamily-based classification methods. The efficiency and accuracy of this approach were evidenced via annotation of LTR retrotransposons in 93 plant genomes. Results were benchmarked against several established pipelines, showing that DANTE_LTR is capable of identifying significantly more intact LTR retrotransposons. DANTE and DANTE_LTR are provided as user-friendly Galaxy tools accessible via a public server (https://repeatexplorer-elixir.cerit-sc.cz), installable on local Galaxy instances from the Galaxy tool shed or executable from the command line.
- Publikační typ
- časopisecké články MeSH
The genus Vigna (Leguminosae) comprises about 150 species grouped into five subgenera. The present study aimed to improve the understanding of karyotype diversity and evolution in Vigna, using new and previously published data through different cytogenetic and DNA content approaches. In the Vigna subgenera, we observed a random distribution of rDNA patterns. The 35S rDNA varied in position, from terminal to proximal, and in number, ranging from one (V. aconitifolia, V. subg. Ceratotropis) to seven pairs (V. unguiculata subsp. unguiculata, V. subg. Vigna). On the other hand, the number of 5S rDNA was conserved (one or two pairs), except for V. radiata (V. subg. Ceratotropis), which had three pairs. Genome size was relatively conserved within the genus, ranging from 1C = 0.43 to 0.70 pg in V. oblongifolia and V. unguiculata subsp. unguiculata, respectively, both belonging to V. subg. Vigna. However, we observed a positive correlation between DNA content and the number of 35S rDNA sites. In addition, data from chromosome-specific BAC-FISH suggest that the ancestral 35S rDNA locus is conserved on chromosome 6 within Vigna. Considering the rapid diversification in the number and position of rDNA sites, such conservation is surprising and suggests that additional sites may have spread out from this ancestral locus.
- Klíčová slova
- Vigna, DNA content, FISH, Karyotype evolution, Molecular cytogenetics, rDNA sites,
- MeSH
- chromozomy rostlin genetika MeSH
- DNA rostlinná genetika MeSH
- Fabaceae genetika MeSH
- fylogeneze MeSH
- genetická variace MeSH
- karyotyp MeSH
- ribozomální DNA genetika MeSH
- vigna * genetika MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- DNA rostlinná MeSH
- ribozomální DNA MeSH
Although both are salient features of genomes, at first glance ribosomal DNAs and transposable elements are genetic elements with not much in common: whereas ribosomal DNAs are mainly viewed as housekeeping genes that uphold all prime genome functions, transposable elements are generally portrayed as selfish and disruptive. These opposing characteristics are also mirrored in other attributes: organization in tandem (ribosomal DNAs) versus organization in a dispersed manner (transposable elements); evolution in a concerted manner (ribosomal DNAs) versus evolution by diversification (transposable elements); and activity that prolongs genomic stability (ribosomal DNAs) versus activity that shortens it (transposable elements). Re-visiting relevant instances in which ribosomal DNA-transposable element interactions have been reported, we note that both repeat types share at least four structural and functional hallmarks: (1) they are repetitive DNAs that shape genomes in evolutionary timescales, (2) they exchange structural motifs and can enter co-evolution processes, (3) they are tightly controlled genomic stress sensors playing key roles in senescence/aging, and (4) they share common epigenetic marks such as DNA methylation and histone modification. Here, we give an overview of the structural, functional, and evolutionary characteristics of both ribosomal DNAs and transposable elements, discuss their roles and interactions, and highlight trends and future directions as we move forward in understanding ribosomal DNA-transposable element associations.
- Klíčová slova
- concerted evolution, genome size, genome stability, homogenization, housekeeping genes, long-read sequencing, molecular cytogenetics, recombination, repetitive DNA, ribosomal DNA, transposable elements, transposition,
- MeSH
- cytogenetické vyšetření MeSH
- genomika * MeSH
- metylace DNA MeSH
- molekulární evoluce MeSH
- ribozomální DNA MeSH
- transpozibilní elementy DNA * MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- ribozomální DNA MeSH
- transpozibilní elementy DNA * MeSH
INTRODUCTION: Meiotic recombination is one of the most important processes of evolution and adaptation to environmental conditions. Even though there is substantial knowledge about proteins involved in the process, targeting specific DNA loci by the recombination machinery is not well understood. OBJECTIVES: This study aims to investigate a wheat recombination hotspot (H1) in comparison with a "regular" recombination site (Rec7) on the sequence and epigenetic level in conditions with functional and non-functional Ph1 locus. METHODS: The DNA sequence, methylation pattern, and recombination frequency were analyzed for the H1 and Rec7 in three mapping populations derived by crossing introgressive wheat line 8.1 with cv. Chinese Spring (with Ph1 and ph1 alleles) and cv. Tähti. RESULTS: The H1 and Rec7 loci are 1.586 kb and 2.538 kb long, respectively. High-density mapping allowed to delimit the Rec7 and H1 to 19 and 574 bp and 593 and 571 bp CO sites, respectively. A new method (ddPing) allowed screening recombination frequency in almost 66 thousand gametes. The screening revealed a 5.94-fold higher recombination frequency at the H1 compared to the Rec7. The H1 was also found out of the Ph1 control, similarly as gamete distortion. The recombination was strongly affected by larger genomic rearrangements but not by the SNP proximity. Moreover, chromatin markers for open chromatin and DNA hypomethylation were found associated with crossover occurrence except for the CHH methylation. CONCLUSION: Our results, for the first time, allowed study of wheat recombination directly on sequence, shed new light on chromatin landmarks associated with particular recombination sites, and deepened knowledge about role of the Ph1 locus in control of wheat recombination processes. The results are suggesting more than one recombination control pathway. Understanding this phenomenon may become a base for more efficient wheat genome manipulation, gene pool enrichment, breeding, and study processes of recombination itself.
- Klíčová slova
- Crossovers, DNA methylation, Hotspot, Ph1 locus, Recombination, Wheat,
- MeSH
- chromatin * genetika MeSH
- chromozomy MeSH
- DNA MeSH
- pšenice * genetika MeSH
- šlechtění rostlin MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- chromatin * MeSH
- DNA MeSH
The classical model of concerted evolution states that hundreds to thousands of ribosomal DNA (rDNA) units undergo homogenization, making the multiple copies of the individual units more uniform across the genome than would be expected given mutation frequencies and gene redundancy. While the universality of this over 50-year-old model has been confirmed in a range of organisms, advanced high throughput sequencing techniques have also revealed that rDNA homogenization in many organisms is partial and, in rare cases, even apparently failing. The potential underpinning processes leading to unexpected intragenomic variation have been discussed in a number of studies, but a comprehensive understanding remains to be determined. In this work, we summarize information on variation or polymorphisms in rDNAs across a wide range of taxa amongst animals, fungi, plants, and protists. We discuss the definition and description of concerted evolution and describe whether incomplete concerted evolution of rDNAs predominantly affects coding or non-coding regions of rDNA units and if it leads to the formation of pseudogenes or not. We also discuss the factors contributing to rDNA variation, such as interspecific hybridization, meiotic cycles, rDNA expression status, genome size, and the activity of effector genes involved in genetic recombination, epigenetic modifications, and DNA editing. Finally, we argue that a combination of approaches is needed to target genetic and epigenetic phenomena influencing incomplete concerted evolution, to give a comprehensive understanding of the evolution and functional consequences of intragenomic variation in rDNA.
- MeSH
- fylogeneze MeSH
- genetická variace * MeSH
- houby genetika MeSH
- molekulární evoluce MeSH
- mutace MeSH
- polymorfismus genetický * MeSH
- ribozomální DNA genetika MeSH
- zvířata MeSH
- Check Tag
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- přehledy MeSH
- Názvy látek
- ribozomální DNA MeSH
The W chromosome of Lepidoptera is typically gene-poor, repeat-rich and composed of heterochromatin. Pioneering studies investigating this chromosome reported an abundance of mobile elements. However, the actual composition of the W chromosome varies greatly between species, as repeatedly demonstrated by comparative genomic hybridization (CGH) or fluorescence in situ hybridization (FISH). Here we present an analysis of repeats on the W chromosome in the willow beauty, Peribatodes rhomboidaria (Geometridae), a species in which CGH predicted an abundance of W-enriched or W-specific sequences. Indeed, comparative analysis of male and female genomes using RepeatExplorer identified ten putative W chromosome-enriched repeats, most of which are LTR or LINE mobile elements. We analysed the two most abundant: PRW LINE-like and PRW Bel-Pao. The results of FISH mapping and bioinformatic analysis confirmed their enrichment on the W chromosome, supporting the hypothesis that mobile elements are the driving force of W chromosome differentiation in Lepidoptera. As the W chromosome is highly underrepresented in chromosome-level genome assemblies of Lepidoptera, this recently introduced approach, combining bioinformatic comparative genome analysis with molecular cytogenetics, provides an elegant tool for studying this elusive and rapidly evolving part of the genome.
- MeSH
- hybridizace in situ fluorescenční MeSH
- můry * genetika MeSH
- pohlavní chromozomy genetika MeSH
- retroelementy genetika MeSH
- Salix * genetika MeSH
- srovnávací genomová hybridizace MeSH
- zvířata MeSH
- Check Tag
- mužské pohlaví MeSH
- ženské pohlaví MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- retroelementy MeSH
Simple telomeric repeats composed of six to seven iterating nucleotide units are important sequences typically found at the ends of chromosomes. Here we analyzed their abundance and homogeneity in 42 gymnosperm (29 newly sequenced), 29 angiosperm (one newly sequenced), and eight bryophytes using bioinformatics, conventional cytogenetic and molecular biology approaches to explore their diversity across land plants. We found more than 10 000-fold variation in the amounts of telomeric repeats among the investigated taxa. Repeat abundance was positively correlated with increasing intragenomic sequence heterogeneity and occurrence at non-telomeric positions, but there was no correlation with genome size. The highest abundance/heterogeneity was found in the gymnosperm genus Cycas (Cycadaceae), in which megabase-sized blocks of telomeric repeats (i.e., billions of copies) were identified. Fluorescent in situ hybridization experiments using variant-specific probes revealed canonical Arabidopsis-type telomeric TTTAGGG repeats at chromosome ends, while pericentromeric blocks comprised at least four major telomeric variants with decreasing abundance: TTTAGGG>TTCAGGG >TTTAAGG>TTCAAGG. Such a diversity of repeats was not found in the sister cycad family Zamiaceae or in any other species analyzed. Using immunocytochemistry, we showed that the pericentromeric blocks of telomeric repeats overlapped with histone H3 serine 10 phosphorylation signals. We show that species of Cycas have amplified their telomeric repeats in centromeric and telomeric positions on telocentric chromosomes to extraordinary high levels. The ancestral chromosome number reconstruction suggests their occurrence is unlikely to be the product of ancient Robertsonian chromosome fusions. We speculate as to how the observed chromosome dynamics may be associated with the diversification of cycads.
- Klíčová slova
- Cycadaceae, centromeres, chromosome rearrangements, epigenetics, genome evolution, gymnosperms, telomeres,
- MeSH
- centromera genetika MeSH
- cykasy * genetika MeSH
- hybridizace in situ fluorescenční MeSH
- Magnoliopsida * genetika MeSH
- telomery genetika MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Genome size varies 2400-fold across plants, influencing their evolution through changes in cell size and cell division rates which impact plants' environmental stress tolerance. Repetitive element expansion explains much genome size diversity, and the processes structuring repeat 'communities' are analogous to those structuring ecological communities. However, which environmental stressors influence repeat community dynamics has not yet been examined from an ecological perspective. We measured genome size and leveraged climatic data for 91% of genera within the ecologically diverse palm family (Arecaceae). We then generated genomic repeat profiles for 141 palm species, and analysed repeats using phylogenetically informed linear models to explore relationships between repeat dynamics and environmental factors. We show that palm genome size and repeat 'community' composition are best explained by aridity. Specifically, Ty3-gypsy and TIR elements were more abundant in palm species from wetter environments, which generally had larger genomes, suggesting amplification. By contrast, Ty1-copia and LINE elements were more abundant in drier environments. Our results suggest that water stress inhibits repeat expansion through selection on upper genome size limits. However, elements that may associate with stress-response genes (e.g. Ty1-copia) have amplified in arid-adapted palm species. Overall, we provide novel evidence of climate influencing the assembly of repeat 'communities'.
- Klíčová slova
- Arecaceae (palms), adaptation, ecology, genome size, phylogenetic regression, plant evolution, trait evolution, transposable elements,
- MeSH
- Arecaceae * genetika MeSH
- délka genomu MeSH
- fylogeneze MeSH
- genom rostlinný MeSH
- molekulární evoluce MeSH
- retroelementy * MeSH
- sekvenční analýza DNA MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- retroelementy * MeSH
Telomeres are essential structures formed from satellite DNA repeats at the ends of chromosomes in most eukaryotes. Satellite DNA repeat sequences are useful markers for karyotyping, but have a more enigmatic role in the eukaryotic cell. Much work has been done to investigate the structure and arrangement of repetitive DNA elements in classical models with implications for species evolution. Still more is needed until there is a complete picture of the biological function of DNA satellite sequences, particularly when considering non-model organisms. Celebrating Gregor Mendel's anniversary by going to the roots, this review is designed to inspire and aid new research into telomeres and satellites with a particular focus on non-model organisms and accessible experimental and in silico methods that do not require specialized equipment or expensive materials. We describe how to identify telomere (and satellite) repeats giving many examples of published (and some unpublished) data from these techniques to illustrate the principles behind the experiments. We also present advice on how to perform and analyse such experiments, including details of common pitfalls. Our examples are a selection of recent developments and underexplored areas of research from the past. As a nod to Mendel's early work, we use many examples from plants and insects, especially as much recent work has expanded beyond the human and yeast models traditional in telomere research. We give a general introduction to the accepted knowledge of telomere and satellite systems and include references to specialized reviews for the interested reader.
- Klíčová slova
- FISH, NGS, TRAP, eukaryotic tree of life, interstitial telomere sequences, retroelements, satellite, subtelomere structure, telomerase RNA, telomere evolution,
- MeSH
- DNA MeSH
- lidé MeSH
- repetitivní sekvence nukleových kyselin MeSH
- satelitní DNA * MeSH
- sekvence nukleotidů MeSH
- telomery * genetika MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- přehledy MeSH
- Názvy látek
- DNA MeSH
- satelitní DNA * MeSH