Nejvíce citovaný článek - PubMed ID 30622655
Systematic survey of plant LTR-retrotransposons elucidates phylogenetic relationships of their polyprotein domains and provides a reference for element classification
Repetitive DNA contributes significantly to plant genome size, adaptation, and evolution. However, little is understood about the transcription of repeats. This is addressed here in the plant green foxtail millet (Setaria viridis). First, we used RepeatExplorer2 to calculate the genome proportion (GP) of all repeat types and compared the GP of long terminal repeat (LTR) retroelements against annotated complete and incomplete LTR retroelements (Ty1/copia and Ty3/gypsy) identified by DANTE in a whole genome assembly. We show that DANTE-identified LTR retroelements can comprise ∼0.75% of the inflorescence poly-A transcriptome and ∼0.24% of the stem ribo-depleted transcriptome. In the RNA libraries from inflorescence tissue, both LTR retroelements and DNA transposons identified by RepeatExplorer2 were highly abundant, where they may be taking advantage of the reduced epigenetic silencing in the germ line to amplify. Typically, there was a higher representation of DANTE-identified LTR retroelements in the transcriptome than RepeatExplorer2-identified LTR retroelements, potentially reflecting the transcription of elements that have insufficient genomic copy numbers to be detected by RepeatExplorer2. In contrast, for ribo-depleted libraries of stem tissues, the reverse was observed, with a higher transcriptome representation of RepeatExplorer2-identified LTR retroelements. For RepeatExplorer2-identified repeats, we show that the GP of most Ty1/copia and Ty3/gypsy families were positively correlated with their transcript proportion. In addition, guanine- and cytosine-rich repeats with high sequence similarity were also the most abundant in the transcriptome, and these likely represent young elements that are most capable of amplification due to their ability to evade epigenetic silencing.
In most studied eukaryotes, chromosomes are monocentric, with centromere activity confined to a single region. However, the rush family (Juncaceae) includes species with both monocentric (Juncus) and holocentric (Luzula) chromosomes, where centromere activity is distributed along the entire chromosome length. Here, we combine chromosome-scale genome assembly, epigenetic analysis, immuno-FISH and super-resolution microscopy to study the transition to holocentricity in Luzula sylvatica. We report repeat-based holocentromeres with an irregular distribution of features along the chromosomes. Luzula sylvatica holocentromeres are predominantly associated with two satellite DNA repeats (Lusy1 and Lusy2), while CENH3 also binds satellite-free gene-poor regions. Comparative repeat analysis suggests that Lusy1 plays a crucial role in centromere function across most Luzula species. Furthermore, synteny analysis between L. sylvatica (n = 6) and Juncus effusus (n = 21) suggests that holocentric chromosomes in Luzula could have arisen from chromosome fusions of ancestral monocentric chromosomes, accompanied by the expansion of CENH3-associated satellite repeats.
- MeSH
- centromera * genetika MeSH
- chromozomy rostlin * genetika MeSH
- DNA rostlinná genetika MeSH
- genom rostlinný MeSH
- hybridizace in situ fluorescenční MeSH
- molekulární evoluce MeSH
- repetitivní sekvence nukleových kyselin genetika MeSH
- satelitní DNA * genetika MeSH
- syntenie MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- DNA rostlinná MeSH
- satelitní DNA * MeSH
Grasspea (Lathyrus sativus L.) is an underutilised but promising legume crop with tolerance to a wide range of abiotic and biotic stress factors, and potential for climate-resilient agriculture. Despite a long history and wide geographical distribution of cultivation, only limited breeding resources are available. This paper reports a 5.96 Gbp genome assembly of grasspea genotype LS007, of which 5.03 Gbp is scaffolded into 7 pseudo-chromosomes. The assembly has a BUSCO completeness score of 99.1% and is annotated with 31719 gene models and repeat elements. This represents the most contiguous and accurate assembly of the grasspea genome to date.
- MeSH
- chromozomy rostlin * genetika MeSH
- genom rostlinný * MeSH
- Lathyrus * genetika MeSH
- Publikační typ
- časopisecké články MeSH
- dataset MeSH
BACKGROUND: The genus Pulmonaria (Boraginaceae) represents a taxonomically complex group of species in which morphological similarity contrasts with striking karyological variation. The presence of different numbers of chromosomes in the diploid state suggests multiple hybridization/polyploidization events followed by chromosome rearrangements (dysploidy). Unfortunately, the phylogenetic relationships and evolution of the genome, have not yet been elucidated. Our study focused on the P. officinalis group, the most widespread species complex, which includes two morphologically similar species that differ in chromosome number, i.e. P. obscura (2n = 14) and P. officinalis (2n = 16). Ornamental cultivars, morphologically similar to P. officinalis (garden escapes), whose origin is unclear, were also studied. Here, we present a pilot study on genome size and repeatome dynamics of these closely related species in order to gain new information on their genome and chromosome structure. RESULTS: Flow cytometry confirmed a significant difference in genome size between P. obscura and P. officinalis, corresponding to the number of chromosomes. Genome-wide repeatome analysis performed on genome skimming data showed that retrotransposons were the most abundant repeat type, with a higher proportion of Ty3/Gypsy elements, mainly represented by the Tekay lineage. Comparative analysis revealed no species-specific retrotransposons or striking differences in their copy number between the species. A new set of chromosome-specific cytogenetic markers, represented by satellite DNAs, showed that the chromosome structure in P. officinalis was more variable compared to that of P. obscura. Comparative karyotyping supported the hybrid origin of putative hybrids with 2n = 15 collected from a mixed population of both species and outlined the origin of ornamental garden escapes, presumably derived from the P. officinalis complex. CONCLUSIONS: Large-scale genome size analysis and repeatome characterization of the two morphologically similar species of the P. officinalis group improved our knowledge of the genome dynamics and differences in the karyotype structure. A new set of chromosome-specific cytogenetic landmarks was identified and used to reveal the origin of putative hybrids and ornamental cultivars morphologically similar to P. officinalis.
- Klíčová slova
- Pulmonaria, Comparative karyotyping, Genome size, Repeatome, Satellite DNA,
- MeSH
- chromozomy rostlin * genetika MeSH
- délka genomu MeSH
- fylogeneze MeSH
- genom rostlinný * MeSH
- karyotyp MeSH
- karyotypizace * MeSH
- plicník genetika MeSH
- Publikační typ
- časopisecké články MeSH
- srovnávací studie MeSH
Sex chromosomes have evolved in many plant species with separate sexes. Current plant research is shifting from examining the structure of sex chromosomes to exploring their functional aspects. New studies are progressively unveiling the specific genetic and epigenetic mechanisms responsible for shaping distinct sexes in plants. While the fundamental methods of molecular biology and genomics are generally employed for the analysis of sex chromosomes, it is often necessary to modify classical procedures not only to simplify and expedite analyses but sometimes to make them possible at all. In this review, we demonstrate how, at the level of structural and functional genetics, cytogenetics, and bioinformatics, it is essential to adapt established procedures for sex chromosome analysis.
- Klíčová slova
- Bioinformatics, chromosome dissection, cytogenetics, dioecious plants, epigenetics, functional genetics, sex chromosomes, tandem repeats, transposable elements,
- MeSH
- chromozomy rostlin * genetika MeSH
- pohlavní chromozomy * genetika MeSH
- rostliny genetika MeSH
- výpočetní biologie metody MeSH
- Publikační typ
- časopisecké články MeSH
- přehledy MeSH
Long terminal repeat (LTR) retrotransposons constitute a predominant class of repetitive DNA elements in most plant genomes. With the increasing number of sequenced plant genomes, there is an ongoing demand for computational tools facilitating efficient annotation and classification of LTR retrotransposons in plant genome assemblies. Herein, we introduce DANTE, a computational pipeline for Domain-based ANnotation of Transposable Elements, designed for sensitive detection of these elements via their conserved protein domain sequences. The identified protein domains are subsequently inputted into the DANTE_LTR pipeline to annotate complete element sequences by detecting their structural features, such as LTRs, in adjacent genomic regions. Leveraging domain sequences allows for precise classification of elements into phylogenetic lineages, offering a more granular annotation compared with coarser conventional superfamily-based classification methods. The efficiency and accuracy of this approach were evidenced via annotation of LTR retrotransposons in 93 plant genomes. Results were benchmarked against several established pipelines, showing that DANTE_LTR is capable of identifying significantly more intact LTR retrotransposons. DANTE and DANTE_LTR are provided as user-friendly Galaxy tools accessible via a public server (https://repeatexplorer-elixir.cerit-sc.cz), installable on local Galaxy instances from the Galaxy tool shed or executable from the command line.
- Publikační typ
- časopisecké články MeSH
Centromeres in most multicellular eukaryotes are composed of long arrays of repetitive DNA sequences. Interestingly, several transposable elements, including the well-known long terminal repeat centromeric retrotransposon of maize (CRM), were found to be enriched in functional centromeres marked by the centromeric histone H3 (CENH3). Here, we report a centromeric long interspersed nuclear element (LINE), Celine, in Populus species. Celine has colonized preferentially in the CENH3-associated chromatin of every poplar chromosome, with 84% of the Celine elements localized in the CENH3-binding domains. In contrast, only 51% of the CRM elements were bound to CENH3 domains in Populus trichocarpa. These results suggest different centromere targeting mechanisms employed by Celine and CRM elements. Nevertheless, the high target specificity seems to be detrimental to further amplification of the Celine elements, leading to a shorter life span and patchy distribution among plant species compared with the CRM elements. Using a phylogenetically guided approach, we were able to identify Celine-like LINE elements in tea plant (Camellia sinensis) and green ash tree (Fraxinus pennsylvanica). The centromeric localization of these Celine-like LINEs was confirmed in both species. We demonstrate that the centromere targeting property of Celine-like LINEs is of primitive origin and has been conserved among distantly related plant species.
- MeSH
- centromera * genetika metabolismus MeSH
- chromozomy rostlin * genetika MeSH
- dlouhé rozptýlené jaderné elementy genetika MeSH
- fylogeneze MeSH
- histony metabolismus genetika MeSH
- Populus * genetika MeSH
- retroelementy * genetika MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- histony MeSH
- retroelementy * MeSH
Genome size variation is a crucial aspect of plant evolution, influenced by a complex interplay of factors. Repetitive elements, which are fundamental components of genomic architecture, often play a role in genome expansion by selectively amplifying specific repeat motifs. This study focuses on Amomum, a genus in the ginger family (Zingiberaceae), known for its 4.4-fold variation in genome size. Using a robust methodology involving PhyloNet reconstruction, RepeatExplorer clustering, and repeat similarity-based phylogenetic network construction, we investigated the repeatome composition, analyzed repeat dynamics, and identified potential hybridization events within the genus. Our analysis confirmed the presence of four major infrageneric clades (A-D) within Amomum, with clades A-C exclusively comprising diploid species (2n = 48) and clade D encompassing both diploid and tetraploid species (2n = 48 and 96). We observed an increase in the repeat content within the genus, ranging from 84% to 89%, compared to outgroup species with 75% of the repeatome. The SIRE lineage of the Ty1-Copia repeat superfamily was prevalent in most analyzed ingroup genomes. We identified significant difference in repeatome structure between the basal Amomum clades (A, B, C) and the most diverged clade D. Our investigation revealed evidence of ancient hybridization events within Amomum, coinciding with a substantial proliferation of multiple repeat groups. This finding supports the hypothesis that ancient hybridization is a driving force in the genomic evolution of Amomum. Furthermore, we contextualize our findings within the broader context of genome size variations and repeatome dynamics observed across major monocot lineages. This study enhances our understanding of evolutionary processes within monocots by highlighting the crucial roles of repetitive elements in shaping genome size and suggesting the mechanisms that drive these changes.
- Klíčová slova
- 5S rDNA, Zingiberaceae, genome evolution, genome size, interspecific hybridization, phylogeny, repeatome, repetitive DNA,
- Publikační typ
- časopisecké články MeSH
The centromere is the chromosome region where microtubules attach during cell division. In contrast to monocentric chromosomes with one centromere, holocentric species usually distribute hundreds of centromere units along the entire chromatid. We assembled the chromosome-scale reference genome and analyzed the holocentromere and (epi)genome organization of the lilioid Chionographis japonica. Remarkably, each of its holocentric chromatids consists of only 7 to 11 evenly spaced megabase-sized centromere-specific histone H3-positive units. These units contain satellite arrays of 23 and 28 bp-long monomers capable of forming palindromic structures. Like monocentric species, C. japonica forms clustered centromeres in chromocenters at interphase. In addition, the large-scale eu- and heterochromatin arrangement differs between C. japonica and other known holocentric species. Finally, using polymer simulations, we model the formation of prometaphase line-like holocentromeres from interphase centromere clusters. Our findings broaden the knowledge about centromere diversity, showing that holocentricity is not restricted to species with numerous and small centromere units.
Genes for major ribosomal RNAs (rDNA) are present in multiple copies mainly organized in tandem arrays. The number and position of rDNA loci can change dynamically and their repatterning is presumably driven by other repetitive sequences. We explored a peculiar rDNA organization in several representatives of Lepidoptera with either extremely large or numerous rDNA clusters. We combined molecular cytogenetics with analyses of second- and third-generation sequencing data to show that rDNA spreads as a transcription unit and reveal association between rDNA and various repeats. Furthermore, we performed comparative long read analyses among the species with derived rDNA distribution and moths with a single rDNA locus, which is considered ancestral. Our results suggest that satellite arrays, rather than mobile elements, facilitate homology-mediated spread of rDNA via either integration of extrachromosomal rDNA circles or ectopic recombination. The latter arguably better explains preferential spread of rDNA into terminal regions of lepidopteran chromosomes as efficiency of ectopic recombination depends on the proximity of homologous sequences to telomeres.
- Klíčová slova
- Lepidoptera, major ribosomal RNA genes, mobile elements, satellite,
- MeSH
- chromozomy MeSH
- můry * genetika MeSH
- repetitivní sekvence nukleových kyselin * MeSH
- ribozomální DNA genetika MeSH
- zvířata MeSH
- Check Tag
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- ribozomální DNA MeSH