Most cited article - PubMed ID 26606051
In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae
Repetitive DNA contributes significantly to plant genome size, adaptation, and evolution. However, little is understood about the transcription of repeats. This is addressed here in the plant green foxtail millet (Setaria viridis). First, we used RepeatExplorer2 to calculate the genome proportion (GP) of all repeat types and compared the GP of long terminal repeat (LTR) retroelements against annotated complete and incomplete LTR retroelements (Ty1/copia and Ty3/gypsy) identified by DANTE in a whole genome assembly. We show that DANTE-identified LTR retroelements can comprise ∼0.75% of the inflorescence poly-A transcriptome and ∼0.24% of the stem ribo-depleted transcriptome. In the RNA libraries from inflorescence tissue, both LTR retroelements and DNA transposons identified by RepeatExplorer2 were highly abundant, where they may be taking advantage of the reduced epigenetic silencing in the germ line to amplify. Typically, there was a higher representation of DANTE-identified LTR retroelements in the transcriptome than RepeatExplorer2-identified LTR retroelements, potentially reflecting the transcription of elements that have insufficient genomic copy numbers to be detected by RepeatExplorer2. In contrast, for ribo-depleted libraries of stem tissues, the reverse was observed, with a higher transcriptome representation of RepeatExplorer2-identified LTR retroelements. For RepeatExplorer2-identified repeats, we show that the GP of most Ty1/copia and Ty3/gypsy families were positively correlated with their transcript proportion. In addition, guanine- and cytosine-rich repeats with high sequence similarity were also the most abundant in the transcriptome, and these likely represent young elements that are most capable of amplification due to their ability to evade epigenetic silencing.
- MeSH
- Transcription, Genetic MeSH
- Genome, Plant MeSH
- Terminal Repeat Sequences * MeSH
- Gene Expression Regulation, Plant MeSH
- Retroelements * MeSH
- Setaria Plant * genetics MeSH
- Transcriptome MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Retroelements * MeSH
Repetitive elements can cause large-scale chromosomal rearrangements, for example through ectopic recombination, potentially promoting reproductive isolation and speciation. Species with holocentric chromosomes, that lack a localized centromere, might be more likely to retain chromosomal rearrangements that lead to karyotype changes such as fusions and fissions. This is because chromosome segregation during cell division should be less affected than in organisms with a localized centromere. The relationships between repetitive elements and chromosomal rearrangements and how they may translate to patterns of speciation in holocentric organisms are though poorly understood. Here, we use a reference-free approach based on low-coverage short-read sequencing data to characterize the repeat landscape of two independently evolved holocentric groups: Erebia butterflies and Carex sedges. We consider both micro- and macro-evolutionary scales to investigate the repeat landscape differentiation between Erebia populations and the association between repeats and karyotype changes in a phylogenetic framework for both Erebia and Carex. At a micro-evolutionary scale, we found population differentiation in repeat landscape that increases with overall intraspecific genetic differentiation among four Erebia species. At a macro-evolutionary scale, we found indications for an association between repetitive elements and karyotype changes along both Erebia and Carex phylogenies. Altogether, our results suggest that repetitive elements are associated with the level of population differentiation and chromosomal rearrangements in holocentric clades and therefore likely play a role in adaptation and potentially species diversification.
- Keywords
- Carex, Erebia, Lepidoptera, speciation, transposable elements,
- MeSH
- Biological Evolution MeSH
- Carex Plant genetics MeSH
- Phylogeny * MeSH
- Karyotype * MeSH
- Evolution, Molecular MeSH
- Butterflies * genetics MeSH
- Genetics, Population MeSH
- Repetitive Sequences, Nucleic Acid genetics MeSH
- Genetic Speciation MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
Grasspea (Lathyrus sativus L.) is an underutilised but promising legume crop with tolerance to a wide range of abiotic and biotic stress factors, and potential for climate-resilient agriculture. Despite a long history and wide geographical distribution of cultivation, only limited breeding resources are available. This paper reports a 5.96 Gbp genome assembly of grasspea genotype LS007, of which 5.03 Gbp is scaffolded into 7 pseudo-chromosomes. The assembly has a BUSCO completeness score of 99.1% and is annotated with 31719 gene models and repeat elements. This represents the most contiguous and accurate assembly of the grasspea genome to date.
- MeSH
- Chromosomes, Plant * genetics MeSH
- Genome, Plant * MeSH
- Lathyrus * genetics MeSH
- Publication type
- Journal Article MeSH
- Dataset MeSH
BACKGROUND: The genus Pulmonaria (Boraginaceae) represents a taxonomically complex group of species in which morphological similarity contrasts with striking karyological variation. The presence of different numbers of chromosomes in the diploid state suggests multiple hybridization/polyploidization events followed by chromosome rearrangements (dysploidy). Unfortunately, the phylogenetic relationships and evolution of the genome, have not yet been elucidated. Our study focused on the P. officinalis group, the most widespread species complex, which includes two morphologically similar species that differ in chromosome number, i.e. P. obscura (2n = 14) and P. officinalis (2n = 16). Ornamental cultivars, morphologically similar to P. officinalis (garden escapes), whose origin is unclear, were also studied. Here, we present a pilot study on genome size and repeatome dynamics of these closely related species in order to gain new information on their genome and chromosome structure. RESULTS: Flow cytometry confirmed a significant difference in genome size between P. obscura and P. officinalis, corresponding to the number of chromosomes. Genome-wide repeatome analysis performed on genome skimming data showed that retrotransposons were the most abundant repeat type, with a higher proportion of Ty3/Gypsy elements, mainly represented by the Tekay lineage. Comparative analysis revealed no species-specific retrotransposons or striking differences in their copy number between the species. A new set of chromosome-specific cytogenetic markers, represented by satellite DNAs, showed that the chromosome structure in P. officinalis was more variable compared to that of P. obscura. Comparative karyotyping supported the hybrid origin of putative hybrids with 2n = 15 collected from a mixed population of both species and outlined the origin of ornamental garden escapes, presumably derived from the P. officinalis complex. CONCLUSIONS: Large-scale genome size analysis and repeatome characterization of the two morphologically similar species of the P. officinalis group improved our knowledge of the genome dynamics and differences in the karyotype structure. A new set of chromosome-specific cytogenetic landmarks was identified and used to reveal the origin of putative hybrids and ornamental cultivars morphologically similar to P. officinalis.
- Keywords
- Pulmonaria, Comparative karyotyping, Genome size, Repeatome, Satellite DNA,
- MeSH
- Chromosomes, Plant * genetics MeSH
- Genome Size MeSH
- Phylogeny MeSH
- Genome, Plant * MeSH
- Karyotype MeSH
- Karyotyping * MeSH
- Pulmonaria genetics MeSH
- Publication type
- Journal Article MeSH
- Comparative Study MeSH
Odd ploidy-level cytotypes in sexually reproducing species are considered a dead end due to absent or reduced fertility. If sterility is only partial, however, their contribution to the population gene pool can be augmented by longevity and clonal growth. To test this, we investigated the cytotype origin and spatial pattern, and pollen viability in three relict shrub species of the genus Daphne (Thymelaeaceae Juss.) in central Europe. Daphne cneorum subsp. cneorum is a widespread European species that has a broad ecological amplitude, whereas D. cneorum subsp. arbusculoides and D. arbuscula are narrow endemics of the western Pannonian Plain and the Western Carpathians, respectively. Our study confirmed that all three taxa are diploid. However, of more than a thousand analysed individuals of D. cneorum subsp. cneorum, five in four different populations were triploid. Our data indicate that these triploids most likely originate from recurrent autopolyploidization events caused by the fusion of reduced and unreduced gametes. High pollen viability was observed in all three taxa and in both diploid and triploid cytotypes, ranging from 65 to 100 %. Our study highlights the significant role of odd ploidy-level cytotypes in interploidy gene flow, calling for more research into their reproduction, genetic variability, and overall fitness. Interestingly, while the endemic D. arbuscula differs from D. cneorum based on genetic and genome size data, D. cneorum subsp. arbusculoides was indistinguishable from D. cneorum subsp. cneorum. However, our study reveals that the subspecies differ in the number of flowers per inflorescence. This is the first comprehensive cytogeographic study of this intriguing genus at a regional scale, and in spite of its karyological stability, it contributes to our understanding of genomic evolution in plant species with a wide ecological amplitude.
- Keywords
- Carpathians, Daphne, ITS, Pannonian Basin, endemics, genome size stasis, pollen fertility, polyploidy, relicts, triploids,
- Publication type
- Journal Article MeSH
Increasing the proportion of locally produced plant protein in currently meat-rich diets could substantially reduce greenhouse gas emissions and loss of biodiversity1. However, plant protein production is hampered by the lack of a cool-season legume equivalent to soybean in agronomic value2. Faba bean (Vicia faba L.) has a high yield potential and is well suited for cultivation in temperate regions, but genomic resources are scarce. Here, we report a high-quality chromosome-scale assembly of the faba bean genome and show that it has expanded to a massive 13 Gb in size through an imbalance between the rates of amplification and elimination of retrotransposons and satellite repeats. Genes and recombination events are evenly dispersed across chromosomes and the gene space is remarkably compact considering the genome size, although with substantial copy number variation driven by tandem duplication. Demonstrating practical application of the genome sequence, we develop a targeted genotyping assay and use high-resolution genome-wide association analysis to dissect the genetic basis of seed size and hilum colour. The resources presented constitute a genomics-based breeding platform for faba bean, enabling breeders and geneticists to accelerate the improvement of sustainable protein production across the Mediterranean, subtropical and northern temperate agroecological zones.
- MeSH
- Gene Amplification genetics MeSH
- Genome-Wide Association Study MeSH
- Chromosomes, Plant genetics MeSH
- Diploidy * MeSH
- Genetic Variation * genetics MeSH
- Genome, Plant * genetics MeSH
- Genomics * MeSH
- Recombination, Genetic MeSH
- Retroelements genetics MeSH
- Genes, Plant genetics MeSH
- Plant Proteins * genetics metabolism MeSH
- DNA, Satellite genetics MeSH
- Seeds anatomy & histology genetics MeSH
- Plant Breeding * methods MeSH
- DNA Copy Number Variations genetics MeSH
- Vicia faba * anatomy & histology genetics metabolism MeSH
- Crops, Agricultural * genetics metabolism MeSH
- Geography MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Retroelements MeSH
- Plant Proteins * MeSH
- DNA, Satellite MeSH
Grass pea (Lathyrus sativus L.) is a rich source of protein cultivated as an insurance crop in Ethiopia, Eritrea, India, Bangladesh, and Nepal. Its resilience to both drought and flooding makes it a promising crop for ensuring food security in a changing climate. The lack of genetic resources and the crop's association with the disease neurolathyrism have limited the cultivation of grass pea. Here, we present an annotated, long read-based assembly of the 6.5 Gbp L. sativus genome. Using this genome sequence, we have elucidated the biosynthetic pathway leading to the formation of the neurotoxin, β-L-oxalyl-2,3-diaminopropionic acid (β-L-ODAP). The final reaction of the pathway depends on an interaction between L. sativus acyl-activating enzyme 3 (LsAAE3) and a BAHD-acyltransferase (LsBOS) that form a metabolon activated by CoA to produce β-L-ODAP. This provides valuable insight into the best approaches for developing varieties which produce substantially less toxin.
- MeSH
- Amino Acids, Diamino * metabolism MeSH
- Genomics MeSH
- Lathyrus * genetics metabolism MeSH
- Neurotoxins metabolism MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Amino Acids, Diamino * MeSH
- Neurotoxins MeSH
- oxalyldiaminopropionic acid MeSH Browser
Centromeres in the legume genera Pisum and Lathyrus exhibit unique morphological characteristics, including extended primary constrictions and multiple separate domains of centromeric chromatin. These so-called metapolycentromeres resemble an intermediate form between monocentric and holocentric types, and therefore provide a great opportunity for studying the transitions between different types of centromere organizations. However, because of the exceedingly large and highly repetitive nature of metapolycentromeres, highly contiguous assemblies needed for these studies are lacking. Here, we report on the assembly and analysis of a 177.6 Mb region of pea (Pisum sativum) chromosome 6, including the 81.6 Mb centromere region (CEN6) and adjacent chromosome arms. Genes, DNA methylation profiles, and most of the repeats were uniformly distributed within the centromere, and their densities in CEN6 and chromosome arms were similar. The exception was an accumulation of satellite DNA in CEN6, where it formed multiple arrays up to 2 Mb in length. Centromeric chromatin, characterized by the presence of the CENH3 protein, was predominantly associated with arrays of three different satellite repeats; however, five other satellites present in CEN6 lacked CENH3. The presence of CENH3 chromatin was found to determine the spatial distribution of the respective satellites during the cell cycle. Finally, oligo-FISH painting experiments, performed using probes specifically designed to label the genomic regions corresponding to CEN6 in Pisum, Lathyrus, and Vicia species, revealed that metapolycentromeres evolved via the expansion of centromeric chromatin into neighboring chromosomal regions and the accumulation of novel satellite repeats. However, in some of these species, centromere evolution also involved chromosomal translocations and centromere repositioning.
- MeSH
- Centromere genetics MeSH
- Chromatin genetics MeSH
- Pisum sativum * genetics MeSH
- Humans MeSH
- Chromosomes, Human, Pair 6 * MeSH
- DNA, Satellite genetics MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Chromatin MeSH
- DNA, Satellite MeSH
Trifolium medium L. is a wild polyploid relative of the agriculturally important red clover that possesses traits promising for breeding purposes. To date, T. medium also remains the only clover species with which agriculturally important red clover has successfully been hybridized. Even though allopolyploid origin has previously been suggested, little has in fact been known about the T. medium karyotype and its origin. We researched T. medium and related karyotypes using comparative cytogenomic methods, such as fluorescent in situ hybridization (FISH) and RepeatExplorer cluster analysis. The results indicate an exceptional karyotype diversity regarding numbers and mutual positions of 5S and 26S rDNA loci and centromeric repeats in populations of T. medium ecotypes and varieties. The observed variability among T. medium ecotypes and varieties suggests current karyotype instability that can be attributed to ever-ongoing battle between satellite DNA together with genomic changes and rearrangements enhanced by post-hybridization events. Comparative cytogenomic analyses of a T. medium hexaploid variety and diploid relatives revealed stable karyotypes with a possible case of chromosomal rearrangement. Moreover, the results provided evidence of T. medium having autopolyploid origin.
- Keywords
- 26S rDNA, 5S rDNA, centromeric repeat, clover, fluorescent in situ hybridization, polyploidy, zigzag clover,
- Publication type
- Journal Article MeSH
Genome size varies 2400-fold across plants, influencing their evolution through changes in cell size and cell division rates which impact plants' environmental stress tolerance. Repetitive element expansion explains much genome size diversity, and the processes structuring repeat 'communities' are analogous to those structuring ecological communities. However, which environmental stressors influence repeat community dynamics has not yet been examined from an ecological perspective. We measured genome size and leveraged climatic data for 91% of genera within the ecologically diverse palm family (Arecaceae). We then generated genomic repeat profiles for 141 palm species, and analysed repeats using phylogenetically informed linear models to explore relationships between repeat dynamics and environmental factors. We show that palm genome size and repeat 'community' composition are best explained by aridity. Specifically, Ty3-gypsy and TIR elements were more abundant in palm species from wetter environments, which generally had larger genomes, suggesting amplification. By contrast, Ty1-copia and LINE elements were more abundant in drier environments. Our results suggest that water stress inhibits repeat expansion through selection on upper genome size limits. However, elements that may associate with stress-response genes (e.g. Ty1-copia) have amplified in arid-adapted palm species. Overall, we provide novel evidence of climate influencing the assembly of repeat 'communities'.
- Keywords
- Arecaceae (palms), adaptation, ecology, genome size, phylogenetic regression, plant evolution, trait evolution, transposable elements,
- MeSH
- Arecaceae * genetics MeSH
- Genome Size MeSH
- Phylogeny MeSH
- Genome, Plant MeSH
- Evolution, Molecular MeSH
- Retroelements * MeSH
- Sequence Analysis, DNA MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Retroelements * MeSH