Most cited article - PubMed ID 30172008
Hybridization and polyploidization within the Chenopodium album aggregate analysed by means of cytological and molecular markers
The genus Chenopodium L. is characterized by its wide geographic distribution and ecological adaptability. Species such as quinoa (Chenopodium quinoa Willd.) have served as domesticated staple crops for centuries. Wild Chenopodium species exhibit diverse niche adaptations and are important genetic reservoirs for beneficial agronomic traits, including disease resistance and climate hardiness. To harness the potential of the wild taxa for crop improvement, we developed a Chenopodium pangenome through the assembly and comparative analyses of 12 Chenopodium species that encompass the eight known genome types (A-H). Six of the species are new chromosome-scale assemblies, and many are polyploids; thus, a total of 20 genomes were included in the pangenome analyses. We show that the genomes vary dramatically in size with the D genome being the smallest (∼370 Mb) and the B genome being the largest (∼700 Mb) and that genome size was correlated with independent expansions of the Copia and Gypsy LTR retrotransposon families, suggesting that transposable elements have played a critical role in the evolution of the Chenopodium genomes. We annotated a total of 33,457 pan-Chenopodium gene families, of which ∼65% were classified as shell (2% private). Phylogenetic analysis clarified the evolutionary relationships among the genome lineages, notably resolving the taxonomic placement of the F genome while highlighting the uniqueness of the A genome in the Western Hemisphere. These genomic resources are particularly important for understanding the secondary and tertiary gene pools available for the improvement of the domesticated chenopods while furthering our understanding of the evolution and complexity within the genus.
- MeSH
- Chenopodium * genetics MeSH
- Genome Size MeSH
- Phylogeny MeSH
- Genome, Plant * MeSH
- Terminal Repeat Sequences * genetics MeSH
- Evolution, Molecular * MeSH
- Retroelements MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Retroelements MeSH
BACKGROUND: Telomeres are the nucleoprotein complexes that physically cap the ends of eukaryotic chromosomes. Most plants possess Arabidopsis-type telomere sequences (TSs). In addition to terminal TSs, more diverse interstitial TSs exists in plants. Although telomeres have been sufficiently studied, the actual diversity of TSs in land plants is underestimated. RESULTS: We investigate genotypes from seven natural populations with contrasting environments of four Chenopodium species to reveal the variability in TSs by analyzing Oxford Nanopore reads. Fluorescent in situ hybridization was used to localize telomeric repeats on chromosomes. We identified a number of derivative monomers that arise in part of both terminal and interstitial telomeric arrays of a single genotype. The former presents a case of block-organized double-monomer telomers, where blocks of Arabidopsis-type TTTAGGG motifs were interspersed with blocks of derivative TTTAAAA motifs. The latter is an integral part of the satellitome with transformations specific to the inactive genome fraction. CONCLUSIONS: We suggested two alternative models for the possible formation of derivative monomers from telomeric heptamer motifs of Arabidopsis-type. It was assumed that derivatization of TSs is a ubiquitous process in the plant genome but occurrence and frequencies of derivatives may be genotype-specific. We also propose that the formation of non-canonical arrays of TSs, especially at chromosomal termini, may be a source for genomic variability in nature.
- Keywords
- Evolution, Oxford nanopore sequencing, Plant, Population, Species, Telomere,
- MeSH
- Arabidopsis * genetics MeSH
- Eukaryota MeSH
- Genotype MeSH
- In Situ Hybridization, Fluorescence MeSH
- Humans MeSH
- Telomere genetics MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Case Reports MeSH
Dormancy and heteromorphism are innate seed properties that control germination timing through adaptation to the prevailing environment. The degree of variation in dormancy depth within a seed population differs considerably depending on the genotype and maternal environment. Dormancy is therefore a key trait of annual weeds to time seedling emergence across seasons. Seed heteromorphism, the production of distinct seed morphs (in color, mass or other morphological characteristics) on the same individual plant, is considered to be a bet-hedging strategy in unpredictable environments. Heteromorphic species evolved independently in several plant families and the distinct seed morphs provide an additional degree of variation. Here we conducted a comparative morphological and molecular analysis of the dimorphic seeds (black and brown) of the Amaranthaceae weed Chenopodium album. Freshly harvested black and brown seeds differed in their dormancy and germination responses to ambient temperature. The black seed morph of seedlot #1 was dormant and 2/3rd of the seed population had non-deep physiological dormancy which was released by after-ripening (AR) or gibberellin (GA) treatment. The deeper dormancy of the remaining 1/3rd non-germinating seeds required in addition ethylene and nitrate for its release. The black seeds of seedlot #2 and the brown seed morphs of both seedlots were non-dormant with 2/3rd of the seeds germinating in the fresh mature state. The dimorphic seeds and seedlots differed in testa (outer seed coat) thickness in that thick testas of black seeds of seedlot #1 conferred coat-imposed dormancy. The dimorphic seeds and seedlots differed in their abscisic acid (ABA) and GA contents in the dry state and during imbibition in that GA biosynthesis was highest in brown seeds and ABA degradation was faster in seedlot #2. Chenopodium genes for GA and ABA metabolism were identified and their distinct transcript expression patterns were quantified in dry and imbibed C. album seeds. Phylogenetic analyses of the Amaranthaceae sequences revealed a high proportion of expanded gene families within the Chenopodium genus. The identified hormonal, molecular and morphological mechanisms and dormancy variation of the dimorphic seeds of C. album and other Amaranthaceae are compared and discussed as adaptations to variable and stressful environments.
- Keywords
- abscisic acid, coat-imposed dormancy, gibberellins, hormone metabolism, seed coat properties, seed heteromorphism, thermal time modelling, weed seed bank,
- Publication type
- Journal Article MeSH
Djulis (Chenopodium formosanum Koidz.) is a crop grown since antiquity in Taiwan. It is a BCD-genome hexaploid (2n = 6x = 54) domesticated form of lambsquarters (C. album L.) and a relative of the allotetraploid (AABB) C. quinoa. As with quinoa, djulis seed contains a complete protein profile and many nutritionally important vitamins and minerals. While still sold locally in Taiwanese markets, its traditional culinary uses are being lost as diets of younger generations change. Moreover, indigenous Taiwanese peoples who have long safeguarded djulis are losing their traditional farmlands. We used PacBio sequencing and Hi-C-based scaffolding to produce a chromosome-scale, reference-quality assembly of djulis. The final genome assembly spans 1.63 Gb in 798 scaffolds, with 97.8% of the sequence contained in 27 scaffolds representing the nine haploid chromosomes of each sub-genome of the species. Benchmarking of universal, single-copy orthologs indicated that 98.5% of the conserved orthologous genes for Viridiplantae are complete within the assembled genome, with 92.9% duplicated, as expected for a polyploid. A total of 67.8% of the assembly is repetitive, with the most common repeat being Gypsy long terminal repeat retrotransposons, which had significantly expanded in the B sub-genome. Gene annotation using Iso-Seq data from multiple tissues identified 75,056 putative gene models. Comparisons to quinoa showed strong patterns of synteny which allowed for the identification of homoeologous chromosomes, and sub-genome-specific sequences were used to assign homoeologs to each sub-genome. These results represent the first hexaploid genome assembly and the first assemblies of the C and D genomes of the Chenopodioideae subfamily.
- Keywords
- Gypsy, LTR, chromosome-scale, long-read sequencing, polyploid evolution, retrotransposon,
- MeSH
- Chenopodium * genetics MeSH
- Chromosomes, Plant genetics MeSH
- Genome, Plant MeSH
- Polyploidy MeSH
- Synteny MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
BACKGROUND: CACTA transposable elements (TEs) comprise one of the most abundant superfamilies of Class 2 (cut-and-paste) transposons. Over recent decades, CACTA elements were widely identified in species from the plant, fungi, and animal kingdoms, but sufficiently studied in the genomes of only a few model species although non-model genomes can bring additional and valuable information. It primarily concerned the genomes of species belonging to clades in the base of large taxonomic groups whose genomes, to a certain extent, can preserve relict and/or possesses specific traits. Thus, we sought to investigate the genomes of Chenopodium (Amaranthaceae, Caryophyllales) species to unravel the structural variability of CACTA elements. Caryophyllales is a separate branch of Angiosperms and until recently the diversity of CACTA elements in this clade was unknown. RESULTS: Application of the short-read genome assembly algorithm followed by analysis of detected complete CACTA elements allowed for the determination of their structural diversity in the genomes of 22 Chenopodium album aggregate species. This approach yielded knowledge regarding: (i) the coexistence of two CACTA transposons subtypes in single genome; (ii) gaining of additional protein conserved domains within the coding sequence; (iii) the presence of captured gene fragments, including key genes for flower development; and (iv)) identification of captured satDNA arrays. Wide comparative database analysis revealed that identified events are scattered through Angiosperms in different proportions. CONCLUSIONS: Our study demonstrated that while preserving the basic element structure a wide range of coding and non-coding additions to CACTA transposons occur in the genomes of C. album aggregate species. Ability to relocate additions inside genome in combination with the proposed novel functional features of structural-different CACTA elements can impact evolutionary trajectory of the host genome.
- Keywords
- CACTA transposons, Chenopodium, Flowering plants, Genome evolution, Next generation sequencing,
- Publication type
- Journal Article MeSH
The Mediterranean Basin is a significant hotspot of species diversity and endemism, with various distribution patterns and speciation mechanisms observed in its flora. High species diversity in the Mediterranean is also manifested in the monophyletic lineage of Alyssum annuals (Brassicaceae), but little is known about its origin. These species include both diploids and polyploids that grow mainly in open and disturbed sites across a wide elevational span and show contrasting distribution patterns, ranging from broadly distributed Eurasian species to narrow island endemics. Here, we investigated the evolution of European representatives of this lineage, and aimed to reconstruct their phylogeny, polyploid and genome size evolution using flow cytometric analyses, chloroplast and nuclear high- and low-copy DNA markers. The origin and early diversification of the studied Alyssum lineage could be dated back to the Late Miocene/Pliocene and were likely promoted by the onset of the Mediterranean climate, whereas most of the extant species originated during the Pleistocene. The Aegean region represents a significant diversity center, as it hosts 12 out of 16 recognized European species and comprises several (sub)endemics placed in distinct phylogenetic clades. Because several species, including the closest relatives, occur here sympatrically without apparent niche differences, we can reject simple allopatric speciation via vicariance as well as ecological speciation for most cases. Instead, we suggest scenarios of more complex speciation processes that involved repeated range shifts in response to sea-level changes and recurrent land connections and disconnections since the Pliocene. In addition, multiple polyploidization events significantly contributed to species diversity across the entire distribution range. All seven polyploids, representing both widespread species and endemics to the western or eastern Mediterranean, were inferred to be allopolyploids. Finally, the current distribution patterns have likely been affected also by the human factor (farming and grazing). This study illustrates the complexity of evolutionary and speciation processes in the Mediterranean flora.
- Keywords
- Aegean area, Alyssum, Mediterranean, allopolyploidy, annual species, endemics, phylogeny, sympatry,
- Publication type
- Journal Article MeSH
Recurrent polyploid formation and weak reproductive barriers between independent polyploid lineages generate intricate species complexes with high diversity and reticulate evolutionary history. Uncovering the evolutionary processes that formed their present-day cytotypic and genetic structure is a challenging task. We studied the species complex of Cardamine pratensis, composed of diploid endemics in the European Mediterranean and diploid-polyploid lineages more widely distributed across Europe, focusing on the poorly understood variation in Central Europe. To elucidate the evolution of Central European populations we analyzed ploidy level and genome size variation, genetic patterns inferred from microsatellite markers and target enrichment of low-copy nuclear genes (Hyb-Seq), and environmental niche differentiation. We observed almost continuous variation in chromosome numbers and genome size in C. pratensis s.str., which is caused by the co-occurrence of euploid and dysploid cytotypes, along with aneuploids, and is likely accompanied by inter-cytotype mating. We inferred that the polyploid cytotypes of C. pratensis s.str. are both of single and multiple, spatially and temporally recurrent origins. The tetraploid Cardamine majovskyi evolved at least twice in different regions by autopolyploidy from diploid Cardamine matthioli. The extensive genome size and genetic variation of Cardamine rivularis reflects differentiation induced by the geographic isolation of disjunct populations, establishment of triploids of different origins, and hybridization with sympatric C. matthioli. Geographically structured genetic lineages identified in the species under study, which are also ecologically divergent, are interpreted as descendants from different source populations in multiple glacial refugia. The postglacial range expansion was accompanied by substantial genetic admixture between the lineages of C. pratensis s.str., which is reflected by diffuse borders in their contact zones. In conclusion, we identified an interplay of diverse processes that have driven the evolution of the species studied, including allopatric and ecological divergence, hybridization, multiple polyploid origins, and genetic reshuffling caused by Pleistocene climate-induced range dynamics.
- Keywords
- Brassicaceae, environmental niche, genome size, hybridization, microsatellites, phylogeography, polyploidy, target enrichment,
- Publication type
- Journal Article MeSH
Satellite DNA (satDNA) is one of the major fractions of the eukaryotic nuclear genome. Highly variable satDNA is involved in various genome functions, and a clear link between satellites and phenotypes exists in a wide range of organisms. However, little is known about the origin and temporal dynamics of satDNA. The "library hypothesis" indicates that the rapid evolutionary changes experienced by satDNAs are mostly quantitative. Although this hypothesis has received some confirmation, a number of its aspects are still controversial. A recently developed next-generation sequencing (NGS) method allows the determination of the satDNA landscape and could shed light on unresolved issues. Here, we explore low-coverage NGS data to infer satDNA evolution in the phylogenetic context of the diploid species of the Chenopodium album aggregate. The application of the Illumina read assembly algorithm in combination with Oxford Nanopore sequencing and fluorescent in situ hybridization allowed the estimation of eight satDNA families within the studied group, six of which were newly described. The obtained set of satDNA families of different origins can be divided into several categories, namely group-specific, lineage-specific and species-specific. In the process of evolution, satDNA families can be transmitted vertically and can be eliminated over time. Moreover, transposable element-derived satDNA families may appear repeatedly in the satellitome, creating an illusion of family conservation. Thus, the obtained data refute the "library hypothesis", rather than confirming it, and in our opinion, it is more appropriate to speak about "the library of the mechanisms of origin".
- MeSH
- Chenopodium album genetics growth & development MeSH
- Diploidy * MeSH
- DNA, Plant analysis genetics MeSH
- Species Specificity MeSH
- Phylogeny MeSH
- Genome, Plant * MeSH
- Gene Library MeSH
- Evolution, Molecular * MeSH
- DNA, Satellite analysis genetics MeSH
- High-Throughput Nucleotide Sequencing MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- DNA, Plant MeSH
- DNA, Satellite MeSH
Extensive and complex links exist between transposable elements (TEs) and satellite DNA (satDNA), which are the two largest fractions of eukaryotic genome. These relationships have a crucial effect on genome structure, function and evolution. Here, we report a novel case of mutual relationships between TEs and satDNA. In the genomes of Chenopodium s. str. species, the deletion derivatives of tnp2 conserved domain of the newly discovered CACTA-like TE Jozin are involved in generating monomers of the most abundant satDNA family of the Chenopodium satellitome. The analysis of the relative positions of satDNA and different TEs utilizing assembled Illumina reads revealed several associations between satDNA arrays and the transposases of putative CACTA-like elements when an ~ 40 bp fragment of tnp2 served as the start monomer of the satDNA array. The high degree of identity of the consensus satDNA monomers of the investigated species and the tnp2 fragment (from 82.1 to 94.9%) provides evidence of the genesis of CficCl-61-40 satDNA family monomers from analogous regions of their respective parental elements. The results were confirmed via molecular genetic methods and Oxford Nanopore sequencing. The discovered phenomenon leads to the continuous replenishment of species genomes with new identical satDNA monomers, which in turn may increase species satellitomes similarity.
- Keywords
- CACTA transposons, Chenopodium, Next generation sequencing, Oxford Nanopore sequencing, Satellite DNA, Transposase,
- Publication type
- Journal Article MeSH
Chenopodium ficifoliumflowered under long days despite much lower expression ofFLOWERING LOCUS Thomolog than under short days. Frequent duplications of the FLOWERING LOCUS T (FT) gene across various taxonomic lineages resulted in FT paralogs with floral repressor function, whereas others duplicates maintained their floral-promoting role. The FT gene has been confirmed as the inducer of photoperiodic flowering in most angiosperms analyzed to date. We identified all FT homologs in the transcriptome of Chenopodium ficifolium and in the genome of Chenopodium suecicum, which are closely related to diploid progenitors of the tetraploid crop Chenopodium quinoa, and estimated their expression during photoperiodic floral induction. We found that expression of FLOWERING LOCUS T like 1 (FTL1), the ortholog of the sugar beet floral activator BvFT2, correlated with floral induction in C. suecicum and short-day C. ficifolium, but not with floral induction in C. ficifolium with accelerated flowering under long days. This C. ficifolium accession was induced to flowering without the concomitant upregulation of any FT homolog.
- Keywords
- Amaranthaceae, FLOWERING LOCUS T like genes, Floral induction, Gene expression, Photoperiod, Transcriptome,
- MeSH
- Transcriptional Activation MeSH
- Chenopodium genetics growth & development MeSH
- Photoperiod MeSH
- Flowers genetics growth & development MeSH
- Magnoliopsida genetics growth & development MeSH
- Gene Expression Regulation, Plant * MeSH
- Up-Regulation * MeSH
- Publication type
- Journal Article MeSH