Most cited article - PubMed ID 18842687
The dynamic ups and downs of genome size evolution in Brassicaceae
The establishment of Arabidopsis as the most important plant model has also brought other crucifer species into the spotlight of comparative research. While the genus Capsella has become a prominent crucifer model system, its closest relative has been overlooked. The unispecific genus Catolobus is native to temperate Eurasian woodlands, from eastern Europe to the Russian Far East. Here, we analyzed chromosome number, genome structure, intraspecific genetic variation, and habitat suitability of Catolobus pendulus throughout its range. Unexpectedly, all analyzed populations were hypotetraploid (2n = 30, ~330 Mb). Comparative cytogenomic analysis revealed that the Catolobus genome arose by a whole-genome duplication in a diploid genome resembling Ancestral Crucifer Karyotype (ACK, n = 8). In contrast to the much younger Capsella allotetraploid genomes, the presumably autotetraploid Catolobus genome (2n = 32) arose early after the Catolobus/Capsella divergence. Since its origin, the tetraploid Catolobus genome has undergone chromosomal rediploidization, including a reduction in chromosome number from 2n = 32 to 2n = 30. Diploidization occurred through end-to-end chromosome fusion and other chromosomal rearrangements affecting a total of six of 16 ancestral chromosomes. The hypotetraploid Catolobus cytotype expanded toward its present range, accompanied by some longitudinal genetic differentiation. The sister relationship between Catolobus and Capsella allows comparative studies of tetraploid genomes of contrasting ages and different degrees of genome diploidization.
- Keywords
- Arabidopsis-related model systems, Brassicaceae, Cruciferae, Hyb-Seq, chromosome painting, diploidization, polyploidy, whole-genome duplication (WGD),
- Publication type
- Journal Article MeSH
The Balkan Peninsula played an important role in the evolution of many Mediterranean plants and served as a major source for post-Pleistocene colonisation of central and northern Europe. Its complex geo-climatic history and environmental heterogeneity significantly influenced spatiotemporal diversification and resulted in intricate phylogeographic patterns. To explore the evolutionary dynamics and phylogeographic patterns within the widespread eastern Mediterranean and central European species Aurinia saxatilis, we used a combination of phylogenomic (restriction-site associated DNA sequencing, RADseq) and phylogenetic (sequences of the plastid marker ndhF) data as well as species distribution models generated for the present and the Last Glacial Maximum (LGM). The inferred phylogenies retrieved three main geographically distinct lineages. The southern lineage is restricted to the eastern Mediterranean, where it is distributed throughout the Aegean area, the southern Balkan Peninsula, and the southern Apennine Peninsula, and corresponds to the species main distribution area during the LGM. The eastern lineage extends from the eastern Balkan Peninsula over the Carpathians to central Europe, while the central lineage occupies the central Balkan Peninsula. Molecular dating places the divergence among all the three lineages to the early to middle Pleistocene, indicating their long-term independent evolutionary trajectories. Our data revealed an early divergence and stable in situ persistence of the southernmost, eastern Mediterranean lineage, whereas the mainland, south-east European lineages experienced more complex and turbulent evolutionary dynamics triggered by Pleistocene climatic oscillations. Our data also support the existence of multiple glacial refugia in southeast Europe and highlight the central Balkan Peninsula not only as a cradle of lineage diversifications but also as a source of lineage dispersal. Finally, the extant genetic variation within A. saxatilis is congruent with the taxonomic separation of peripatric A. saxatilis subsp. saxatilis and A. saxatilis subsp. orientalis, whereas the taxonomic status of A. saxatilis subsp. megalocarpa remains doubtful.
- Keywords
- Aurinia saxatilis, RAD sequencing, demographic modelling, glacial refugia, ndhF, species distribution modelling,
- Publication type
- Journal Article MeSH
Knowledge of population variation across species' ranges is a prerequisite for correctly assessing the overall variability of any group of organisms and provides an invaluable basis for unraveling evolutionary history, optimizing taxonomy and devising effective conservation strategies. Here, we examine the genus Neotinea, which represents a relatively recently delimited monophyletic genus of orchids, for which a detailed study of its overall variability was lacking. We applied a suite of biosystematic methods, consisting of flow cytometry, multivariate and geometric morphometrics, and analysis of genomic SNP data, to identify phylogenetic lineages within the genus, to delineate phenotypic variation relevant to these lineages, and to identify potential cryptic taxa within lineages. We found clear differentiation into four major lineages corresponding to the groups usually recognized within the genus: Neotinea maculata as a distinct and separate taxon, the Neotinea lactea group comprising two Mediterranean taxa N. lactea and Neotinea conica, the Neotinea ustulata group comprising two phenologically distinct varieties, and the rather complex Neotinea tridentata group comprising two major lineages and various minor lineages of unclear taxonomic value. N. conica constitutes both a monophyletic group within N. lactea and a distinct phenotype within the genus and merits its proposed subspecies-level recognition. By contrast, the spring and summer flowering forms of N. ustulata (var. ustulata and var. aestivalis) were confirmed to be distinct only morphologically, not phylogenetically. The most complex pattern emerged in the N. tridentata group, which splits into two main clades, one containing lineages from the Balkans and eastern Mediterranean and the other consisting of plants from Central Europe and the central Mediterranean. These individual lineages differ in genome size and show moderate degrees of morphological divergence. The tetraploid Neotinea commutata is closely related to the N. tridentata group, but our evidence points to an auto- rather than an allopolyploid origin. Our broad methodological approach proved effective in recognizing cryptic lineages among the orchids, and we propose the joint analysis of flow cytometric data on genome size and endopolyploidy as a useful and beneficial marker for delineating orchid species with partial endoreplication.
- Keywords
- RADseq, cryptic diversity, genome size, geometric morphometric, multivariate morphometric, orchids (Orchidaceae), partial endoreplication,
- Publication type
- Journal Article MeSH
Plant genomes are highly diverse in size and repetitive DNA composition. In the absence of polyploidy, the dynamics of repetitive elements, which make up the bulk of the genome in many species, are the main drivers underpinning changes in genome size and the overall evolution of the genomic landscape. The advent of high-throughput sequencing technologies has enabled investigation of genome evolutionary dynamics beyond model plants to provide exciting new insights in species across the biodiversity of life. Here we analyze the evolution of repetitive DNA in two closely related species of Heloniopsis (Melanthiaceae), which despite having the same chromosome number differ nearly twofold in genome size [i.e., H. umbellata (1C = 4,680 Mb), and H. koreana (1C = 2,480 Mb)]. Low-coverage genome skimming and the RepeatExplorer2 pipeline were used to identify the main repeat families responsible for the significant differences in genome sizes. Patterns of repeat evolution were found to correlate with genome size with the main classes of transposable elements identified being twice as abundant in the larger genome of H. umbellata compared with H. koreana. In addition, among the satellite DNA families recovered, a single shared satellite (HeloSAT) was shown to have contributed significantly to the genome expansion of H. umbellata. Evolutionary changes in repetitive DNA composition and genome size indicate that the differences in genome size between these species have been underpinned by the activity of several distinct repeat lineages.
- Keywords
- C-value, DNA repeats, chromosome, satellite DNA, transposable elements,
- Publication type
- Journal Article MeSH
Angiosperms have become the dominant terrestrial plant group by diversifying for ~145 million years into a broad range of environments. During the course of evolution, numerous morphological innovations arose, often preceded by whole genome duplications (WGD). The mustard family (Brassicaceae), a successful angiosperm clade with ~4000 species, has been diversifying into many evolutionary lineages for more than 30 million years. Here we develop a species inventory, analyze morphological variation, and present a maternal, plastome-based genus-level phylogeny. We show that increased morphological disparity, despite an apparent absence of clade-specific morphological innovations, is found in tribes with WGDs or diversification rate shifts. Both are important processes in Brassicaceae, resulting in an overall high net diversification rate. Character states show frequent and independent gain and loss, and form varying combinations. Therefore, Brassicaceae pave the way to concepts of phylogenetic genome-wide association studies to analyze the evolution of morphological form and function.
Plant genomes vary greatly in composition and size mainly due to the diversity of repetitive DNAs and the inherent propensity for their amplification and removal from the host genome. Most studies addressing repeatome dynamics focus on model organisms, whereas few provide comprehensive investigations across the genomes of related taxa. Herein, we analyze the evolution of repeats of the 13 species in Melampodium sect. Melampodium, representing all but two of its diploid taxa, in a phylogenetic context. The investigated genomes range in size from 0.49 to 2.27 pg/1C (ca. 4.5-fold variation), despite having the same base chromosome number (x = 10) and very strong phylogenetic affinities. Phylogenetic analysis performed in BEAST and ancestral genome size reconstruction revealed mixed patterns of genome size increases and decreases across the group. High-throughput genome skimming and the RepeatExplorer pipeline were utilized to determine the repeat families responsible for the differences in observed genome sizes. Patterns of repeat evolution were found to be highly correlated with phylogenetic position, namely taxonomic series circumscription. Major differences found were in the abundances of the SIRE (Ty1-copia), Athila (Ty3-gypsy), and CACTA (DNA transposon) lineages. Additionally, several satellite DNA families were found to be highly group-specific, although their overall contribution to genome size variation was relatively small. Evolutionary changes in repetitive DNA composition and genome size were complex, with independent patterns of genome up- and downsizing throughout the evolution of the analyzed diploids. A model-based analysis of genome size and repetitive DNA composition revealed evidence for strong phylogenetic signal and differential evolutionary rates of major lineages of repeats in the diploid genomes.
- Keywords
- Bayesian analysis, Melampodium, ancestral state reconstruction, genome size, phylogenetics, repetitive DNA, tandem repeats, transposable elements,
- Publication type
- Journal Article MeSH
BACKGROUND AND AIMS: Most crucifer species (Brassicaceae) have small nuclear genomes (mean 1C-value 617 Mb). The species with the largest genomes occur within the monophyletic Hesperis clade (Mandáková et al., Plant Physiology174: 2062-2071; also known as Clade E or Lineage III). Whereas most chromosome numbers in the clade are 6 or 7, monoploid genome sizes vary 16-fold (256-4264 Mb). To get an insight into genome size evolution in the Hesperis clade (~350 species in ~48 genera), we aimed to identify, quantify and localize in situ the repeats from which these genomes are built. We analysed nuclear repeatomes in seven species, covering the phylogenetic and genome size breadth of the clade, by low-pass whole-genome sequencing. METHODS: Genome size was estimated by flow cytometry. Genomic DNA was sequenced on an Illumina sequencer and DNA repeats were identified and quantified using RepeatExplorer; the most abundant repeats were localized on chromosomes by fluorescence in situ hybridization. To evaluate the feasibility of bacterial artificial chromosome (BAC)-based comparative chromosome painting in Hesperis-clade species, BACs of arabidopsis were used as painting probes. KEY RESULTS: Most biennial and perennial species of the Hesperis clade possess unusually large nuclear genomes due to the proliferation of long terminal repeat retrotransposons. The prevalent genome expansion was rarely, but repeatedly, counteracted by purging of transposable elements in ephemeral and annual species. CONCLUSIONS: The most common ancestor of the Hesperis clade has experienced genome upsizing due to transposable element amplification. Further genome size increases, dominating diversification of all Hesperis-clade tribes, contrast with the overall stability of chromosome numbers. In some subclades and species genome downsizing occurred, presumably as an adaptive transition to an annual life cycle. The amplification versus purging of transposable elements and tandem repeats impacted the chromosomal architecture of the Hesperis-clade species.
- Keywords
- Bunias, Hesperis, Matthiola, Brassicaceae, Genome size evolution, Lineage III, chromosome organization, interstitial telomeric repeats (ITRs), repetitive DNA, retrotransposons, tandem repeats,
- MeSH
- Brassicaceae * MeSH
- Genome Size MeSH
- Phylogeny MeSH
- Genome, Plant * MeSH
- In Situ Hybridization, Fluorescence MeSH
- Evolution, Molecular MeSH
- Cell Proliferation MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Allopolyploidy has played an important role in the evolution of the flowering plants. Genome mergers are often accompanied by significant and rapid alterations of genome size and structure via chromosomal rearrangements and altered dynamics of tandem and dispersed repetitive DNA families. Recent developments in sequencing technologies and bioinformatic methods allow for a comprehensive investigation of the repetitive component of plant genomes. Interpretation of evolutionary dynamics following allopolyploidization requires both the knowledge of parentage and the age of origin of an allopolyploid. Whereas parentage is typically inferred from cytogenetic and phylogenetic data, age inference is hampered by the reticulate nature of the phylogenetic relationships. Treating subgenomes of allopolyploids as if they belonged to different species (i.e., no recombination among subgenomes) and applying cross-bracing (i.e., putting a constraint on the age difference of nodes pertaining to the same event), we can infer the age of allopolyploids within the framework of the multispecies coalescent within BEAST2. Together with a comprehensive characterization of the repetitive DNA fraction using the RepeatExplorer pipeline, we apply the dating approach in a group of closely related allopolyploids and their progenitor species in the plant genus Melampodium (Asteraceae). We dated the origin of both the allotetraploid, Melampodium strigosum, and its two allohexaploid derivatives, Melampodium pringlei and Melampodium sericeum, which share both parentage and the direction of the cross, to the Pleistocene ($<$1.4 Ma). Thus, Pleistocene climatic fluctuations may have triggered formation of allopolyploids possibly in short intervals, contributing to difficulties in inferring the precise temporal order of allopolyploid species divergence of M. sericeum and M. pringlei. The relatively recent origin of the allopolyploids likely played a role in the near-absence of major changes in the repetitive fraction of the polyploids' genomes. The repetitive elements most affected by the postpolyploidization changes represented retrotransposons of the Ty1-copia lineage Maximus and, to a lesser extent, also Athila elements of Ty3-gypsy family.
- MeSH
- Asteraceae classification genetics MeSH
- DNA, Plant genetics MeSH
- Phylogeny MeSH
- Genome, Plant genetics MeSH
- Evolution, Molecular * MeSH
- Polyploidy MeSH
- Repetitive Sequences, Nucleic Acid genetics MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- DNA, Plant MeSH
Clade E, or the Hesperis clade, is one of the major Brassicaceae (Crucifereae) clades, comprising some 48 genera and 351 species classified into seven tribes and is distributed predominantly across arid and montane regions of Asia. Several taxa have socioeconomic significance, being important ornamental but also weedy and invasive species. From the comparative genomic perspective, the clade is noteworthy as it harbors species with the largest crucifer genomes but low numbers of chromosomes (n = 5-7). By applying comparative cytogenetic analysis and whole-chloroplast phylogenetics, we constructed, to our knowledge, the first partial and complete cytogenetic maps for selected representatives of clade E tribes and investigated their relationships in a family-wide context. The Hesperis clade is a well-supported monophyletic lineage comprising seven tribes: Anchonieae, Buniadeae, Chorisporeae, Dontostemoneae, Euclidieae, Hesperideae, and Shehbazieae. The clade diverged from other Brassicaceae crown-group clades during the Oligocene, followed by subsequent Miocene tribal diversifications in central/southwestern Asia. The inferred ancestral karyotype of clade E (CEK; n = 7) originated from an older n = 8 genome, which also was the purported progenitor of tribe Arabideae (KAA genome). In most taxa of clade E, the seven linkage groups of CEK either remained conserved (Chorisporeae) or were reshuffled by chromosomal translocations (Euclidieae). In 50% of Anchonieae and Hesperideae species, the CEK genome has undergone descending dysploidy toward n = 6 (-5). These genomic data elucidate early genome evolution in Brassicaceae and pave the way for future whole-genome sequencing and assembly efforts in this as yet genomically neglected group of crucifer plants.
- MeSH
- Brassicaceae genetics MeSH
- Phylogeny * MeSH
- Genome, Plant * MeSH
- Karyotype MeSH
- Evolution, Molecular * MeSH
- Publication type
- Journal Article MeSH
The Brassicaceae include several major crop plants and numerous important model species in comparative evolutionary research such as Arabidopsis, Brassica, Boechera, Thellungiella, and Arabis species. As any evolutionary hypothesis needs to be placed in a temporal context, reliably dated major splits within the evolution of Brassicaceae are essential. We present a comprehensive time-calibrated framework with important divergence time estimates based on whole-chloroplast sequence data for 29 Brassicaceae species. Diversification of the Brassicaceae crown group started at the Eocene-to-Oligocene transition. Subsequent major evolutionary splits are dated to ∼20 million years ago, coinciding with the Oligocene-to-Miocene transition, with increasing drought and aridity and transient glaciation events. The age of the Arabidopsis thaliana crown group is 6 million years ago, at the Miocene and Pliocene border. The overall species richness of the family is well explained by high levels of neopolyploidy (43% in total), but this trend is neither directly associated with an increase in genome size nor is there a general lineage-specific constraint. Our results highlight polyploidization as an important source for generating new evolutionary lineages adapted to changing environments. We conclude that species radiation, paralleled by high levels of neopolyploidization, follows genome size decrease, stabilization, and genetic diploidization.
- MeSH
- Arabidopsis genetics physiology MeSH
- Brassica genetics physiology MeSH
- Brassicaceae genetics physiology MeSH
- Phylogeny MeSH
- Genome, Chloroplast genetics MeSH
- Genome, Plant genetics MeSH
- Evolution, Molecular * MeSH
- Polyploidy MeSH
- Sequence Analysis, DNA MeSH
- Genetic Speciation MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH