Most cited article - PubMed ID 29348708
HybPhyloMaker: Target Enrichment Data Analysis From Raw Reads to Species Trees
Genome size variation is a crucial aspect of plant evolution, influenced by a complex interplay of factors. Repetitive elements, which are fundamental components of genomic architecture, often contribute to genome expansion through the selective amplification of specific repeat motifs. This study focuses on Amomum, a genus in the ginger family (Zingiberaceae), known for its 4.4-fold variation in genome size. Using PhyloNet reconstruction, RepeatExplorer clustering, and repeat similarity-based phylogenetic network construction, we investigated the repeatome composition, analyzed repeat dynamics, and identified potential hybridization events within the genus. Our analysis confirmed the presence of four major infrageneric clades (A-D) within Amomum, with clades A-C exclusively comprising diploid species (2n = 48) and clade D encompassing both diploid and tetraploid species (2n = 48 and 96). We observed an increase in repeat content within the genus, ranging from 84% to 89%, compared with 75% in the outgroup species. The SIRE lineage of the Ty1-Copia repeat superfamily was prevalent in most analyzed ingroup genomes. We identified a significant difference in repeatome structure between the basal Amomum clades (A, B, C) and the most diverged clade D. Our investigation revealed evidence of ancient hybridization events within Amomum, coinciding with a substantial proliferation of multiple repeat groups. This finding supports the hypothesis that ancient hybridization is a driving force in the genomic evolution of Amomum. Furthermore, we place our findings within the broader context of genome size variation and repeatome dynamics observed across major monocot lineages. This study enhances our understanding of evolutionary processes within monocots by highlighting the crucial role of repetitive elements in shaping genome size and suggesting the mechanisms that drive these changes.
- Keywords
- 5S rDNA, Zingiberaceae, genome evolution, genome size, interspecific hybridization, phylogeny, repeatome, repetitive DNA
- Publication type
- Journal Article MeSH
The Afromontane and Afroalpine areas constitute some of the main biodiversity hotspots of Africa. They are particularly rich in plant endemics, but the biogeographic origins and evolutionary processes leading to this outstanding diversity are poorly understood. We performed phylogenomic and biogeographic analyses of one of the most species-rich plant genera in these mountains, Helichrysum (Compositae-Gnaphalieae). Most previous studies have focused on Afroalpine elements of Eurasian origin, and the southern African origin of Helichrysum provides an interesting counterexample. We obtained a comprehensive nuclear dataset from 304 species (≈50% of the genus) using target-enrichment with the Compositae1061 probe set. Summary-coalescent and concatenation approaches combined with paralog recovery yielded congruent, well-resolved phylogenies. Ancestral range estimations revealed that Helichrysum originated in arid southern Africa, whereas the southern African grasslands were the source of most lineages that dispersed within and outside Africa. Colonization of the tropical Afromontane and Afroalpine areas occurred repeatedly throughout the Miocene-Pliocene. This timing coincides with mountain uplift and the onset of glacial cycles, which together may have facilitated both speciation and intermountain gene flow, contributing to the evolution of the Afroalpine flora.
- Keywords
- Afroalpine, Afromontane, Asteraceae, Helichrysum, biogeography, evolution, long-distance dispersal, phylogeny, target-enrichment
- Publication type
- Journal Article MeSH
PREMISE: Custom probe design for target enrichment in phylogenetics is tedious and often hinders broader phylogenetic synthesis. The universal angiosperm probe set Angiosperms353 may be the solution. Here, we test the relative performance of Angiosperms353 on the Rosaceae subtribe Malinae in comparison with custom probes that we specifically designed for this clade. We then address the impact of bioinformatically altering Angiosperms353 by replacing the original probe sequences with orthologs extracted from the Malus domestica genome. METHODS: To evaluate the relative performance of these probe sets, we compared the enrichment efficiency, locus recovery, alignment length, proportion of parsimony-informative sites, proportion of potential paralogs, the topology and support of the resulting species trees, and the gene tree discordance. RESULTS: Locus recovery was highest for our custom Malinae probe set, and replacing the original Angiosperms353 sequences with a Malus representative improved the locus recovery relative to Angiosperms353. The proportion of parsimony-informative sites was similar among all probe sets, while the gene tree discordance was lower for the custom probes. DISCUSSION: A custom probe set benefits from data completeness and can be tailored toward the specificities of the project of choice; however, Angiosperms353 was as phylogenetically informative as the custom probes. We therefore recommend using both a custom probe set and Angiosperms353 to facilitate large-scale systematic studies, where financially possible.
- Keywords
- Angiosperms353, Malinae, customized probe set, target enrichment, universal probe set
- Publication type
- Journal Article MeSH
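The proportion of parsimony-informative sites, used in the abstract above to compare probe sets, can be illustrated with a minimal sketch. It assumes the common definition (a column is informative when at least two character states each occur in at least two sequences) and, as a further assumption, ignores gaps and ambiguity codes. The function name and the toy alignment are hypothetical and are not taken from the cited study or from any particular pipeline.

```python
from collections import Counter


def parsimony_informative_fraction(alignment):
    """Return the fraction of alignment columns that are parsimony-informative.

    alignment: list of aligned sequences (strings of equal length).
    Assumption: only unambiguous A/C/G/T characters are counted.
    """
    if not alignment:
        raise ValueError("alignment is empty")
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all sequences must have the same length")
    if length == 0:
        return 0.0

    informative = 0
    for column in zip(*alignment):
        # Count each unambiguous nucleotide in this column.
        counts = Counter(base.upper() for base in column if base.upper() in "ACGT")
        # States present in at least two sequences.
        shared_states = sum(1 for n in counts.values() if n >= 2)
        if shared_states >= 2:
            informative += 1
    return informative / length


# Hypothetical toy alignment: only the second column (C, C, T, T) qualifies.
toy_alignment = ["ACGTA", "ACGTA", "ATGCA", "ATGAA"]
fraction = parsimony_informative_fraction(toy_alignment)
```

For real data the sequences would typically be read from an alignment file first; the per-column logic stays the same.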
A major challenge in phylogenetics and phylogenomics is to resolve young, rapidly radiating groups. The fast succession of speciation events increases the probability of incomplete lineage sorting (ILS), and different topologies of the gene trees are expected, leading to gene tree discordance, i.e., not all gene trees represent the species tree. Phylogenetic discordance is common in phylogenomic datasets, and apart from ILS, additional sources include hybridization, whole-genome duplication, and methodological artifacts. Despite a high degree of gene tree discordance, species trees are often well supported and the sources of discordance are not further addressed in phylogenomic studies, which can eventually lead to incorrect phylogenetic hypotheses, especially in rapidly radiating groups. We chose the high-Andean Asteraceae genus Loricaria to shed light on the potential sources of phylogenetic discordance and generated a phylogenetic hypothesis. By accounting for paralogy during gene tree inference, we generated a species tree based on hundreds of nuclear loci, using Hyb-Seq, and a plastome phylogeny obtained from off-target reads during target enrichment. We observed a high degree of gene tree discordance, which we found implausible at first sight, because the genus did not show evidence of hybridization in previous studies. We used various phylogenomic analyses (trees and networks) as well as the D-statistic to test for ILS and hybridization, which we developed into a workflow on how to tackle phylogenetic discordance in recent radiations. We found strong evidence for ILS and hybridization within the genus Loricaria. Low genetic differentiation was evident between species located in different Andean cordilleras, which could be indicative of substantial introgression between populations, promoted during Pleistocene glaciations, when alpine habitats shifted, creating opportunities for secondary contact and hybridization.
- Keywords
- cytonuclear discordance, gene tree discordance, hybridization, incomplete lineage sorting, rapid radiation, workflow
- Publication type
- Journal Article MeSH
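The D-statistic mentioned in the Loricaria abstract above is commonly computed as the ABBA-BABA test on a four-taxon tree (((P1, P2), P3), outgroup): D = (ABBA - BABA) / (ABBA + BABA). The sketch below assumes one haploid sequence per taxon and biallelic sites, with the outgroup allele treated as ancestral. The function name and the toy sequences are hypothetical; this is not the workflow from the cited study, and it omits the significance testing (for example a block jackknife) that a real analysis would need.

```python
def d_statistic(p1, p2, p3, outgroup):
    """Count ABBA and BABA site patterns and return (D, abba, baba).

    Assumptions: the four sequences are aligned, one per taxon, and the
    outgroup carries the ancestral allele ("A" in the pattern names).
    """
    abba = baba = 0
    for a, b, c, o in zip(p1.upper(), p2.upper(), p3.upper(), outgroup.upper()):
        site = (a, b, c, o)
        # Skip gaps, ambiguity codes, and sites that are not biallelic.
        if any(x not in "ACGT" for x in site) or len(set(site)) != 2:
            continue
        # P3 must carry the derived allele for either pattern.
        if c == o:
            continue
        if a == o and b == c:
            abba += 1  # P2 shares the derived allele with P3
        elif b == o and a == c:
            baba += 1  # P1 shares the derived allele with P3
    if abba + baba == 0:
        raise ValueError("no ABBA or BABA sites found")
    return (abba - baba) / (abba + baba), abba, baba


# Hypothetical toy data: sites 2 and 4 are ABBA, site 5 is BABA.
d, n_abba, n_baba = d_statistic("AAGTG", "ACGAC", "ACTAG", "AAGTC")
```

Under ILS alone the two patterns are expected to be equally frequent, so D is expected to be close to zero; a consistent excess of one pattern is usually interpreted as a signal of introgression between P3 and either P1 or P2.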
PREMISE: Hybrid capture with high-throughput sequencing (Hyb-Seq) is a powerful tool for evolutionary studies. The applicability of an Asteraceae family-specific Hyb-Seq probe set and the outcomes of different phylogenetic analyses are investigated here. METHODS: Hyb-Seq data from 112 Asteraceae samples were organized into groups at different taxonomic levels (tribe, genus, and species). For each group, data sets of non-paralogous loci were built and proportions of parsimony-informative characters estimated. The impacts of analyzing alternative data sets, removing long branches, and type of analysis on tree resolution and inferred topologies were investigated in tribe Cichorieae. RESULTS: Alignments of the Asteraceae family-wide Hyb-Seq locus set were parsimony informative at all taxonomic levels. Levels of resolution and topologies inferred at shallower nodes differed depending on the locus data set and the type of analysis, and were affected by the presence of long branches. DISCUSSION: The approach used to build a Hyb-Seq locus data set influenced resolution and topologies inferred in phylogenetic analyses. Removal of long branches improved the reliability of topological inferences in maximum likelihood analyses. The Asteraceae Hyb-Seq probe set is applicable at multiple taxonomic depths, which demonstrates that probe sets do not necessarily need to be lineage-specific.
- Keywords
- Asteraceae, Compositae, Hyb-Seq, hybrid capture, non-paralogy, phylogenetics
- Publication type
- Journal Article MeSH