Evolution of Tandem Repeats Is Mirroring Post-polyploid Cladogenesis in Heliophila (Brassicaceae)
Status PubMed-not-MEDLINE Jazyk angličtina Země Švýcarsko Médium electronic-ecollection
Typ dokumentu časopisecké články
PubMed
33510751
PubMed Central
PMC7835680
DOI
10.3389/fpls.2020.607893
Knihovny.cz E-zdroje
- Klíčová slova
- Cape flora, Cruciferae, South Africa, plastome phylogeny, rDNA ITS, repeatome, repetitive DNA, whole-genome duplication (WGD),
- Publikační typ
- časopisecké články MeSH
The unigeneric tribe Heliophileae encompassing more than 100 Heliophila species is morphologically the most diverse Brassicaceae lineage. The tribe is endemic to southern Africa, confined chiefly to the southwestern South Africa, home of two biodiversity hotspots (Cape Floristic Region and Succulent Karoo). The monospecific Chamira (C. circaeoides), the only crucifer species with persistent cotyledons, is traditionally retrieved as the closest relative of Heliophileae. Our transcriptome analysis revealed a whole-genome duplication (WGD) ∼26.15-29.20 million years ago, presumably preceding the Chamira/Heliophila split. The WGD was then followed by genome-wide diploidization, species radiations, and cladogenesis in Heliophila. The expanded phylogeny based on nuclear ribosomal DNA internal transcribed spacer (ITS) uncovered four major infrageneric clades (A-D) in Heliophila and corroborated the sister relationship between Chamira and Heliophila. Herein, we analyzed how the diploidization process impacted the evolution of repetitive sequences through low-coverage whole-genome sequencing of 15 Heliophila species, representing the four clades, and Chamira. Despite the firmly established infrageneric cladogenesis and different ecological life histories (four perennials vs. 11 annual species), repeatome analysis showed overall comparable evolution of genome sizes (288-484 Mb) and repeat content (25.04-38.90%) across Heliophila species and clades. Among Heliophila species, long terminal repeat (LTR) retrotransposons were the predominant components of the analyzed genomes (11.51-22.42%), whereas tandem repeats had lower abundances (1.03-12.10%). In Chamira, the tandem repeat content (17.92%, 16 diverse tandem repeats) equals the abundance of LTR retrotransposons (16.69%). Among the 108 tandem repeats identified in Heliophila, only 16 repeats were found to be shared among two or more species; no tandem repeats were shared by Chamira and Heliophila genomes. Six "relic" tandem repeats were shared between any two different Heliophila clades by a common descent. Four and six clade-specific repeats shared among clade A and C species, respectively, support the monophyly of these two clades. Three repeats shared by all clade A species corroborate the recent diversification of this clade revealed by plastome-based molecular dating. Phylogenetic analysis based on repeat sequence similarities separated the Heliophila species to three clades [A, C, and (B+D)], mirroring the post-polyploid cladogenesis in Heliophila inferred from rDNA ITS and plastome sequences.
CEITEC Masaryk University Brno Czechia
Department of Biology Botany Osnabrück University Osnabrück Germany
Department of Experimental Biology Faculty of Science Masaryk University Brno Czechia
Department of Geography and Environmental Studies Stellenbosch University Stellenbosch South Africa
Harry Butler Institute Murdoch University Perth WA Australia
Institute of Botany Czech Academy of Sciences Prùhonice Czechia
Missouri Botanical Garden St Louis MO United States
NCBR Faculty of Science Masaryk University Brno Czechia
South African National Biodiversity Institute Kirstenbosch Cape Town South Africa
Zobrazit více v PubMed
Al-Shehbaz I. A. (2012). A generic and tribal synopsis of the Brassicaceae (Cruciferae). DOI
Altschul S. F., Gish W., Miller W., Myers E. W., Lipman D. J. (1990). Basic local alignment search tool. PubMed
Andrews S. (2010). FastQC: A Quality Control Tool for High Throughput Sequence Data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
Benson G. (1998). “An algorithm for finding tandem repeats of unspecified pattern size,” in DOI
Bolger A. M., Lohse M., Usadel B. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. PubMed DOI PMC
Bolsheva N. L., Melnikova N. V., Kirov I. V., Dmitriev A. A., Krasnov G. S., Amosova ÀV., et al. (2019). Characterization of repeated DNA sequences in genomes of blue-flowered flax. PubMed DOI PMC
Brown J. W., Walker J. F., Smith S. A. (2017). Phyx: phylogenetic tools for unix. PubMed DOI PMC
Cechova M., Harris R. S., Tomaszkiewicz M., Arbeithuber B., Chiaromonte F., Makova K. D. (2019). High satellite repeat turnover in great apes studied with short-and long-read technologies. PubMed DOI PMC
Davidson N. M., Oshlack A. (2014). Corset: enabling differential gene expression analysis for de novo assembled transcriptomes. PubMed PMC
Dierckxsens N., Mardulyn P., Smits G. (2016). NOVOPlasty: de novo assembly of organelle genomes from whole genome data. PubMed PMC
Dodsworth S., Chase M. W., Kelly L. J., Leitch I. J., Macas J., Novák P., et al. (2014). Genomic repeat abundances contain phylogenetic signal. PubMed DOI PMC
Dodsworth S., Chase M. W., Särkinen T., Knapp S., Leitch A. R. (2016). Using genomic repeats for phylogenomics: a case study in wild tomatoes ( DOI
Dodsworth S., Jang T.-S., Struebig M., Chase M. W., Weiss-Schneeweiss H., Leitch A. R. (2017). Genome-wide repeat dynamics reflect phylogenetic distance in closely related allotetraploid PubMed DOI PMC
Doležel J., Greilhuber J., Suda J. (2007). Estimation of nuclear DNA content in plants using flow cytometry. PubMed DOI
Doronina L., Churakov G., Kuritzin A., Shi J., Baertsch R., Clawson H., et al. (2017). Speciation network in Laurasiatheria: retrophylogenomic signals. PubMed DOI PMC
Emms D. M., Kelly S. (2015). OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. PubMed PMC
Franzke A., Koch M. A., Mummenhoff K. (2016). Turnip time travels: age estimates in Brassicaceae. PubMed DOI
Fu L., Niu B., Zhu Z., Wu S., Li W. (2012). CD-HIT: accelerated for clustering the next-generation sequencing data. PubMed DOI PMC
García-Robledo C., Erickson D. L., Staines C. L., Erwin T. L., Kress W. J. (2013). Tropical plant–herbivore networks: reconstructing species interactions using DNA barcodes. PubMed DOI PMC
Garrido-Ramos M. A. (2015). Satellite DNA in plants: more than just rubbish. PubMed DOI
Garrido-Ramos M. A. (2017). Satellite DNA: an evolving topic. PubMed DOI PMC
Guo X., Liu J., Hao G., Zhang L., Mao K., Wang X., et al. (2017). Plastome phylogeny and early diversification of Brassicaceae. PubMed DOI PMC
Haas B., Papanicolaou A. (2016).
Haas B. J., Papanicolaou A., Yassour M., Grabherr M., Blood P. D., Bowden J., et al. (2013). PubMed DOI PMC
Harkess A., Mercati F., Abbate L., McKain M., Pires J. C., Sala T., et al. (2016). Retrotransposon proliferation coincident with the evolution of dioecy in PubMed DOI PMC
Henikoff S., Ahmad K., Malik H. S. (2001). The centromere paradox: stable inheritance with rapidly evolving DNA. PubMed DOI
Hohmann N., Wolf E. M., Lysak M. A., Koch M. A. (2015). A time-calibrated road map of Brassicaceae species radiation and evolutionary history. PubMed PMC
Huang D. I., Cronk Q. C. B. (2015). Plann: a command-line application for annotating plastome sequences. PubMed DOI PMC
Huson D. H., Bryant D. (2006). Application of phylogenetic networks in evolutionary studies. PubMed DOI
Jurka J., Bao W., Kojima K. K. (2011). Families of transposable elements, population structure and the origin of species. PubMed DOI PMC
Kagale S., Robinson S. J., Nixon J., Xiao R., Huebert T., Condie J., et al. (2014). Polyploid evolution of the Brassicaceae during the Cenozoic era. PubMed DOI PMC
Kalyaanamoorthy S., Minh B. Q., Wong T. K. F., von Haeseler A., Jermiin L. S. (2017). ModelFinder: fast model selection for accurate phylogenetic estimates. PubMed DOI PMC
Katoh K., Standley D. M. (2013). MAFFT multiple sequence alignment software version 7: improvements in performance and usability. PubMed DOI PMC
Kiefer C., Willing E.-M., Jiao W.-B., Sun H., Piednoël M., Hümann U., et al. (2019). Interspecies association mapping links reduced CG to TG substitution rates to the loss of gene-body methylation. PubMed DOI
Kohany O., Gentles A. J., Hankus L., Jurka J. (2006). Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor. PubMed DOI PMC
Kolde R., Kolde M. R. (2015).
Koukalova B., Moraes A. P., Renny-Byfield S., Matyasek R., Leitch A. R., Kovarik A. (2010). Fall and rise of satellite repeats in allopolyploids of PubMed DOI
Kumwenda M. W. (2003).
Lanfear R., Frandsen P. B., Wright A. M., Senfeld T., Calcott B. (2016). PartitionFinder 2: new methods for selecting partitioned models of evolution for molecular and morphological phylogenetic analyses. PubMed
Langmead B., Salzberg S. L. (2012). Fast gapped-read alignment with Bowtie 2. PubMed DOI PMC
Lysak M. A., Koch M. A. (2011). “Phylogeny, genome, and karyotype evolution of crucifers (Brassicaceae),” in DOI
Macas J., Kejnovský E., Neumann P., Novák P., Koblížková A., Vyskot B. (2011). Next generation sequencing-based analysis of repetitive DNA in the model dioceous plant PubMed DOI PMC
Mandáková T., Mummenhoff K., Al-Shehbaz I. A., Mucina L., Mühlhausen A., Lysak M. A. (2012). Whole-genome triplication and species radiation in the southern African tribe Heliophileae (Brassicaceae). DOI
Mandáková T., Winter P., Al-Shehbaz I. A., Mucina L., Mummenhoff K., Lysak M. A., et al. (2015). “Brassicaceae. IAPT/IOPB chromosome data 19,” in
Mandáková T., Lysak M. A. (2016a). Chromosome preparation for cytogenetic analyses in Arabidopsis. PubMed DOI
Mandáková T., Lysak M. A. (2016b). Painting of Arabidopsis chromosomes with chromosome-specific BAC clones. PubMed DOI
Mandáková T., Li Z., Barker M. S., Lysak M. A. (2017). Diverse genome organization following 13 independent mesopolyploid events in Brassicaceae contrasts with convergent patterns of gene retention. PubMed DOI
Marais W. (1970). “Cruciferae,” in
McCann J., Macas J., Novák P., Stuessy T. F., Villaseñor J. L., Weiss-Schneeweiss H. (2020). Differential genome size and repetitive DNA evolution in diploid species of PubMed DOI PMC
Melters D. P., Bradnam K. R., Young H. A., Telis N., May M. R., Ruby J. G., et al. (2013). Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution. PubMed DOI PMC
Meraldi P., McAinsh A. D., Rheinbay E., Sorger P. K. (2006). Phylogenetic and structural analysis of centromeric DNA and kinetochore proteins. PubMed DOI PMC
Miller M. A., Pfeiffer W., Schwartz T. (2010). “Creating the CIPRES science gateway for inference of large phylogenetic trees,” in
Minamoto T., Uchii K., Takahara T., Kitayoshi T., Tsuji S., Yamanaka H., et al. (2017). Nuclear internal transcribed spacer−1 as a sensitive genetic marker for environmental DNA studies in common carp PubMed DOI
Moisy C., Schulman A. H., Kalendar R., Buchmann J. P., Pelsy F. (2014). The Tvv1 retrotransposon family is conserved between plant genomes separated by over 100 million years. PubMed DOI
Mummenhoff K., Al-Shehbaz I. A., Bakker F. T., Linder H. P., Mühlhausen A. (2005). Phylogeny, morphological evolution, and speciation of endemic Brassicaceae genera in the Cape flora of southern Africa.
Mummenhoff K., Linder P., Friesen N., Bowman J. L., Lee J., Franzke A. (2004). Molecular evidence for bicontinental hybridogenous genomic constitution in PubMed DOI
Nguyen L.-T., Schmidt H. A., von Haeseler A., Minh B. Q. (2014). IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. PubMed DOI PMC
Novák P., Neumann P., Macas J. (2010). Graph-based clustering and characterization of repetitive sequences in next-generation sequencing data. PubMed DOI PMC
Novák P., Neumann P., Pech J., Steinhaisl J., MacAs J. (2013). RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads. PubMed DOI
Novák P., Robledillo L. Á, Koblížková A., Vrbová I., Neumann P., Macas J. (2017). TAREAN: a computational tool for identification and characterization of satellite DNA from unassembled short reads. PubMed DOI PMC
Oberlander K. C., Dreyer L. L., Goldblatt P., Suda J., Linder H. P. (2016). Species-rich and polyploid-poor: insights into the evolutionary role of whole-genome duplication from the Cape flora biodiversity hotspot. PubMed DOI
Paradis E., Schliep K. (2019). ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. PubMed DOI
Patro R., Duggal G., Love M. I., Irizarry R. A., Kingsford C. (2017). Salmon provides fast and bias-aware quantification of transcript expression. PubMed DOI PMC
Poplin R., Ruano-Rubio V., DePristo M. A., Fennell T. J., Carneiro M. O., Van der Auwera G. A., et al. (2017). Scaling accurate genetic variant discovery to tens of thousands of samples. DOI
R Core Team (2013).
Rambaut A., Drummond A. J., Xie D., Baele G., Suchard M. A. (2018). Posterior summarization in Bayesian phylogenetics using Tracer 1.7. PubMed DOI PMC
Rannala B., Yang Z. (2007). Inferring speciation times under an episodic molecular clock. PubMed DOI
Renny-Byfield S., Kovarik A., Kelly L. J., Macas J., Novak P., Chase M. W., et al. (2013). Diploidization and genome size change in allopolyploids is associated with differential dynamics of low- and high-copy sequences. PubMed DOI
Ronquist F., Teslenko M., Van Der Mark P., Ayres D. L., Darling A., Höhna S., et al. (2012). MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. PubMed DOI PMC
Simão F. A., Waterhouse R. M., Ioannidis P., Kriventseva E. V, Zdobnov E. M. (2015). BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. PubMed DOI
Sinha S., Siggia E. D. (2005). Sequence turnover and tandem repeats in cis-regulatory modules in PubMed DOI
Smith-Unna R., Boursnell C., Patro R., Hibberd J. M., Kelly S. (2016). TransRate: reference-free quality assessment of de novo transcriptome assemblies. PubMed DOI PMC
Song L., Florea L. (2015). Rcorrector: efficient and accurate error correction for Illumina RNA-seq reads. PubMed PMC
Sonnhammer E. L. L., Durbin R. (1995). A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. PubMed
Talavera G., Castresana J. (2007). Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. PubMed DOI
Temsch E. M., Greilhuber J., Krisai R. (2010). Genome size in liverworts.
Thomas G. W. C., Ather S. H., Hahn M. W. (2017). Gene-tree reconciliation with MUL-trees to resolve polyploidy events. PubMed DOI
Towns J., Cockerill T., Dahan M., Foster I., Gaither K., Grimshaw A., et al. (2014). XSEDE: accelerating scientific discovery.
Van Dongen S., Abreu-Goodger C. (2012). Using MCL to extract clusters from networks. PubMed DOI
Vitales D., Garcia S., Dodsworth S. (2020). Reconstructing phylogenetic relationships based on repeat sequence similarities. PubMed DOI
Wang X., Liu C., Huang L., Bengtsson-Palme J., Chen H., Zhang J., et al. (2015). ITS 1: a DNA barcode better than ITS 2 in eukaryotes? PubMed DOI
Wicker T., Gundlach H., Spannagl M., Uauy C., Borrill P., Ramírez-González R. H., et al. (2018). Impact of transposable elements on genome structure and evolution in bread wheat. PubMed PMC
Yang R.-H., Su J.-H., Shang J.-J., Wu Y.-Y., Li Y., Bao D.-P., et al. (2018). Evaluation of the ribosomal DNA internal transcribed spacer (ITS), specifically ITS1 and ITS2, for the analysis of fungal diversity by deep sequencing. PubMed DOI PMC
Yang Y., Smith S. A. (2014). Orthology inference in nonmodel organisms using transcriptomes and low-coverage genomes: improving accuracy and matrix occupancy for phylogenomics. PubMed DOI PMC
Zhang C., Rabiee M., Sayyari E., Mirarab S. (2018). ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees. PubMed DOI PMC
Zwaenepoel A., Van de Peer Y. (2019). wgd—simple command line tools for the analysis of ancient whole-genome duplications. PubMed DOI PMC
An updated classification of the Brassicaceae (Cruciferae)