Evolution of Tandem Repeats Is Mirroring Post-polyploid Cladogenesis in Heliophila (Brassicaceae)

2020; 11: 607893. [epub] 20210112

Status: PubMed-not-MEDLINE. Language: English. Country: Switzerland. Medium: electronic-ecollection

Document type: journal articles

Persistent link: https://www.medvik.cz/link/pmid33510751

The unigeneric tribe Heliophileae, encompassing more than 100 Heliophila species, is morphologically the most diverse Brassicaceae lineage. The tribe is endemic to southern Africa, confined chiefly to southwestern South Africa, home of two biodiversity hotspots (Cape Floristic Region and Succulent Karoo). The monospecific Chamira (C. circaeoides), the only crucifer species with persistent cotyledons, is traditionally retrieved as the closest relative of Heliophileae. Our transcriptome analysis revealed a whole-genome duplication (WGD) ∼26.15-29.20 million years ago, presumably preceding the Chamira/Heliophila split. The WGD was then followed by genome-wide diploidization, species radiations, and cladogenesis in Heliophila. The expanded phylogeny based on the nuclear ribosomal DNA internal transcribed spacer (ITS) uncovered four major infrageneric clades (A-D) in Heliophila and corroborated the sister relationship between Chamira and Heliophila. Herein, we analyzed how the diploidization process impacted the evolution of repetitive sequences through low-coverage whole-genome sequencing of 15 Heliophila species, representing the four clades, and Chamira. Despite the firmly established infrageneric cladogenesis and different ecological life histories (four perennial vs. 11 annual species), repeatome analysis showed overall comparable evolution of genome sizes (288-484 Mb) and repeat content (25.04-38.90%) across Heliophila species and clades. Among Heliophila species, long terminal repeat (LTR) retrotransposons were the predominant components of the analyzed genomes (11.51-22.42%), whereas tandem repeats had lower abundances (1.03-12.10%). In Chamira, the tandem repeat content (17.92%, 16 diverse tandem repeats) is roughly equal to the abundance of LTR retrotransposons (16.69%). Among the 108 tandem repeats identified in Heliophila, only 16 repeats were found to be shared among two or more species; no tandem repeats were shared by Chamira and Heliophila genomes. Six "relic" tandem repeats were shared between any two different Heliophila clades through common descent. Four and six clade-specific repeats shared among clade A and C species, respectively, support the monophyly of these two clades. Three repeats shared by all clade A species corroborate the recent diversification of this clade revealed by plastome-based molecular dating. Phylogenetic analysis based on repeat sequence similarities separated the Heliophila species into three clades [A, C, and (B+D)], mirroring the post-polyploid cladogenesis in Heliophila inferred from rDNA ITS and plastome sequences.

Erratum in: PubMed


