Most cited article - PubMed ID 23259460
Next generation sequencing analysis reveals a relationship between rDNA unit diversity and locus number in Nicotiana diploids
The classical model of concerted evolution states that hundreds to thousands of ribosomal DNA (rDNA) units undergo homogenization, making the multiple copies of the individual units more uniform across the genome than would be expected given mutation frequencies and gene redundancy. While the universality of this over 50-year-old model has been confirmed in a range of organisms, advanced high throughput sequencing techniques have also revealed that rDNA homogenization in many organisms is partial and, in rare cases, even apparently failing. The potential underpinning processes leading to unexpected intragenomic variation have been discussed in a number of studies, but a comprehensive understanding remains to be determined. In this work, we summarize information on variation or polymorphisms in rDNAs across a wide range of taxa amongst animals, fungi, plants, and protists. We discuss the definition and description of concerted evolution and describe whether incomplete concerted evolution of rDNAs predominantly affects coding or non-coding regions of rDNA units and if it leads to the formation of pseudogenes or not. We also discuss the factors contributing to rDNA variation, such as interspecific hybridization, meiotic cycles, rDNA expression status, genome size, and the activity of effector genes involved in genetic recombination, epigenetic modifications, and DNA editing. Finally, we argue that a combination of approaches is needed to target genetic and epigenetic phenomena influencing incomplete concerted evolution, to give a comprehensive understanding of the evolution and functional consequences of intragenomic variation in rDNA.
- MeSH
- Phylogeny MeSH
- Genetic Variation * MeSH
- Fungi genetics MeSH
- Evolution, Molecular MeSH
- Mutation MeSH
- Polymorphism, Genetic * MeSH
- DNA, Ribosomal genetics MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Review MeSH
- Names of Substances
- DNA, Ribosomal MeSH
The genus Trifolium L. is characterized by basic chromosome numbers 8, 7, 6, and 5. We conducted a genus-wide study of ribosomal DNA (rDNA) structure variability in diploids and polyploids to gain insight into evolutionary history. We used fluorescent in situ hybridization to newly investigate rDNA variation by number and position in 30 Trifolium species. Evolutionary history among species was examined using 85 available sequences of internal transcribed spacer 1 (ITS1) of 35S rDNA. In diploid species with ancestral basic chromosome number (x = 8), one pair of 5S and 26S rDNA in separate or adjacent positions on a pair of chromosomes was prevalent. Genomes of species with reduced basic chromosome numbers were characterized by increased number of signals determined on one pair of chromosomes or all chromosomes. Increased number of signals was observed also in diploids Trifolium alpestre and Trifolium microcephalum and in polyploids. Sequence alignment revealed ITS1 sequences with mostly single nucleotide polymorphisms, and ITS1 diversity was greater in diploids with reduced basic chromosome numbers compared to diploids with ancestral basic chromosome number (x = 8) and polyploids. Our results suggest the presence of one 5S rDNA site and one 26S rDNA site as an ancestral state.
- Keywords
- 26S rDNA, 5S rDNA, clover, fluorescent in situ hybridization, genome structure, nucleotide polymorphism,
- Publication type
- Journal Article MeSH
INTRODUCTION: Ribosomal DNA (rDNA) loci have been widely used for identification of allopolyploids and hybrids, although few of these studies employed high-throughput sequencing data. Here we use graph clustering implemented in the RepeatExplorer (RE) pipeline to analyze homoeologous 5S rDNA arrays at the genomic level searching for hybridogenic origin of species. Data were obtained from more than 80 plant species, including several well-defined allopolyploids and homoploid hybrids of different evolutionary ages and from widely dispersed taxonomic groups. RESULTS: (i) Diploids show simple circular-shaped graphs of their 5S rDNA clusters. In contrast, most allopolyploids and other interspecific hybrids exhibit more complex graphs composed of two or more interconnected loops representing intergenic spacers (IGS). (ii) There was a relationship between graph complexity and locus numbers. (iii) The sequences and lengths of the 5S rDNA units reconstituted in silico from k-mers were congruent with those experimentally determined. (iv) Three-genomic comparative cluster analysis of reads from allopolyploids and progenitor diploids allowed identification of homoeologous 5S rRNA gene families even in relatively ancient (c. 1 Myr) Gossypium and Brachypodium allopolyploids which already exhibit uniparental partial loss of rDNA repeats. (v) Finally, species harboring introgressed genomes exhibit exceptionally complex graph structures. CONCLUSION: We found that the cluster graph shapes and graph parameters (k-mer coverage scores and connected component index) well-reflect the organization and intragenomic homogeneity of 5S rDNA repeats. We propose that the analysis of 5S rDNA cluster graphs computed by the RE pipeline together with the cytogenetic analysis might be a reliable approach for the determination of the hybrid or allopolyploid plant species parentage and may also be useful for detecting historical introgression events.
- Keywords
- 5S rRNA genes, allopolyploidy, evolution, graph structure clustering, high-throughput sequencing, hybridization, repeatome,
- Publication type
- Journal Article MeSH
INTRODUCTION: Gnetophytes, comprising the genera Ephedra, Gnetum and Welwitschia, are an understudied, enigmatic lineage of gymnosperms with a controversial phylogenetic relationship to other seed plants. Here we examined the organization of ribosomal DNA (rDNA) across representative species. METHODS: We applied high-throughput sequencing approaches to isolate and reconstruct rDNA units and to determine their intragenomic homogeneity. In addition, fluorescent in situ hybridization and Southern blot hybridization techniques were used to reveal the chromosome and genomic organization of rDNA. KEY RESULTS: The 5S and 35S rRNA genes were separate (S-type) in Gnetum montanum, Gnetum gnemon and Welwitschia mirabilis and linked (L-type) in Ephedra altissima. There was considerable variability in 5S rDNA abundance, ranging from as few as ~4000 (W. mirabilis) to >100 000 (G. montanum) copies. A similar large variation was also observed in 5S rDNA locus numbers (two to 16 sites per diploid cell). 5S rRNA pseudogenes were interspersed between functional genes forming a single unit in E. altissima and G. montanum. Their copy number was comparable or even higher than that of functional 5S rRNA genes. In E. altissima internal transcribed spacers of 35S rDNA were long and intrinsically repetitive while in G. montanum and W. mirabilis they were short without the subrepeats. CONCLUSIONS: Gnetophytes are distinct from other gymnosperms and angiosperms as they display surprisingly large variability in rDNA organization and rDNA copy and locus numbers between genera, with no relationship between copy numbers and genome sizes apparent. Concerted evolution of 5S rDNA units seems to have led to the amplification of 5S pseudogenes in both G. montanum and E. altissima. Evolutionary patterns of rDNA show both gymnosperm and angiosperm features underlining the diversity of the group.
- Keywords
- Gnetophytes, chromosome evolution, concerted evolution, high-throughput sequencing, intragenomic diversity, pseudogenes, rDNA organization,
- MeSH
- Cycadopsida * MeSH
- Phylogeny MeSH
- In Situ Hybridization, Fluorescence MeSH
- Evolution, Molecular MeSH
- DNA, Ribosomal MeSH
- DNA Copy Number Variations MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- DNA, Ribosomal MeSH
BACKGROUND: Pikes represent an important genus (Esox) harbouring a pre-duplication karyotype (2n = 2x = 50) of economically important salmonid pseudopolyploids. Here, we have characterized the 5S ribosomal RNA genes (rDNA) in Esox lucius and its closely related E. cisalpinus using cytogenetic, molecular and genomic approaches. Intragenomic homogeneity and copy number estimation was carried out using Illumina reads. The higher-order structure of rDNA arrays was investigated by the analysis of long PacBio reads. Position of loci on chromosomes was determined by FISH. DNA methylation was analysed by methylation-sensitive restriction enzymes. RESULTS: The 5S rDNA loci occupy exclusively (peri)centromeric regions on 30-38 acrocentric chromosomes in both E. lucius and E. cisalpinus. The large number of loci is accompanied by extreme amplification of genes (>20,000 copies), which is to the best of our knowledge one of the highest copy number of rRNA genes in animals ever reported. Conserved secondary structures of predicted 5S rRNAs indicate that most of the amplified genes are potentially functional. Only few SNPs were found in genic regions indicating their high homogeneity while intergenic spacers were more heterogeneous and several families were identified. Analysis of 10-30 kb-long molecules sequenced by the PacBio technology (containing about 40% of total 5S rDNA) revealed that the vast majority (96%) of genes are organised in large several kilobase-long blocks. Dispersed genes or short tandems were less common (4%). The adjacent 5S blocks were directly linked, separated by intervening DNA and even inverted. The 5S units differing in the intergenic spacers formed both homogeneous and heterogeneous (mixed) blocks indicating variable degree of homogenisation between the loci. Both E. lucius and E. cisalpinus 5S rDNA was heavily methylated at CG dinucleotides. CONCLUSIONS: Extreme amplification of 5S rRNA genes in the Esox genome occurred in the absence of significant pseudogenisation suggesting its recent origin and/or intensive homogenisation processes. The dense methylation of units indicates that powerful epigenetic mechanisms have evolved in this group of fish to silence amplified genes. We discuss how the higher-order repeat structures impact on homogenisation of 5S rDNA in the genome.
- Keywords
- Chromosome, Esox, Evolution, Fish, Single cell PacBio sequencing, rDNA,
- MeSH
- Esocidae genetics MeSH
- Phylogeny MeSH
- Genetic Loci genetics MeSH
- Genomics * MeSH
- Gene Dosage MeSH
- Heterochromatin metabolism MeSH
- Conserved Sequence MeSH
- DNA Methylation * MeSH
- DNA, Ribosomal genetics MeSH
- Base Sequence MeSH
- Oligonucleotide Array Sequence Analysis MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Heterochromatin MeSH
- DNA, Ribosomal MeSH
BACKGROUND AND AIMS: Brassica napus (AACC, 2n = 38, oilseed rape) is a relatively recent allotetraploid species derived from the putative progenitor diploid species Brassica rapa (AA, 2n = 20) and Brassica oleracea (CC, 2n = 18). To determine the influence of intensive breeding conditions on the evolution of its genome, we analysed structure and copy number of rDNA in 21 cultivars of B. napus, representative of genetic diversity. METHODS: We used next-generation sequencing genomic approaches, Southern blot hybridization, expression analysis and fluorescence in situ hybridization (FISH). Subgenome-specific sequences derived from rDNA intergenic spacers (IGS) were used as probes for identification of loci composition on chromosomes. KEY RESULTS: Most B. napus cultivars (18/21, 86 %) had more A-genome than C-genome rDNA copies. Three cultivars analysed by FISH ('Darmor', 'Yudal' and 'Asparagus kale') harboured the same number (12 per diploid set) of loci. In B. napus 'Darmor', the A-genome-specific rDNA probe hybridized to all 12 rDNA loci (eight on the A-genome and four on the C-genome) while the C-genome-specific probe showed weak signals on the C-genome loci only. Deep sequencing revealed high homogeneity of arrays suggesting that the C-genome genes were largely overwritten by the A-genome variants in B. napus 'Darmor'. In contrast, B. napus 'Yudal' showed a lack of gene conversion evidenced by additive inheritance of progenitor rDNA variants and highly localized hybridization signals of subgenome-specific probes on chromosomes. Brassica napus 'Asparagus kale' showed an intermediate pattern to 'Darmor' and 'Yudal'. At the expression level, most cultivars (95 %) exhibited stable A-genome nucleolar dominance while one cultivar ('Norin 9') showed co-dominance. CONCLUSIONS: The B. napus cultivars differ in the degree and direction of rDNA homogenization. The prevalent direction of gene conversion (towards the A-genome) correlates with the direction of expression dominance indicating that gene activity may be needed for interlocus gene conversion.
- Keywords
- Brassica napus, allopolyploidy, chromosome evolution, gene conversion, rDNA,
- MeSH
- Brassica napus genetics MeSH
- Genetic Variation genetics MeSH
- Genetic Loci genetics MeSH
- Gene Conversion genetics MeSH
- In Situ Hybridization, Fluorescence MeSH
- DNA, Ribosomal genetics MeSH
- Blotting, Southern MeSH
- Gene Expression Profiling MeSH
- High-Throughput Nucleotide Sequencing MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- DNA, Ribosomal MeSH
Genes encoding ribosomal RNA (rDNA) are universal key constituents of eukaryotic genomes, and the nuclear genome harbours hundreds to several thousand copies of each species. Knowledge about the number of rDNA loci and gene copy number provides information for comparative studies of organismal and molecular evolution at various phylogenetic levels. With the exception of seed plants, the range of 45S rDNA locus (encoding 18S, 5.8S and 26S rRNA) and gene copy number variation within key evolutionary plant groups is largely unknown. This is especially true for the three earliest land plant lineages Marchantiophyta (liverworts), Bryophyta (mosses), and Anthocerotophyta (hornworts). In this work, we report the extent of rDNA variation in early land plants, assessing the number of 45S rDNA loci and gene copy number in 106 species and 25 species, respectively, of mosses, liverworts and hornworts. Unexpectedly, the results show a narrow range of ribosomal locus variation (one or two 45S rDNA loci) and gene copies not present in vascular plant lineages, where a wide spectrum is recorded. Mutation analysis of whole genomic reads showed higher (3-fold) intragenomic heterogeneity of Marchantia polymorpha (Marchantiophyta) rDNA compared to Physcomitrella patens (Bryophyta) and two angiosperms (Arabidopsis thaliana and Nicotiana tomentosifomis) suggesting the presence of rDNA pseudogenes in its genome. No association between phylogenetic position, taxonomic adscription and the number of rDNA loci and gene copy number was found. Our results suggest a likely evolutionary rDNA stasis during land colonisation and diversification across 480 myr of bryophyte evolution. We hypothesise that strong selection forces may be acting against ribosomal gene locus amplification. Despite showing a predominant haploid phase and infrequent meiosis, overall rDNA homogeneity is not severely compromised in bryophytes.
- MeSH
- Anthocerotophyta classification genetics MeSH
- Arabidopsis classification genetics MeSH
- Bryophyta classification genetics MeSH
- Chromosomes, Plant genetics MeSH
- Cytogenetic Analysis MeSH
- DNA, Plant genetics MeSH
- Species Specificity MeSH
- Phylogeny MeSH
- Gene Dosage MeSH
- In Situ Hybridization, Fluorescence MeSH
- Conserved Sequence MeSH
- Real-Time Polymerase Chain Reaction MeSH
- Hepatophyta classification genetics MeSH
- Evolution, Molecular MeSH
- DNA, Ribosomal genetics MeSH
- RNA, Ribosomal genetics MeSH
- RNA, Plant genetics MeSH
- Genes, Plant MeSH
- Embryophyta classification genetics MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- DNA, Plant MeSH
- DNA, Ribosomal MeSH
- RNA, Ribosomal MeSH
- RNA, Plant MeSH
- RNA, ribosomal, 45S MeSH Browser
In all eukaryotes, the highly repeated 35S ribosomal DNA (rDNA) sequences encoding 18S-5.8S-26S ribosomal RNA (rRNA) typically show high levels of intragenomic uniformity due to homogenisation processes, leading to concerted evolution of 35S rDNA repeats. Here, we compared 35S rDNA divergence in several seed plants using next generation sequencing and a range of molecular and cytogenetic approaches. Most species showed similar 35S rDNA homogeneity indicating concerted evolution. However, Cycas revoluta exhibits an extraordinary diversity of rDNA repeats (nucleotide sequence divergence of different copies averaging 12 %), influencing both the coding and non-coding rDNA regions nearly equally. In contrast, its rRNA transcriptome was highly homogeneous suggesting that only a minority of genes (<20 %) encode functional rRNA. The most common SNPs were C > T substitutions located in symmetrical CG and CHG contexts which were also highly methylated. Both functional genes and pseudogenes appear to cluster on chromosomes. The extraordinary high levels of 35S rDNA diversity in C. revoluta, and probably other species of cycads, indicate that the frequency of repeat homogenisation has been much lower in this lineage, compared with all other land plant lineages studied. This has led to the accumulation of methylation-driven mutations and pseudogenisation. Potentially, the reduced homology between paralogs prevented their elimination by homologous recombination, resulting in long-term retention of rDNA pseudogenes in the genome.
- Keywords
- Concerted evolution, Cycadales, Cytosine methylation, Living fossil, rDNA,
- MeSH
- Cycas genetics MeSH
- DNA, Plant genetics MeSH
- Transcription, Genetic genetics MeSH
- In Situ Hybridization, Fluorescence MeSH
- Polymorphism, Single Nucleotide genetics MeSH
- DNA, Ribosomal Spacer genetics MeSH
- DNA, Ribosomal genetics MeSH
- RNA, Ribosomal, 18S genetics MeSH
- RNA, Ribosomal, 5.8S genetics MeSH
- RNA, Ribosomal genetics MeSH
- Base Sequence MeSH
- Sequence Analysis, DNA MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- DNA, Plant MeSH
- DNA, Ribosomal Spacer MeSH
- DNA, Ribosomal MeSH
- RNA, Ribosomal, 18S MeSH
- RNA, Ribosomal, 5.8S MeSH
- RNA, Ribosomal MeSH
- RNA, ribosomal, 26S MeSH Browser
Gene and whole-genome duplications are widespread in plant nuclear genomes, resulting in sequence heterogeneity. Identification of duplicated genes may be particularly challenging in highly redundant genomes, especially when there are no diploid parents as a reference. Here, we developed a pipeline to detect the different copies in the ribosomal RNA gene family in the hexaploid grass Spartina maritima from next-generation sequencing (Roche-454) reads. The heterogeneity of the different domains of the highly repeated 45S unit was explored by identifying single nucleotide polymorphisms (SNPs) and assembling reads based on shared polymorphisms. SNPs were validated using comparisons with Illumina sequence data sets and by cloning and Sanger (re)sequencing. Using this approach, 29 validated polymorphisms and 11 validated haplotypes were reported (out of 34 and 20, respectively, that were initially predicted by our program). The rDNA domains of S. maritima have similar lengths as those found in other Poaceae, apart from the 5'-ETS, which is approximately two-times longer in S. maritima. Sequence homogeneity was encountered in coding regions and both internal transcribed spacers (ITS), whereas high intragenomic variability was detected in the intergenic spacer (IGS) and the external transcribed spacer (ETS). Molecular cytogenetic analysis by fluorescent in situ hybridization (FISH) revealed the presence of one pair of 45S rDNA signals on the chromosomes of S. maritima instead of three expected pairs for a hexaploid genome, indicating loss of duplicated homeologous loci through the diploidization process. The procedure developed here may be used at any ploidy level and using different sequencing technologies.
- Keywords
- bioinformatics, duplication, paralogy, poaceae, polyploidy,
- MeSH
- Molecular Sequence Annotation MeSH
- Phylogeny * MeSH
- Genome, Plant MeSH
- Genomics methods MeSH
- Haplotypes * MeSH
- In Situ Hybridization, Fluorescence MeSH
- Polymorphism, Single Nucleotide MeSH
- Poaceae classification genetics MeSH
- Open Reading Frames MeSH
- Polyploidy * MeSH
- Reproducibility of Results MeSH
- DNA, Ribosomal MeSH
- RNA, Ribosomal genetics MeSH
- Genes, Plant MeSH
- Computational Biology methods MeSH
- High-Throughput Nucleotide Sequencing * MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- DNA, Ribosomal MeSH
- RNA, Ribosomal MeSH
- RNA, ribosomal, 45S MeSH Browser
To study the relationship between uniparental rDNA (encoding 18S, 5.8S and 26S ribosomal RNA) silencing (nucleolar dominance) and rRNA gene dosage, we studied a recently emerged (within the last 80 years) allotetraploid Tragopogon mirus (2n=24), formed from the diploid progenitors T. dubius (2n=12, D-genome donor) and T. porrifolius (2n=12, P-genome donor). Here, we used molecular, cytogenetic and genomic approaches to analyse rRNA gene activity in two sibling T. mirus plants (33A and 33B) with widely different rRNA gene dosages. Plant 33B had ~400 rRNA genes at the D-genome locus, which is typical for T. mirus, accounting for ~25% of total rDNA. We observed characteristic expression dominance of T. dubius-origin genes in all organs. Its sister plant 33A harboured a homozygous macrodeletion that reduced the number of T. dubius-origin genes to about 70 copies (~4% of total rDNA). It showed biparental rDNA expression in root, flower and callus, but not in leaf where D-genome rDNA dominance was maintained. There was upregulation of minor rDNA variants in some tissues. The RNA polymerase I promoters of reactivated T. porrifolius-origin rRNA genes showed reduced DNA methylation, mainly at symmetrical CG and CHG nucleotide motifs. We hypothesise that active, decondensed rDNA units are most likely to be deleted via recombination. The silenced homeologs could be used as a 'first reserve' to ameliorate mutational damage and contribute to evolutionary success of polyploids. Deletion and reactivation cycles may lead to bidirectional homogenisation of rRNA arrays in the long term.
- MeSH
- Gene Dosage * MeSH
- Genes, rRNA * MeSH
- DNA Methylation MeSH
- Evolution, Molecular MeSH
- Molecular Sequence Data MeSH
- Nucleolus Organizer Region MeSH
- Polyploidy MeSH
- Promoter Regions, Genetic MeSH
- RNA, Ribosomal, 18S genetics MeSH
- RNA, Ribosomal, 5.8S genetics MeSH
- RNA, Ribosomal genetics MeSH
- Genes, Plant * MeSH
- Sequence Analysis, DNA MeSH
- Sequence Deletion MeSH
- Tragopogon genetics MeSH
- Gene Silencing * MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH
- Names of Substances
- RNA, Ribosomal, 18S MeSH
- RNA, Ribosomal, 5.8S MeSH
- RNA, Ribosomal MeSH
- RNA, ribosomal, 26S MeSH Browser