repeatome
Dotaz
Zobrazit nápovědu
Speciation mechanisms remain controversial. Two speciation models occur in Israeli subterranean mole rats, genus Spalax: a regional speciation cline southward of four peripatric climatic chromosomal species and a local, geologic-edaphic, genic, and sympatric speciation. Here we highlight their genome evolution. The five species were separated into five genetic clusters by single nucleotide polymorphisms, copy number variations (CNVs), repeatome, and methylome in sympatry. The regional interspecific divergence correspond to Pleistocene climatic cycles. Climate warmings caused chromosomal speciation. Triple effective population size, Ne , declines match glacial cold cycles. Adaptive genes evolved under positive selection to underground stresses and to divergent climates, involving interspecies reproductive isolation. Genomic islands evolved mainly due to adaptive evolution involving ancient polymorphisms. Repeatome, including both CNV and LINE1 repetitive elements, separated the five species. Methylation in sympatry identified geologically chalk-basalt species that differentially affect thermoregulation, hypoxia, DNA repair, P53, and other pathways. Genome adaptive evolution highlights climatic and geologic-edaphic stress evolution and the two speciation models, peripatric and sympatric.
- MeSH
- biologická adaptace MeSH
- biologická evoluce * MeSH
- epigeneze genetická MeSH
- genetická variace MeSH
- genom MeSH
- jednonukleotidový polymorfismus MeSH
- molekulární evoluce MeSH
- populační genetika MeSH
- reprodukční izolace MeSH
- Spalax genetika fyziologie MeSH
- sympatrie * MeSH
- tok genů MeSH
- variabilita počtu kopií segmentů DNA MeSH
- vazebná nerovnováha MeSH
- zvířata MeSH
- Check Tag
- mužské pohlaví MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Geografické názvy
- Izrael MeSH
BACKGROUND: The number and complexity of repetitive elements varies between species, being in general most represented in those with larger genomes. Combining the flow-sorted chromosome arms approach to genome analysis with second generation DNA sequencing technologies provides a unique opportunity to study the repetitive portion of each chromosome, enabling comparisons among them. Additionally, different sequencing approaches may produce different depth of insight to repeatome content and structure. In this work we analyze and characterize the repetitive sequences of Triticum aestivum cv. Chinese Spring homeologous group 4 chromosome arms, obtained through Roche 454 and Illumina sequencing technologies, hereinafter marked by subscripts 454 and I, respectively. Repetitive sequences were identified with the RepeatMasker software using the interspersed repeat database mips-REdat_v9.0p. The input sequences consisted of our 4DS454 and 4DL454 scaffolds and 4ASI, 4ALI, 4BSI, 4BLI, 4DSI and 4DLI contigs, downloaded from the International Wheat Genome Sequencing Consortium (IWGSC). RESULTS: Repetitive sequences content varied from 55% to 63% for all chromosome arm assemblies except for 4DLI, in which the repeat content was 38%. Transposable elements, small RNA, satellites, simple repeats and low complexity sequences were analyzed. SSR frequency was found one per 24 to 27 kb for all chromosome assemblies except 4DLI, where it was three times higher. Dinucleotides and trinucleotides were the most abundant SSR repeat units. (GA)n/(TC)n was the most abundant SSR except for 4DLI where the most frequently identified SSR was (CCG/CGG)n. Retrotransposons followed by DNA transposons were the most highly represented sequence repeats, mainly composed of CACTA/En-Spm and Gypsy superfamilies, respectively. This whole chromosome sequence analysis allowed identification of three new LTR retrotransposon families belonging to the Copia superfamily, one belonging to the Gypsy superfamily and two TRIM retrotransposon families. Their physical distribution in wheat genome was analyzed by fluorescent in situ hybridization (FISH) and one of them, the Carmen retrotransposon, was found specific for centromeric regions of all wheat chromosomes. CONCLUSION: The presented work is the first deep report of wheat repetitive sequences analyzed at the chromosome arm level, revealing the first insight into the repeatome of T. aestivum chromosomes of homeologous group 4.
Satellite DNA (satDNA) is the most variable fraction of the eukaryotic genome. Related species share a common ancestral satDNA library and changing of any library component in a particular lineage results in interspecific differences. Although the general developmental trend is clear, our knowledge of the origin and dynamics of satDNAs is still fragmentary. Here, we explore whole genome shotgun Illumina reads using the RepeatExplorer (RE) pipeline to infer satDNA family life stories in the genomes of Chenopodium species. The seven diploids studied represent separate lineages and provide an example of a species complex typical for angiosperms. Application of the RE pipeline allowed by similarity searches a determination of the satDNA family with a basic monomer of ~40 bp and to trace its transformation from the reconstructed ancestral to the species-specific sequences. As a result, three types of satDNA family evolutionary development were distinguished: (i) concerted evolution with mutation and recombination events; (ii) concerted evolution with a trend toward increased complexity and length of the satellite monomer; and (iii) non-concerted evolution, with low levels of homogenization and multidirectional trends. The third type is an example of entire repeatome transformation, thus producing a novel set of satDNA families, and genomes showing non-concerted evolution are proposed as a significant source for genomic diversity.
- MeSH
- Chenopodium genetika MeSH
- diploidie MeSH
- DNA rostlinná genetika MeSH
- druhová specificita MeSH
- fylogeneze MeSH
- genom rostlinný MeSH
- komponenty genomu MeSH
- molekulární evoluce MeSH
- satelitní DNA genetika MeSH
- sekvenční analýza DNA MeSH
- vysoce účinné nukleotidové sekvenování MeSH
- Publikační typ
- časopisecké články MeSH
BACKGROUND: Cultivated grasses are an important source of food for domestic animals worldwide. Increased knowledge of their genomes can speed up the development of new cultivars with better quality and greater resistance to biotic and abiotic stresses. The most widely grown grasses are tetraploid ryegrass species (Lolium) and diploid and hexaploid fescue species (Festuca). In this work, we characterized repetitive DNA sequences and their contribution to genome size in five fescue and two ryegrass species as well as one fescue and two ryegrass cultivars. RESULTS: Partial genome sequences produced by Illumina sequencing technology were used for genome-wide comparative analyses with the RepeatExplorer pipeline. Retrotransposons were the most abundant repeat type in all seven grass species. The Athila element of the Ty3/gypsy family showed the most striking differences in copy number between fescues and ryegrasses. The sequence data enabled the assembly of the long terminal repeat (LTR) element Fesreba, which is highly enriched in centromeric and (peri)centromeric regions in all species. A combination of fluorescence in situ hybridization (FISH) with a probe specific to the Fesreba element and immunostaining with centromeric histone H3 (CENH3) antibody showed their co-localization and indicated a possible role of Fesreba in centromere function. CONCLUSIONS: Comparative repeatome analyses in a set of fescues and ryegrasses provided new insights into their genome organization and divergence, including the assembly of the LTR element Fesreba. A new LTR element Fesreba was identified and found in abundance in centromeric regions of the fescues and ryegrasses. It may play a role in the function of their centromeres.
BACKGROUND AND AIMS: Most crucifer species (Brassicaceae) have small nuclear genomes (mean 1C-value 617 Mb). The species with the largest genomes occur within the monophyletic Hesperis clade (Mandáková et al., Plant Physiology174: 2062-2071; also known as Clade E or Lineage III). Whereas most chromosome numbers in the clade are 6 or 7, monoploid genome sizes vary 16-fold (256-4264 Mb). To get an insight into genome size evolution in the Hesperis clade (~350 species in ~48 genera), we aimed to identify, quantify and localize in situ the repeats from which these genomes are built. We analysed nuclear repeatomes in seven species, covering the phylogenetic and genome size breadth of the clade, by low-pass whole-genome sequencing. METHODS: Genome size was estimated by flow cytometry. Genomic DNA was sequenced on an Illumina sequencer and DNA repeats were identified and quantified using RepeatExplorer; the most abundant repeats were localized on chromosomes by fluorescence in situ hybridization. To evaluate the feasibility of bacterial artificial chromosome (BAC)-based comparative chromosome painting in Hesperis-clade species, BACs of arabidopsis were used as painting probes. KEY RESULTS: Most biennial and perennial species of the Hesperis clade possess unusually large nuclear genomes due to the proliferation of long terminal repeat retrotransposons. The prevalent genome expansion was rarely, but repeatedly, counteracted by purging of transposable elements in ephemeral and annual species. CONCLUSIONS: The most common ancestor of the Hesperis clade has experienced genome upsizing due to transposable element amplification. Further genome size increases, dominating diversification of all Hesperis-clade tribes, contrast with the overall stability of chromosome numbers. In some subclades and species genome downsizing occurred, presumably as an adaptive transition to an annual life cycle. The amplification versus purging of transposable elements and tandem repeats impacted the chromosomal architecture of the Hesperis-clade species.