centromeric tandem repeat
Amplification of monomer sequences into long contiguous arrays is the main feature distinguishing satellite DNA from other tandem repeats, yet it is also the main obstacle in its investigation because these arrays are in principle difficult to assemble. Here we explore an alternative, assembly-free approach that utilizes ultra-long Oxford Nanopore reads to infer the length distribution of satellite repeat arrays, their association with other repeats and the prevailing sequence periodicities. Using the satellite DNA-rich legume plant Lathyrus sativus as a model, we demonstrated this approach by analyzing 11 major satellite repeats using a set of nanopore reads ranging from 30 to over 200 kb in length and representing 0.73× genome coverage. We found surprising differences between the analyzed repeats because only two of them were predominantly organized in long arrays typical for satellite DNA. The remaining nine satellites were found to be derived from short tandem arrays located within LTR-retrotransposons that occasionally expanded in length. While the corresponding LTR-retrotransposons were dispersed across the genome, this array expansion occurred mainly in the primary constrictions of the L. sativus chromosomes, which suggests that these genome regions are favourable for satellite DNA accumulation.
- Keywords
- Lathyrus sativus, centromeres, fluorescence in situ hybridization (FISH), heterochromatin, long-range organization, nanopore sequencing, satellite DNA, sequence evolution, technical advance
- MeSH
- Centromere MeSH
- Chromosomes, Plant MeSH
- DNA, Plant / genetics MeSH
- Gene Frequency * MeSH
- Genome, Plant MeSH
- Heterochromatin MeSH
- Lathyrus / genetics MeSH
- Evolution, Molecular MeSH
- Nanopores * MeSH
- Retroelements * MeSH
- DNA, Satellite * MeSH
- Tandem Repeat Sequences * MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- DNA, Plant MeSH
- Heterochromatin MeSH
- Retroelements * MeSH
- DNA, Satellite * MeSH
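The assembly-free idea in the abstract above, inferring array lengths directly from individual long reads, can be illustrated with a minimal sketch. It assumes that monomer matches on a read have already been located by some similarity search and are available as (start, end) coordinates. The function names `merge_arrays` and `array_lengths` and the `max_gap` tolerance are hypothetical and are not taken from the authors' pipeline.

```python
from typing import List, Tuple

Interval = Tuple[int, int]

def merge_arrays(hits: List[Interval], max_gap: int) -> List[Interval]:
    """Merge (start, end) monomer hits on one read into tandem arrays.

    Hits separated by at most `max_gap` bases are treated as part of
    the same array (an assumed tolerance for sequencing errors).
    """
    arrays: List[Interval] = []
    for start, end in sorted(hits):
        if arrays and start - arrays[-1][1] <= max_gap:
            # Extend the current array so that it covers this hit.
            prev_start, prev_end = arrays[-1]
            arrays[-1] = (prev_start, max(prev_end, end))
        else:
            # Gap too large: this hit opens a new array.
            arrays.append((start, end))
    return arrays

def array_lengths(hits: List[Interval], max_gap: int) -> List[int]:
    """Return the length in bases of each merged array on the read."""
    return [end - start for start, end in merge_arrays(hits, max_gap)]

# Made-up coordinates: three adjacent monomers, then two more far downstream.
hits = [(100, 150), (150, 200), (205, 255), (5000, 5050), (5050, 5100)]
print(array_lengths(hits, max_gap=20))  # [155, 100]
```

Arrays that touch either end of a read are truncated by the read boundary, so lengths measured this way are lower bounds for such arrays; a fuller analysis would need to flag them separately.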
Holocentric chromosomes lack a primary constriction, in contrast to monocentrics. They form kinetochores distributed along almost the entire poleward surface of the chromatids, to which spindle fibers attach. No centromere-specific DNA sequence has been found for any holocentric organism studied so far. It was proposed that centromeric repeats, typical for many monocentric species, could not occur in holocentrics, most likely because of differences in the centromere organization. Here we show that the holokinetic centromeres of the Cyperaceae Rhynchospora pubera are highly enriched by a centromeric histone H3 variant-interacting centromere-specific satellite family designated "Tyba" and by centromeric retrotransposons (i.e., CRRh) occurring as genome-wide interspersed arrays. Centromeric arrays vary in length from 3 to 16 kb and are intermingled with gene-coding sequences and transposable elements. We show that holocentromeres of metaphase chromosomes are composed of multiple centromeric units rather than possessing a diffuse organization, thus favoring the polycentric model. A cell-cycle-dependent shuffling of multiple centromeric units results in the formation of functional (poly)centromeres during mitosis. The genome-wide distribution of centromeric repeat arrays interspersing the euchromatin provides a previously unidentified type of centromeric chromatin organization among eukaryotes. Thus, different types of holocentromeres exist in different species, namely with and without centromeric repetitive sequences.
- Keywords
- centromere, chromosome, evolution, holokinetic, satellite DNA
- MeSH
- Centromere * MeSH
- Euchromatin / genetics MeSH
- Genome, Plant * MeSH
- Molecular Sequence Data MeSH
- Cyperaceae / genetics MeSH
- DNA, Satellite / genetics MeSH
- Tandem Repeat Sequences * MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- Euchromatin MeSH
- DNA, Satellite MeSH
Sex chromosomes in mammals are about 300 million years old and typically have a highly degenerated Y chromosome. The sex chromosomes in the dioecious plant Silene latifolia, in contrast, represent an early stage of evolution in which functional X-Y gene pairs are still frequent. In this study, we characterize a novel tandem repeat called TRAYC, which has accumulated on the Y chromosome in S. latifolia. Its presence demonstrates that processes of satellite accumulation are at work even in this early stage of sex chromosome evolution. The presence of TRAYC in other species of the Elisanthe section suggests that this repeat spread after the sex chromosomes evolved but before speciation within this section. TRAYC possesses a palindromic character and a strong potential to form secondary structures, which could play a role in satellite evolution. TRAYC accumulation is most prominent near the centromere of the Y chromosome. We propose a role for the centromere as a starting point for the cessation of recombination between the X and Y chromosomes.
- MeSH
- Y Chromosome / genetics MeSH
- DNA Primers / genetics MeSH
- DNA, Plant / chemistry / genetics MeSH
- Species Specificity MeSH
- In Situ Hybridization, Fluorescence MeSH
- Nucleic Acid Conformation MeSH
- Evolution, Molecular * MeSH
- Molecular Sequence Data MeSH
- Sex Chromosomes / genetics MeSH
- Base Sequence MeSH
- Sequence Homology, Nucleic Acid MeSH
- Silene / classification / genetics MeSH
- Tandem Repeat Sequences MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- DNA Primers MeSH
- DNA, Plant MeSH
Linear chromosomes of eukaryotic organisms invariably possess centromeres and telomeres to ensure proper chromosome segregation during nuclear divisions and to protect the chromosome ends from deterioration and fusion, respectively. While centromeric sequences may differ between species, with arrays of tandemly repeated sequences and retrotransposons being the most abundant sequence types in plant centromeres, telomeric sequences are usually highly conserved among plants and other organisms. The genome size of the carnivorous genus Genlisea (Lentibulariaceae) is highly variable. Here we study evolutionary sequence plasticity of these chromosomal domains at an intrageneric level. We show that Genlisea nigrocaulis (1C = 86 Mbp; 2n = 40) and G. hispidula (1C = 1550 Mbp; 2n = 40) differ as to their DNA composition at centromeres and telomeres. G. nigrocaulis and its close relative G. pygmaea revealed mainly 161 bp tandem repeats, while G. hispidula and its close relative G. subglabra displayed a combination of four retroelements at centromeric positions. G. nigrocaulis and G. pygmaea chromosome ends are characterized by the Arabidopsis-type telomeric repeats (TTTAGGG); G. hispidula and G. subglabra instead revealed two intermingled sequence variants (TTCAGG and TTTCAGG). These differences in centromeric and, surprisingly, also in telomeric DNA sequences, uncovered between groups with on average a > 9-fold genome size difference, emphasize the fast genome evolution within this genus. Such intrageneric evolutionary alteration of telomeric repeats with cytosine in the guanine-rich strand, not yet known for plants, might impact the epigenetic telomere chromatin modification.
- Keywords
- G. hispidula, Genlisea nigrocaulis, Lentibulariaceae, centromeric retrotransposons, centromeric tandem repeat, genome evolution, plant telomeric repeat variants, telomerase
- MeSH
- Biological Evolution * MeSH
- Time Factors MeSH
- Centromere / genetics MeSH
- Chromosomes, Plant / genetics MeSH
- Species Specificity MeSH
- Genetic Variation MeSH
- Genome, Plant / genetics / physiology MeSH
- Magnoliopsida / genetics / physiology MeSH
- Molecular Sequence Data MeSH
- Base Sequence MeSH
- Telomere / genetics MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
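The three telomeric motifs named in the Genlisea abstract above (TTTAGGG, TTCAGG and TTTCAGG) overlap as strings, because TTCAGG is contained in TTTCAGG. Naive substring counting would therefore count some units twice. The following is a minimal sketch of one way to tally intermingled variants without double counting, by tiling the sequence from left to right and trying longer motifs first. The function name `tile_motifs` and the example sequences are illustrative assumptions, not the method used in the study, and only the forward strand is scanned.

```python
from collections import Counter
from typing import Iterable

# Motifs as given in the abstract (G-rich strand, 5' to 3').
MOTIFS = ("TTTAGGG", "TTCAGG", "TTTCAGG")

def tile_motifs(seq: str, motifs: Iterable[str] = MOTIFS) -> Counter:
    """Count non-overlapping motif units while scanning left to right.

    At each position the longest matching motif is consumed, so a
    TTTCAGG unit is never also counted as the TTCAGG it contains.
    """
    seq = seq.upper()
    ordered = sorted(motifs, key=len, reverse=True)
    counts: Counter = Counter()
    i = 0
    while i < len(seq):
        for motif in ordered:
            if seq.startswith(motif, i):
                counts[motif] += 1
                i += len(motif)  # jump past the unit just counted
                break
        else:
            i += 1  # no motif starts here; move one base forward
    return counts

# Synthetic examples, not real Genlisea sequence.
arabidopsis_type = "TTTAGGG" * 3
intermingled = "TTCAGG" * 2 + "TTTCAGG" * 2
print(tile_motifs(arabidopsis_type))
print(tile_motifs(intermingled))
```

Greedy tiling depends on the phase in which the scan enters an array, so on real reads with sequencing errors it gives an approximate count rather than an exact one.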
Centromere position may change despite conserved chromosomal collinearity. Centromere repositioning and evolutionary new centromeres (ENCs) were frequently encountered during vertebrate genome evolution but only rarely observed in plants. The largest crucifer tribe, Arabideae (∼550 species; Brassicaceae, the mustard family), diversified into several well-defined subclades in the virtual absence of chromosome number variation. Bacterial artificial chromosome-based comparative chromosome painting uncovered a constancy of genome structures among 10 analyzed genomes representing seven Arabideae subclades classified as four genera: Arabis, Aubrieta, Draba, and Pseudoturritis. Interestingly, the intra-tribal diversification was marked by a high frequency of ENCs on five of the eight homoeologous chromosomes in the crown-group genera, but not in the most ancestral Pseudoturritis genome. Of the 32 documented ENCs, at least 26 originated independently, including 4 ENCs recurrently formed at the same position in not closely related species. While chromosomal localization of ENCs does not reflect the phylogenetic position of the Arabideae subclades, centromere seeding was usually confined to long chromosome arms, transforming acrocentric chromosomes to (sub)metacentric chromosomes. Centromere repositioning is proposed as the key mechanism differentiating overall conserved homoeologous chromosomes across the crown-group Arabideae subclades. The evolutionary significance of centromere repositioning is discussed in the context of possible adaptive effects on recombination and epigenetic regulation of gene expression.
BACKGROUND: Tandemly repeated satellite DNA sequences are an important part of animal genomes. They are involved in chromosome interactions and the maintenance of the integral structure of the nucleus, regulation of chromatin conformation and gene expression, and chromosome condensation and movement during cell division. Satellite DNAs located in the centromeric heterochromatin evolve rapidly and likely affect hybrid fertility and fitness. However, their studies are taxonomically highly biased. In lacertid lizards, satDNA has been extensively studied in the subfamily Lacertinae, but the subfamily Eremiadinae has been largely overlooked. RESULTS: In this work, we describe a novel 177-bp-long centromeric satDNA family EremSat177, which is present in all studied species of the genus Eremias, but not in related genera. EremSat177 is not homologous to any previously identified centromeric satellites. Using fluorescence in situ hybridization, we demonstrate its centromeric localization in E. velox and E. arguta. We also show its tandem organization and intra-genomic homogenization by in silico analysis in the genome of E. argus. The phylogenetic analysis of consensus EremSat177 sequences from 12 Eremias species demonstrates that the same monomer subfamily is the most abundant in all these species, and its evolution mainly follows the species phylogeny as revealed by the mtDNA sequences. CONCLUSION: The EremSat177 represents a novel, lineage-specific centromeric satellite DNA, and its role in centromere functioning should be revealed in further research.
- Keywords
- Chromosomes, Genomics, Lizards, Phylogeny, Repetitive DNA
- MeSH
- Centromere * / genetics MeSH
- Species Specificity MeSH
- Phylogeny MeSH
- Heterochromatin / genetics MeSH
- In Situ Hybridization, Fluorescence MeSH
- Lizards * / genetics / classification MeSH
- DNA, Mitochondrial / genetics MeSH
- Evolution, Molecular * MeSH
- DNA, Satellite * / genetics MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Substance names
- Heterochromatin MeSH
- DNA, Mitochondrial MeSH
- DNA, Satellite * MeSH
FISH is a useful method to identify individual chromosomes in a karyotype and to discover their structural changes accompanying genome evolution and speciation. DNA probes for FISH should be chromosome specific and/or exhibit specific patterns of distribution along each chromosome. Such probes are not available in many plants including meadow fescue (Festuca pratensis Huds.), an important forage grass species. In the present study, various DNA repeats identified in Illumina shotgun sequences specific to chromosome 4F of F. pratensis were used as probes for FISH to develop the molecular karyotype of meadow fescue and to reveal a long-range molecular organization of its chromosomes. Five tandem repeats produced specific patterns on individual chromosomes. Their use in combination with probes for rRNA genes enabled the establishment of the molecular karyotype of meadow fescue. Most of the mobile genetic elements were dispersed along all the chromosomes except for the DNA transposon CACTA, which was localized preferentially to telomeric and subtelomeric regions, and a putative LTR element, which was localized to (peri)centromeric regions. Cytogenetic mapping of the 5 tandem repeats in other accessions of meadow fescue showed a highly similar distribution and confirmed the versatility and robustness of these probes.
- Keywords
- Fluorescence in situ hybridization, Karyotyping, Meadow fescue, Repetitive DNA, Tandem organized repeats
- MeSH
- Chromosomes, Plant MeSH
- DNA, Plant MeSH
- Festuca / genetics MeSH
- Phylogeny MeSH
- In Situ Hybridization, Fluorescence MeSH
- Karyotype MeSH
- Karyotyping / methods MeSH
- Tandem Repeat Sequences * MeSH
- Publication type
- Journal Article MeSH
- Substance names
- DNA, Plant MeSH
The centromere has a conserved function across eukaryotes; however, the associated DNA sequences exhibit remarkable diversity in both size and structure. In plants, some species possess well-defined centromeres dominated by tandem satellite repeats and centromeric retrotransposons, while others have centromeric regions composed almost entirely of retrotransposons. Using a combination of bioinformatic, molecular, and cytogenetic approaches, we analyzed the centromeric landscape of Humulus lupulus. We identified novel centromeric repeats and characterized two types of centromeric organization. Cytogenetic localization on metaphase chromosomes confirmed the genomic distribution of the major repeats and revealed unique centromeric organization specifically on chromosomes 2, 8, and Y. Two centromeric types are composed of the major repeats SaazCEN and SaazCRM1 (Ty3/Gypsy) which are further accompanied by chromosome-specific centromeric satellites, Saaz40, Saaz293, Saaz85, and HuluTR120. Chromosome 2 displays unbalanced segregation during mitosis and meiosis, implicating an important role for its centromere structure in segregation patterns. Moreover, chromosome 2-specific centromeric repeat Saaz293 is a new marker for studying aneuploidy in hops. Our findings provide new insights into chromosome segregation in hops and highlight the diversity and complexity of the centromere organization in H. lupulus.
- Keywords
- Cannabaceae, asymmetric cell division, centromere, retrotransposons, sex chromosomes
- MeSH
- Centromere * / genetics MeSH
- Chromosomes, Plant / genetics MeSH
- Humulus * / genetics MeSH
- Meiosis / genetics MeSH
- Repetitive Sequences, Nucleic Acid * / genetics MeSH
- Retroelements * / genetics MeSH
- Chromosome Segregation / genetics MeSH
- Publication type
- Journal Article MeSH
- Substance names
- Retroelements * MeSH
The importance of DNA structure in the regulation of basic cellular processes is an emerging field of research. Among local non-B DNA structures, inverted repeat (IR) sequences that form cruciforms and G-rich sequences that form G-quadruplexes (G4) are found in all prokaryotic and eukaryotic organisms and are targets for regulatory proteins. We analyzed IRs and G4 sequences in the genome of the most important biotechnology microorganism, S. cerevisiae. IR and G4-prone sequences are enriched in specific genomic locations and differ markedly between mitochondrial and nuclear DNA. While G4s are overrepresented in telomeres and regions surrounding tRNAs, IRs are most enriched in centromeres, rDNA, replication origins and surrounding tRNAs. Mitochondrial DNA is enriched in both IR and G4-prone sequences relative to the nuclear genome. This extensive analysis of local DNA structures adds to the emerging picture of their importance in genome maintenance, DNA replication and transcription of subsets of genes.
- Keywords
- G-quadruplex, Inverted repeat, Saccharomyces cerevisiae
- MeSH
- Centromere / genetics MeSH
- DNA, Fungal / chemistry / genetics MeSH
- G-Quadruplexes * MeSH
- Genome, Fungal MeSH
- Inverted Repeat Sequences * MeSH
- RNA, Ribosomal / genetics MeSH
- Saccharomyces cerevisiae MeSH
- Telomere / genetics MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- DNA, Fungal MeSH
- RNA, Ribosomal MeSH
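The two sequence classes discussed in the S. cerevisiae abstract above can be made concrete with a short sketch. It uses a widely used regular-expression heuristic for G4-prone sequence (four runs of at least three guanines separated by loops of 1 to 7 bases) and a reverse-complement test for an inverted repeat with a spacer. Both are simplifications chosen for illustration and are not necessarily the tools or thresholds the authors applied. The names `find_g4`, `revcomp` and `is_inverted_repeat` are hypothetical, and only the G-rich strand is scanned.

```python
import re
from typing import List, Tuple

# Heuristic G4 motif: G3+ N1-7 G3+ N1-7 G3+ N1-7 G3+ (assumed thresholds).
G4_PATTERN = re.compile(
    r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}"
)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def revcomp(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]

def find_g4(seq: str) -> List[Tuple[int, int]]:
    """Return (start, end) spans of non-overlapping G4-prone matches."""
    return [(m.start(), m.end()) for m in G4_PATTERN.finditer(seq.upper())]

def is_inverted_repeat(seq: str, arm: int, spacer: int) -> bool:
    """Check whether seq begins with an arm of length `arm`, followed by
    `spacer` bases, followed by the reverse complement of that arm."""
    seq = seq.upper()
    if len(seq) < 2 * arm + spacer:
        return False
    left = seq[:arm]
    right = seq[arm + spacer : 2 * arm + spacer]
    return right == revcomp(left)

# Synthetic test sequences.
print(find_g4("AAGGGTTAGGGTTAGGGTTAGGGAA"))           # [(2, 23)]
print(is_inverted_repeat("ACGGTCTTTTGACCGT", 6, 4))   # True
print(is_inverted_repeat("ACGGTCTTTTACGGTC", 6, 4))   # False (direct repeat)
```

An inverted repeat with a short spacer is the sequence arrangement that can extrude into a cruciform, which is why the test compares the second arm with the reverse complement of the first rather than with the first arm itself.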
We carried out a global survey of all major types of transposable elements in Silene latifolia, a model species with sex chromosomes that are in the early stages of their evolution. A shotgun genomic library was screened with genomic DNA to isolate and characterize the most abundant elements. We found that the most common types of elements were the subtelomeric tandem repeat X-43.1 and Gypsy retrotransposons, followed by Copia retrotransposons and LINE non-LTR elements. SINE elements and DNA transposons were less abundant. We also amplified transposable elements with degenerate primers and used them to screen the library. The localization of elements by FISH revealed that most of the Copia elements were accumulated on the Y chromosome. Surprisingly, one type of Gypsy element, which was similar to Ogre elements known from legumes, was almost absent on the Y chromosome but otherwise uniformly distributed on all chromosomes. Other types of elements were ubiquitous on all chromosomes. Moreover, we isolated and characterized two new tandem repeats. One of them, STAR-C, was localized at the centromeres of all chromosomes except the Y chromosome, where it was present on the p-arm. Its variant, STAR-Y, carrying a small deletion, was specifically localized on the q-arm of the Y chromosome. The second tandem repeat, TR1, co-localized with the 45S rDNA cluster in the subtelomeres of five pairs of autosomes. FISH analysis of other Silene species revealed that some elements (e.g., Ogre-like elements) are confined to the section Elisanthe while others (e.g., Copia or Athila-like elements) are also present in more distant species. Similarly, the centromeric satellite STAR-C was conserved in the genus Silene whereas the subtelomeric satellite X-43.1 was specific to the section Elisanthe. Altogether, our data provide an overview of the repetitive sequences in Silene latifolia and reveal that genomic distribution and evolutionary dynamics differ among various repetitive elements.
The unique pattern of repeat distribution is found on the Y chromosome, where some elements are accumulated while other elements are conspicuously absent, which probably reflects different forces shaping the Y chromosome.
- MeSH
- Chromosomes, Plant / genetics MeSH
- DNA, Plant / genetics MeSH
- Species Specificity MeSH
- In Situ Hybridization, Fluorescence MeSH
- Repetitive Sequences, Nucleic Acid / genetics MeSH
- Silene / classification / genetics MeSH
- Tandem Repeat Sequences / genetics MeSH
- DNA Transposable Elements / genetics MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- DNA, Plant MeSH
- DNA Transposable Elements MeSH