repetitive DNAs
Dotaz
Zobrazit nápovědu
Repetitive DNAs comprise large portion of eukaryote genomes. In genome projects, the assembly of repetitive DNAs is challenging due to the similarity between repeats, which generate ambiguities for alignment. Fluorescence in situ hybridization (FISH) is a powerful technique for the physical mapping of various sequences on chromosomes. This technique is thus very helpful in chromosome-based genome assemblies, providing information on the fine architecture of genomes and their evolution. However, various protocols are currently used for FISH mapping, most of which are relatively laborious and expensive, or work properly only with a specific type of probes or sequences, and there is a need for a universal and affordable FISH protocol. Here we tested a FISH protocol for mapping of different DNA repeats, such as multigene families (rDNAs, U snDNAs, histone genes), satellite DNAs, microsatellites, transposable elements, DOP-PCR products, and telomeric motif (TTAGG)n, on the chromosomes of various insects and other arthropods. Different cell types and stages obtained from diverse tissues were used. The FISH procedure proved high quality and reliable results in all experiments performed. We obtained data on the chromosomal distribution of DNA repeats in representatives of insects and other arthropods. Thus, our results allow us to conclude that the protocol is universal and requires only time adjustment for chromosome/DNA denaturation. The use of this FISH protocol will facilitate studies focused on understanding the evolution and role of repetitive DNA in arthropod genomes.
- MeSH
- členovci genetika MeSH
- DNA genetika MeSH
- fluorescence MeSH
- hmyz genetika MeSH
- hybridizace in situ fluorescenční metody MeSH
- mapování chromozomů metody MeSH
- molekulární evoluce MeSH
- multigenová rodina genetika MeSH
- repetitivní sekvence nukleových kyselin genetika MeSH
- telomery genetika MeSH
- zvířata MeSH
- Check Tag
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
Structurally and functionally diverged sex chromosomes have evolved in many animals as well as in some plants. Sex chromosomes represent a specific genomic region(s) with locally suppressed recombination. As a consequence, repetitive sequences involving transposable elements, tandem repeats (satellites and microsatellites), and organellar DNA accumulate on the Y (W) chromosomes. In this paper, we review the main types of repetitive elements, their gathering on the Y chromosome, and discuss new findings showing that not only accumulation of various repeats in non-recombining regions but also opposite processes form Y chromosome. The aim of this review is also to discuss the mechanisms of repetitive DNA spread involving (retro) transposition, DNA polymerase slippage or unequal crossing-over, as well as modes of repeat removal by ectopic recombination. The intensity of these processes differs in non-recombining region(s) of sex chromosomes when compared to the recombining parts of genome. We also speculate about the relationship between heterochromatinization and the formation of heteromorphic sex chromosomes.
- MeSH
- chromozomy rostlin * MeSH
- DNA rostlinná * MeSH
- molekulární evoluce * MeSH
- pohlavní chromozomy genetika MeSH
- regulace genové exprese u rostlin MeSH
- repetitivní sekvence nukleových kyselin * MeSH
- rostliny genetika MeSH
- transpozibilní elementy DNA MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
BACKGROUND: The number and complexity of repetitive elements varies between species, being in general most represented in those with larger genomes. Combining the flow-sorted chromosome arms approach to genome analysis with second generation DNA sequencing technologies provides a unique opportunity to study the repetitive portion of each chromosome, enabling comparisons among them. Additionally, different sequencing approaches may produce different depth of insight to repeatome content and structure. In this work we analyze and characterize the repetitive sequences of Triticum aestivum cv. Chinese Spring homeologous group 4 chromosome arms, obtained through Roche 454 and Illumina sequencing technologies, hereinafter marked by subscripts 454 and I, respectively. Repetitive sequences were identified with the RepeatMasker software using the interspersed repeat database mips-REdat_v9.0p. The input sequences consisted of our 4DS454 and 4DL454 scaffolds and 4ASI, 4ALI, 4BSI, 4BLI, 4DSI and 4DLI contigs, downloaded from the International Wheat Genome Sequencing Consortium (IWGSC). RESULTS: Repetitive sequences content varied from 55% to 63% for all chromosome arm assemblies except for 4DLI, in which the repeat content was 38%. Transposable elements, small RNA, satellites, simple repeats and low complexity sequences were analyzed. SSR frequency was found one per 24 to 27 kb for all chromosome assemblies except 4DLI, where it was three times higher. Dinucleotides and trinucleotides were the most abundant SSR repeat units. (GA)n/(TC)n was the most abundant SSR except for 4DLI where the most frequently identified SSR was (CCG/CGG)n. Retrotransposons followed by DNA transposons were the most highly represented sequence repeats, mainly composed of CACTA/En-Spm and Gypsy superfamilies, respectively. This whole chromosome sequence analysis allowed identification of three new LTR retrotransposon families belonging to the Copia superfamily, one belonging to the Gypsy superfamily and two TRIM retrotransposon families. Their physical distribution in wheat genome was analyzed by fluorescent in situ hybridization (FISH) and one of them, the Carmen retrotransposon, was found specific for centromeric regions of all wheat chromosomes. CONCLUSION: The presented work is the first deep report of wheat repetitive sequences analyzed at the chromosome arm level, revealing the first insight into the repeatome of T. aestivum chromosomes of homeologous group 4.
RepeatExplorer2 is a novel version of a computational pipeline that uses graph-based clustering of next-generation sequencing reads for characterization of repetitive DNA in eukaryotes. The clustering algorithm facilitates repeat identification in any genome by using relatively small quantities of short sequence reads, and additional tools within the pipeline perform automatic annotation and quantification of the identified repeats. The pipeline is integrated into the Galaxy platform, which provides a user-friendly web interface for script execution and documentation of the results. Compared to the original version of the pipeline, RepeatExplorer2 provides automated annotation of transposable elements, identification of tandem repeats and enhanced visualization of analysis results. Here, we present an overview of the RepeatExplorer2 workflow and provide procedures for its application to (i) de novo repeat identification in a single species, (ii) comparative repeat analysis in a set of species, (iii) development of satellite DNA probes for cytogenetic experiments and (iv) identification of centromeric repeats based on ChIP-seq data. Each procedure takes approximately 2 d to complete. RepeatExplorer2 is available at https://repeatexplorer-elixir.cerit-sc.cz .
- MeSH
- DNA sondy chemie genetika MeSH
- DNA chemie genetika MeSH
- genomika metody MeSH
- lidé MeSH
- repetitivní sekvence nukleových kyselin MeSH
- sekvenční analýza DNA metody MeSH
- shluková analýza MeSH
- software MeSH
- transpozibilní elementy DNA MeSH
- vysoce účinné nukleotidové sekvenování metody MeSH
- zvířata MeSH
- Check Tag
- lidé MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
BACKGROUND: Silene latifolia is a dioecious [corrected] plant with well distinguished X and Y chromosomes that is used as a model to study sex determination and sex chromosome evolution in plants. However, efficient utilization of this species has been hampered by the lack of large-scale sequencing resources and detailed analysis of its genome composition, especially with respect to repetitive DNA, which makes up the majority of the genome. METHODOLOGY/PRINCIPAL FINDINGS: We performed low-pass 454 sequencing followed by similarity-based clustering of 454 reads in order to identify and characterize sequences of all major groups of S. latifolia repeats. Illumina sequencing data from male and female genomes were also generated and employed to quantify the genomic proportions of individual repeat families. The majority of identified repeats belonged to LTR-retrotransposons, constituting about 50% of genomic DNA, with Ty3/gypsy elements being more frequent than Ty1/copia. While there were differences between the male and female genome in the abundance of several repeat families, their overall repeat composition was highly similar. Specific localization patterns on sex chromosomes were found for several satellite repeats using in situ hybridization with probes based on k-mer frequency analysis of Illumina sequencing data. CONCLUSIONS/SIGNIFICANCE: This study provides comprehensive information about the sequence composition and abundance of repeats representing over 60% of the S. latifolia genome. The results revealed generally low divergence in repeat composition between the sex chromosomes, which is consistent with their relatively recent origin. In addition, the study generated various data resources that are available for future exploration of the S. latifolia genome.
BACKGROUND: The investigation of plant genome structure and evolution requires comprehensive characterization of repetitive sequences that make up the majority of higher plant nuclear DNA. Since genome-wide characterization of repetitive elements is complicated by their high abundance and diversity, novel approaches based on massively-parallel sequencing are being adapted to facilitate the analysis. It has recently been demonstrated that the low-pass genome sequencing provided by a single 454 sequencing reaction is sufficient to capture information about all major repeat families, thus providing the opportunity for efficient repeat investigation in a wide range of species. However, the development of appropriate data mining tools is required in order to fully utilize this sequencing data for repeat characterization. RESULTS: We adapted a graph-based approach for similarity-based partitioning of whole genome 454 sequence reads in order to build clusters made of the reads derived from individual repeat families. The information about cluster sizes was utilized for assessing the proportion and composition of repeats in the genomes of two model species, Pisum sativum and Glycine max, differing in genome size and 454 sequencing coverage. Moreover, statistical analysis and visual inspection of the topology of the cluster graphs using a newly developed program tool, SeqGrapheR, were shown to be helpful in distinguishing basic types of repeats and investigating sequence variability within repeat families. CONCLUSIONS: Repetitive regions of plant genomes can be efficiently characterized by the presented graph-based analysis and the graph representation of repeats can be further used to assess the variability and evolutionary divergence of repeat families, discover and characterize novel elements, and aid in subsequent assembly of their consensus sequences.
Moths of the family Crambidae include a number of pests that cause economic losses to agricultural crops. Despite their economic importance, little is known about their genome architecture and chromosome evolution. Here, we characterized the chromosomes and repetitive DNA of the sugarcane borer Diatraea saccharalis using a combination of low-pass genome sequencing, bioinformatics, and cytogenetic methods, focusing on the sex chromosomes. Diploid chromosome numbers differed between the sexes, i.e., 2n = 33 in females and 2n = 34 in males. This difference was caused by the occurrence of a WZ1Z2 trivalent in female meiosis, indicating a multiple sex-chromosome system WZ1Z2/Z1Z1Z2Z2. A strong interstitial telomeric signal was observed on the W chromosome, indicating a fusion of the ancestral W chromosome with an autosome. Among repetitive DNAs, transposable elements (TEs) accounted for 39.18% (males) to 41.35% (females), while satDNAs accounted for only 0.214% (males) and 0.215% (females) of the genome. FISH mapping revealed different chromosomal organization of satDNAs, such as single localized clusters, spread repeats, and non-clustered repeats. Two TEs mapped by FISH were scattered. Although we found a slight enrichment of some satDNAs in the female genome, they were not differentially enriched on the W chromosome. However, we found enriched FISH signals for TEs on the W chromosome, suggesting their involvement in W chromosome degeneration and differentiation. These data shed light on karyotype and repetitive DNA dynamics due to multiple chromosome fusions in D. saccharalis, contribute to the understanding of genome structure in Lepidoptera and are important for future genomic studies.
- MeSH
- karyotyp MeSH
- molekulární evoluce MeSH
- můry * genetika MeSH
- pohlavní chromozomy genetika MeSH
- Saccharum * genetika MeSH
- transpozibilní elementy DNA MeSH
- zvířata MeSH
- Check Tag
- mužské pohlaví MeSH
- ženské pohlaví MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH