Most cited article - PubMed ID 19665255
Contrasting evolutionary dynamics between angiosperm and mammalian genomes
Understanding the evolutionary conservation of complex eukaryotic transcriptomes significantly illuminates the physiological relevance of alternative splicing (AS). Examining the evolutionary depth of a given AS event with ordinary homology searches is generally challenging and time-consuming. Here, we present Catsnap, an algorithmic pipeline for assessing the conservation of putative protein isoforms generated by AS. It employs a machine learning approach following a database search with the provided pair of protein sequences. We used the Catsnap algorithm for analyzing the conservation of emerging experimentally characterized alternative proteins from plants and animals. Indeed, most of them are conserved among other species. Catsnap can detect the conserved functional protein isoforms regardless of the AS type by which they are generated. Notably, we found that while the primary amino acid sequence is maintained, the type of AS determining the inclusion or exclusion of protein regions varies throughout plant phylogenetic lineages in these proteins. We also document that this phenomenon is less seen among animals. In sum, our algorithm highlights the presence of unexpectedly frequent hotspots where protein isoforms recurrently arise to carry physiologically relevant functions. The user web interface is available at https://catsnap.cesnet.cz/.
- Keywords
- alternative splicing, bioinformatics, determinism, isoforms, machine learning, molecular evolution, transcriptome
- MeSH
- Algorithms * MeSH
- Alternative Splicing * genetics MeSH
- Phylogeny MeSH
- Conserved Sequence genetics MeSH
- Evolution, Molecular MeSH
- Mutant Proteins MeSH
- Protein Isoforms genetics MeSH
- Plants MeSH
- Amino Acid Sequence MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- Mutant Proteins MeSH
- Protein Isoforms MeSH
LTR retrotransposons constitute a significant part of plant genomes and their evolutionary dynamics play an important role in genome size changes. Current methods of LTR retrotransposon age estimation are based only on LTR (long terminal repeat) divergence. This has prompted us to analyze sequence similarity of LTRs in 25,144 LTR retrotransposons from fifteen plant species as well as formation of solo LTRs. We found that approximately one fourth of nested retrotransposons showed a higher LTR divergence than the pre-existing retrotransposons into which they had been inserted. Moreover, LTR similarity was correlated with LTR length. We propose that gene conversion can contribute to this phenomenon. Gene conversion prediction in LTRs showed potential converted regions in 25% of LTR pairs. Gene conversion was higher in species with smaller genomes while the proportion of solo LTRs did not change with genome size in analyzed species. The negative correlation between the extent of gene conversion and the abundance of solo LTRs suggests interference between gene conversion and ectopic recombination. Since such phenomena limit the traditional methods of LTR retrotransposon age estimation, we recommend an improved approach based on the exclusion of regions affected by gene conversion.
- Keywords
- LTR retrotransposons, age estimation, ectopic recombination, gene conversion, nesting, plants, transposable elements
- Publication type
- Journal Article MeSH
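The divergence-based age estimate that the abstract above refers to can be illustrated with a minimal sketch. It assumes the two LTRs of one element are already aligned, applies a Jukes-Cantor correction, and converts the distance K to an insertion time as T = K / (2r). The function names, the choice of the Jukes-Cantor model, and the substitution rate r are illustrative assumptions, not details taken from the study, and the sketch does not implement the recommended exclusion of regions affected by gene conversion.

```python
import math


def p_distance(ltr5: str, ltr3: str) -> float:
    """Proportion of differing sites between two aligned LTRs (gap columns skipped)."""
    if len(ltr5) != len(ltr3):
        raise ValueError("LTR sequences must be aligned to the same length")
    pairs = [(a, b) for a, b in zip(ltr5.upper(), ltr3.upper())
             if a != "-" and b != "-"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(a != b for a, b in pairs) / len(pairs)


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor corrected distance K = -3/4 * ln(1 - 4p/3)."""
    if p >= 0.75:
        raise ValueError("divergence too high for the Jukes-Cantor correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def insertion_age(ltr5: str, ltr3: str, rate: float = 1.3e-8) -> float:
    """Estimated insertion time in years, T = K / (2 * rate).

    `rate` (substitutions per site per year) is a placeholder assumption;
    a value appropriate for the species under study should be used instead.
    """
    k = jukes_cantor(p_distance(ltr5, ltr3))
    return k / (2.0 * rate)
```

Because the estimate depends only on the divergence between the two LTRs, any process that homogenizes them, such as gene conversion, would make an element appear younger than it is, which is the limitation the abstract points to.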
BACKGROUND: Nesting is common in LTR retrotransposons, especially in large genomes containing a high number of elements. RESULTS: We analyzed 12 plant genomes and obtained 1491 pairs of nested and original (pre-existing) LTR retrotransposons. We systematically analyzed mutual nesting of individual LTR retrotransposons and found that certain families, more often belonging to the Ty3/gypsy than Ty1/copia superfamilies, showed a higher nesting frequency as well as a higher preference for older copies of the same family ("autoinsertions"). Nested LTR retrotransposons were preferentially located in the 3'UTR of other LTR retrotransposons, while coding and regulatory regions (LTRs) are not commonly targeted. Insertions displayed a weak preference for palindromes and were associated with a strong positional pattern of higher predicted nucleosome occupancy. Deviation from randomness in target site choice was also found in 13,983 non-nested plant LTR retrotransposons. CONCLUSIONS: We reveal that nesting of LTR retrotransposons is not random. Integration is correlated with sequence composition, secondary structure and the chromatin environment. Insertion into retrotransposon positions with a low negative impact on family fitness supports the concept of the genome being viewed as an ecosystem of various elements.
- Keywords
- Chromatin, LTR retrotransposons, Nesting, Nucleosomes, Plants, Transposable elements
- Publication type
- Journal Article MeSH
BACKGROUND AND AIMS: Most crucifer species (Brassicaceae) have small nuclear genomes (mean 1C-value 617 Mb). The species with the largest genomes occur within the monophyletic Hesperis clade (Mandáková et al., Plant Physiology 174: 2062-2071; also known as Clade E or Lineage III). Whereas most chromosome numbers in the clade are 6 or 7, monoploid genome sizes vary 16-fold (256-4264 Mb). To get an insight into genome size evolution in the Hesperis clade (~350 species in ~48 genera), we aimed to identify, quantify and localize in situ the repeats from which these genomes are built. We analysed nuclear repeatomes in seven species, covering the phylogenetic and genome size breadth of the clade, by low-pass whole-genome sequencing. METHODS: Genome size was estimated by flow cytometry. Genomic DNA was sequenced on an Illumina sequencer and DNA repeats were identified and quantified using RepeatExplorer; the most abundant repeats were localized on chromosomes by fluorescence in situ hybridization. To evaluate the feasibility of bacterial artificial chromosome (BAC)-based comparative chromosome painting in Hesperis-clade species, BACs of arabidopsis were used as painting probes. KEY RESULTS: Most biennial and perennial species of the Hesperis clade possess unusually large nuclear genomes due to the proliferation of long terminal repeat retrotransposons. The prevalent genome expansion was rarely, but repeatedly, counteracted by purging of transposable elements in ephemeral and annual species. CONCLUSIONS: The most common ancestor of the Hesperis clade has experienced genome upsizing due to transposable element amplification. Further genome size increases, dominating diversification of all Hesperis-clade tribes, contrast with the overall stability of chromosome numbers. In some subclades and species genome downsizing occurred, presumably as an adaptive transition to an annual life cycle. The amplification versus purging of transposable elements and tandem repeats impacted the chromosomal architecture of the Hesperis-clade species.
- Keywords
- Bunias, Hesperis, Matthiola, Brassicaceae, Genome size evolution, Lineage III, chromosome organization, interstitial telomeric repeats (ITRs), repetitive DNA, retrotransposons, tandem repeats
- MeSH
- Brassicaceae * MeSH
- Genome Size MeSH
- Phylogeny MeSH
- Genome, Plant * MeSH
- In Situ Hybridization, Fluorescence MeSH
- Evolution, Molecular MeSH
- Cell Proliferation MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
In contrast to animals, separate sexes and sex chromosomes in plants are very rare. Although the evolution of sex chromosomes has been the subject of numerous studies, the impact of repetitive sequences on sex chromosome architecture is not fully understood. New genomic approaches shed light on the role of satellites and transposable elements in the process of Y chromosome evolution. We discuss the impact of repetitive sequences on the structure and dynamics of sex chromosomes with specific focus on Rumex acetosa and Silene latifolia. Recent papers showed that both the expansion and shrinkage of the Y chromosome are influenced by sex-specific regulation of repetitive DNA spread. We present a view that the dynamics of Y chromosome formation is an interplay of genetic and epigenetic processes.
- Keywords
- Y chromosome, satellites, sex chromosomes, transposable elements
- Publication type
- Journal Article MeSH
- Review MeSH
This work discusses several selected topics of plant genetics and breeding in relation to the 150th anniversary of the seminal work of Gregor Johann Mendel. In 2015, we celebrated the 150th anniversary of the presentation of the seminal work of Gregor Johann Mendel. While Darwin's theory of evolution was based on differential survival and differential reproductive success, Mendel's theory of heredity relies on equality and stability throughout all stages of the life cycle. Darwin's concepts were continuous variation and "soft" heredity; Mendel espoused discontinuous variation and "hard" heredity. Thus, the combination of Mendelian genetics with Darwin's theory of natural selection was the process that resulted in the modern synthesis of evolutionary biology. Although biology, genetics, and genomics have been revolutionized in recent years, modern genetics will forever rely on simple principles founded on pea breeding using seven single gene characters. Purposeful use of mutants to study gene function is one of the essential tools of modern genetics. Today, over 100 plant species genomes have been sequenced. Mapping populations and their use in segregation of molecular markers and marker-trait association to map and isolate genes were developed on the basis of Mendel's work. Genome-wide or genomic selection is a recent approach for the development of improved breeding lines. The analysis of complex traits has been enhanced by high-throughput phenotyping and developments in statistical and modeling methods for the analysis of phenotypic data. Introgression of novel alleles from landraces and wild relatives widens genetic diversity and improves traits; transgenic methodologies allow for the introduction of novel genes from diverse sources, and gene editing approaches offer possibilities to manipulate genes in a precise manner.
- MeSH
- History, 19th Century MeSH
- History, 20th Century MeSH
- History, 21st Century MeSH
- Phenotype MeSH
- Genetic Variation MeSH
- Plants, Genetically Modified genetics MeSH
- Genetics history MeSH
- Genome, Plant MeSH
- Genomics MeSH
- Pisum sativum genetics MeSH
- Quantitative Trait Loci MeSH
- Chromosome Mapping MeSH
- Selection, Genetic MeSH
- Plant Breeding * MeSH
- Check Tag
- History, 19th Century MeSH
- History, 20th Century MeSH
- History, 21st Century MeSH
- Publication type
- Biography MeSH
- Journal Article MeSH
- Historical Article MeSH
- Review MeSH
- Personal name as subject
- Mendel, Gregor
A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution.
- Keywords
- Repetitive DNA, continuous characters, genomics, molecular systematics, next-generation sequencing, phylogenetics
- MeSH
- DNA, Plant genetics MeSH
- Drosophila classification genetics MeSH
- Phylogeny * MeSH
- Genome genetics MeSH
- Genes, Insect genetics MeSH
- Magnoliopsida genetics MeSH
- Repetitive Sequences, Nucleic Acid genetics MeSH
- Cluster Analysis MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- DNA, Plant MeSH
Retrotransposons with long terminal repeats (LTR) form a significant proportion of eukaryotic genomes, especially in plants. They have gag and pol genes and several regulatory regions necessary for transcription and reverse transcription. We searched for potential quadruplex-forming sequences (PQSs) and potential triplex-forming sequences (PTSs) in 18 377 full-length LTR retrotransposons collected from 21 plant species. We found that PQSs were often located in LTRs, both upstream and downstream of promoters from which the whole retrotransposon is transcribed. Upstream-located guanine PQSs were dominant in the minus DNA strand, whereas downstream-located guanine PQSs prevailed in the plus strand, indicating their role both at transcriptional and post-transcriptional levels. Our circular dichroism spectroscopy measurements confirmed that these PQSs readily adopted guanine quadruplex structures; some of them were parallel-stranded, while others were anti-parallel-stranded. The PQSs often formed doublets at a mutual distance of up to 400 bp. PTSs were most abundant in the 3'UTR (but were also present in the 5'UTR). We discuss the potential role of quadruplexes and triplexes as the regulators of various processes participating in the LTR retrotransposon life cycle and as potential recombination sites during post-insertional retrotransposon-based genome rearrangements.
We analysed the size, relative age and chromosomal localization of nuclear sequences of plastid and mitochondrial origin (NUPTs: nuclear plastid DNA; NUMTs: nuclear mitochondrial DNA) in six completely sequenced plant species. We found that the largest insertions showed lower divergence from organelle DNA than shorter insertions in all species, indicating their recent origin. The largest NUPT and NUMT insertions were localized in the vicinity of the centromeres in the small genomes of Arabidopsis and rice. They were also present in other chromosomal regions in the large genomes of soybean and maize. Localization of NUPTs and NUMTs correlated positively with distribution of transposable elements (TEs) in Arabidopsis and sorghum, negatively in grapevine and soybean, and did not correlate in rice or maize. We propose a model where new plastid and mitochondrial DNA sequences are inserted close to centromeres and are later fragmented by TE insertions and reshuffled away from the centromere or removed by ectopic recombination. The mode and tempo of TE dynamism determines the turnover of NUPTs and NUMTs resulting in their species-specific chromosomal distributions.
- MeSH
- Cell Nucleus MeSH
- Chromosomes, Plant genetics MeSH
- Species Specificity MeSH
- Genome, Plant MeSH
- DNA, Mitochondrial genetics MeSH
- Mitochondria genetics MeSH
- INDEL Mutation genetics MeSH
- Plastids genetics MeSH
- Plants genetics MeSH
- Sequence Analysis, DNA MeSH
- DNA Transposable Elements genetics MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- DNA, Mitochondrial MeSH
- DNA Transposable Elements MeSH
Rumex acetosa is a dioecious plant with the XY1Y2 sex chromosome system. Both Y chromosomes are heterochromatic and are thought to be degenerated. We performed low-pass 454 sequencing and similarity-based clustering of male and female genomic 454 reads to identify and characterize major groups of R. acetosa repetitive DNA. We found that Copia and Gypsy retrotransposons dominated, followed by DNA transposons and non-long terminal repeat retrotransposons. CRM and Tat/Ogre retrotransposons dominated the Gypsy superfamily, whereas Maximus/Sireviruses were most abundant among Copia retrotransposons. Only one Gypsy subfamily had accumulated on Y1 and Y2 chromosomes, whereas many retrotransposons were ubiquitous on autosomes and the X chromosome, but absent on Y1 and Y2 chromosomes, and others were depleted from the X chromosome. One group of CRM Gypsy was specifically localized to centromeres. We also found that the majority of previously described satellites (RAYSI, RAYSII, RAYSIII, and RAE180) are accumulated on the Y chromosomes, where we identified a Y chromosome-specific variant of RAE180. We discovered two novel satellites: RA160, dominating on the X chromosome, and RA690, localized mostly on the Y1 chromosome. The expression pattern obtained from Illumina RNA sequencing showed that the expression of transposable elements is similar in leaves of both sexes and that satellites are also expressed. Contrasting patterns of transposable elements (TEs) and satellite localization on sex chromosomes in R. acetosa, where not only accumulation but also depletion of repetitive DNA was observed, suggest that a plethora of evolutionary processes can shape sex chromosomes.
- MeSH
- Chromosomes, Plant genetics MeSH
- Phylogeny MeSH
- Evolution, Molecular MeSH
- Molecular Sequence Data MeSH
- Sex Chromosomes genetics MeSH
- Retroelements * MeSH
- Rumex classification genetics MeSH
- DNA, Satellite * MeSH
- Base Sequence MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- Retroelements * MeSH
- DNA, Satellite * MeSH
