Nejvíce citovaný článek - PubMed ID 37260773
BACKGROUND: In trypanosomatids, a group of unicellular eukaryotes that includes numerous important human parasites, cis-splicing has been previously reported for only two genes: a poly(A) polymerase and an RNA helicase. Conversely, trans-splicing, which involves the attachment of a spliced leader sequence, is observed for nearly every protein-coding transcript. So far, our understanding of splicing in this protistan group has stemmed from the analysis of only a few medically relevant species. In this study, we used an extensive dataset encompassing all described trypanosomatid genera to investigate the distribution of intron-containing genes and the evolution of splice sites. RESULTS: We identified a new conserved intron-containing gene encoding an RNA-binding protein that is universally present in Kinetoplastea. We show that Perkinsela sp., a kinetoplastid endosymbiont of Amoebozoa, represents the first eukaryote completely devoid of cis-splicing, yet still preserving trans-splicing. We also provided evidence for reverse transcriptase-mediated intron loss in Kinetoplastea, extensive conservation of 5' splice sites, and the presence of non-coding RNAs within a subset of retained trypanosomatid introns. CONCLUSIONS: All three intron-containing genes identified in Kinetoplastea encode RNA-interacting proteins, with a potential to fine-tune the expression of multiple genes, thus challenging the perception of cis-splicing in these protists as a mere evolutionary relic. We suggest that there is a selective pressure to retain cis-splicing in trypanosomatids and that this is likely associated with overall control of mRNA processing. Our study provides new insights into the evolution of introns and, consequently, the regulation of gene expression in eukaryotes.
- Klíčová slova
- Introns, Kinetoplastea, Poly(A) polymerase, RNA helicase, RNA-binding protein, Splicing, Trypanosomatidae,
- MeSH
- fylogeneze MeSH
- introny * genetika MeSH
- Kinetoplastida genetika MeSH
- molekulární evoluce MeSH
- protozoální geny genetika MeSH
- protozoální proteiny genetika MeSH
- trans-splicing * genetika MeSH
- Trypanosomatina genetika MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- protozoální proteiny MeSH
Despite the fact that introns mean an energy and time burden for eukaryotic cells, they play an irreplaceable role in the diversification and regulation of protein production. As a common feature of eukaryotic genomes, it has been reported that in protein-coding genes, the longest intron is usually one of the first introns. The goal of our work was to find a possible difference in the biological function of genes that fulfill this common feature compared to genes that do not. Data on the lengths of all introns in genes were extracted from the genomes of six vertebrates (human, mouse, koala, chicken, zebrafish and fugu) and two other model organisms (nematode worm and arabidopsis). We showed that more than 40% of protein-coding genes have the relative position of the longest intron located in the second or third tertile of all introns. Genes divided according to the relative position of the longest intron were found to be significantly increased in different KEGG pathways. Genes with the longest intron in the first tertile predominate in a range of pathways for amino acid and lipid metabolism, various signaling, cell junctions or ABC transporters. Genes with the longest intron in the second or third tertile show increased representation in pathways associated with the formation and function of the spliceosome and ribosomes. In the two groups of genes defined in this way, we further demonstrated the difference in the length of the longest introns and the distribution of their absolute positions. We also pointed out other characteristics, namely the positive correlation between the length of the longest intron and the sum of the lengths of all other introns in the gene and the preservation of the exact same absolute and relative position of the longest intron between orthologous genes.
- Klíčová slova
- Eukaryotes, Gene function, Gene structure, Genome, Longest intron, Ribosome biogenesis, Spliceosome,
- MeSH
- Arabidopsis genetika MeSH
- introny * genetika MeSH
- lidé MeSH
- spliceozomy genetika metabolismus MeSH
- zvířata MeSH
- Check Tag
- lidé MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH