Most cited article - PubMed ID 37605127
Shining the spotlight on the neglected: new high-quality genome assemblies as a gateway to understanding the evolution of Trypanosomatidae
UNLABELLED: Trypanosomatids are among the most extensively studied protists due to their parasitic interactions with insects, vertebrates, and plants. Recently, Blastocrithidia nonstop was found to depart from the canonical genetic code, with all three stop codons reassigned to encode amino acids (UAR for glutamate and UGA for tryptophan), and UAA having dual meaning also as a termination signal (glutamate and stop). To explore features linked to this phenomenon, we analyzed the genomes of four Blastocrithidia and four Obscuromonas species, the latter representing a sister group employing the canonical genetic code. We found that all Blastocrithidia species encode cognate tRNAs for UAR codons, possess a distinct 4 bp anticodon stem tRNATrpCCA decoding UGA, and utilize UAA as the only stop codon. The distribution of in-frame reassigned codons is consistently non-random, suggesting a translational burden avoided in highly expressed genes. Frame-specific enrichment of UAA codons immediately following the genuine UAA stop codon, not observed in Obscuromonas, points to a specific mode of termination. All Blastocrithidia species possess specific mutations in eukaryotic release factor 1 and a unique acidic region following the prion-like N-terminus of eukaryotic release factor 3 that may be associated with stop codon readthrough. We infer that the common ancestor of the genus Blastocrithidia already exhibited a GC-poor genome with the non-canonical genetic code. Our comparative analysis highlights features associated with this extensive stop codon reassignment. This cascade of mutually dependent adaptations, driven by increasing AU-richness in transcripts and frequent emergence of in-frame stops, underscores the dynamic interplay between genome composition and genetic code plasticity to maintain vital functionality. IMPORTANCE: The genetic code, assigning amino acids to codons, is almost universal, yet an increasing number of its alterations keep emerging, mostly in organelles and unicellular eukaryotes. One such case is the trypanosomatid genus Blastocrithidia, where all three stop codons were reassigned to amino acids, with UAA also serving as a sole termination signal. We conducted a comparative analysis of four Blastocrithidia species, all with the same non-canonical genetic code, and their close relatives of the genus Obscuromonas, which retain the canonical code. This across-genome comparison allowed the identification of key traits associated with genetic code reassignment in Blastocrithidia. This work provides insight into the evolutionary steps, facilitating an extensive departure from the canonical genetic code that occurred independently in several eukaryotic lineages.
- Keywords
- AT-rich genomes, eukaryotic release factors, nuclear genetic code, reassigned codon, tRNA structure, termination of translation,
- MeSH
- Cell Nucleus * genetics MeSH
- Phylogeny MeSH
- Genetic Code * MeSH
- Genome, Protozoan * MeSH
- Genomics MeSH
- Evolution, Molecular MeSH
- RNA, Transfer genetics MeSH
- Codon, Terminator genetics MeSH
- Trypanosomatina * genetics classification MeSH
- Publication type
- Journal Article MeSH
- Comparative Study MeSH
- Names of Substances
- RNA, Transfer MeSH
- Codon, Terminator MeSH
Blastocrithidia nonstop is a protist with a highly unusual nuclear genetic code, in which all three standard stop codons are reassigned to encode amino acids, with UAA also serving as a sole termination codon. In this study, we demonstrate that this parasitic flagellate is amenable to genetic manipulation, enabling gene ablation and protein tagging. Using preassembled Cas9 ribonucleoprotein complexes, we successfully disrupted and tagged the non-essential gene encoding catalase. These advances establish this single-celled eukaryote as a model organism for investigating the malleability and evolution of the genetic code in eukaryotes.
- Keywords
- CRISPR‐Cas9, codon reassignment, genetic code, model organism, trypanosomatids,
- MeSH
- Genetic Code * genetics MeSH
- Catalase genetics MeSH
- Protozoan Proteins genetics MeSH
- Codon, Terminator genetics MeSH
- Trypanosomatina * genetics MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Catalase MeSH
- Protozoan Proteins MeSH
- Codon, Terminator MeSH
Transposable elements (TEs) have the ability to move and amplify inside the host genome, making them a pivotal source of genome plasticity. Presently, only 4 TE clades (all classified as Class I retrotransposons) have been identified in trypanosomatids. We predicted repeat content and manually curated TEs across the genomes of 57 trypanosomatids, shedding light on their proportions, diversity and dynamics. Our analysis yielded 214 TE consensus sequence models across the dataset, with abundance ranging from 0.1% to 7.2%. We found evidence of recent transposon activity in most species, with notable bursts in the Vickermania, Lafontella, Porcisia and Angomonas spp., along with Leishmania (Mundinia) chancei, L. (M.) orientalis and L. (M.) procaviensis. We confirmed that the 4 TE clades have colonized virtually all lineages of trypanosomatids, potentially playing a role in shaping their genome architecture. The effort of this work culminated in the establishment of the Trypanosomatid TE Database 1.0, a resource designed to standardize the TE annotation process that can serve as a foundation for future studies on trypanosomatid TEs.
- Keywords
- CRE, INGI, SLACS, TATE, VIPER, mobilome, transposable elements, trypanosomatids,
- MeSH
- Phylogeny MeSH
- Genetic Variation * MeSH
- Genome, Protozoan * MeSH
- Evolution, Molecular * MeSH
- Retroelements genetics MeSH
- DNA Transposable Elements * genetics MeSH
- Trypanosomatina * genetics classification MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Comparative Study MeSH
- Names of Substances
- Retroelements MeSH
- DNA Transposable Elements * MeSH
Lotmaria passim is a ubiquitous trypanosomatid parasite of honey bees nestled within the medically important subfamily Leishmaniinae. Although this parasite is associated with honey bee colony losses, the original draft genome-which was completed before its differentiation from the closely related Crithidia mellificae-has remained the reference for this species despite lacking improvements from newer methodologies. Here, we report the updated sequencing, assembly, and annotation of the BRL-type (Bee Research Laboratory) strain (ATCC PRA-422) of Lotmaria passim. The nuclear genome assembly has been resolved into 31 complete chromosomes and is paired with an assembled kinetoplast genome consisting of a maxicircle and 30 minicircle sequences. The assembly spans 33.7 Mb and contains very little repetitive content, from which our annotation of both the nuclear assembly and kinetoplast predicted 10,288 protein-coding genes. Analyses of the assembly revealed evidence of a recent chromosomal duplication event within chromosomes 5 and 6 and provided evidence for a high level of aneuploidy in this species, mirroring the genomic flexibility employed by other trypanosomatids as a means of adaptation to different environments. This high-quality reference can therefore provide insights into adaptations of trypanosomatids to the thermally regulated, acidic, and phytochemically rich honey bee hindgut niche, which offers parallels to the challenges faced by other Leishmaniinae during the challenges they undergo within insect vectors, during infection of mammals, and exposure to antiparasitic drugs throughout their multi-host life cycles. This reference will also facilitate investigations of strain-specific genomic polymorphisms, their role in pathogenicity, and the development of treatments for pollinator infection.
- Keywords
- Lotmaria passim strain BRL, ATCC PRA-422, Hi-C, Leishmaniinae, PacBio, Trypanosomatidae, aneuploidy, monoxenous, polyploidy, trypanosomatid,
- MeSH
- Molecular Sequence Annotation MeSH
- Phylogeny MeSH
- Genome, Protozoan MeSH
- Genomics methods MeSH
- Evolution, Molecular * MeSH
- Trypanosomatina * genetics classification MeSH
- Bees parasitology MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
BACKGROUND: In trypanosomatids, a group of unicellular eukaryotes that includes numerous important human parasites, cis-splicing has been previously reported for only two genes: a poly(A) polymerase and an RNA helicase. Conversely, trans-splicing, which involves the attachment of a spliced leader sequence, is observed for nearly every protein-coding transcript. So far, our understanding of splicing in this protistan group has stemmed from the analysis of only a few medically relevant species. In this study, we used an extensive dataset encompassing all described trypanosomatid genera to investigate the distribution of intron-containing genes and the evolution of splice sites. RESULTS: We identified a new conserved intron-containing gene encoding an RNA-binding protein that is universally present in Kinetoplastea. We show that Perkinsela sp., a kinetoplastid endosymbiont of Amoebozoa, represents the first eukaryote completely devoid of cis-splicing, yet still preserving trans-splicing. We also provided evidence for reverse transcriptase-mediated intron loss in Kinetoplastea, extensive conservation of 5' splice sites, and the presence of non-coding RNAs within a subset of retained trypanosomatid introns. CONCLUSIONS: All three intron-containing genes identified in Kinetoplastea encode RNA-interacting proteins, with a potential to fine-tune the expression of multiple genes, thus challenging the perception of cis-splicing in these protists as a mere evolutionary relic. We suggest that there is a selective pressure to retain cis-splicing in trypanosomatids and that this is likely associated with overall control of mRNA processing. Our study provides new insights into the evolution of introns and, consequently, the regulation of gene expression in eukaryotes.
- Keywords
- Introns, Kinetoplastea, Poly(A) polymerase, RNA helicase, RNA-binding protein, Splicing, Trypanosomatidae,
- MeSH
- Phylogeny MeSH
- Introns * genetics MeSH
- Kinetoplastida genetics MeSH
- Evolution, Molecular MeSH
- Genes, Protozoan genetics MeSH
- Protozoan Proteins genetics MeSH
- Trans-Splicing * genetics MeSH
- Trypanosomatina genetics MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Protozoan Proteins MeSH
Leishmania is a genus of the family Trypanosomatidae that unites obligatory parasitic flagellates causing a variety of vector-borne diseases collectively called leishmaniasis. The symptoms range from relatively innocuous skin lesions to complete failures of visceral organs. The disease is exacerbated if a parasite harbors Leishmania RNA viruses (LRVs) of the family Pseudototiviridae. Screening a novel isolate of L. braziliensis, we revealed that it possesses not a toti-, but a bunyavirus of the family Leishbuviridae. To the best of our knowledge, this is a very first discovery of a bunyavirus infecting a representative of the Leishmania subgenus Viannia. We suggest that these viruses may serve as potential factors of virulence in American leishmaniasis and encourage researchers to test leishmanial strains for the presence of not only LRVs, but also other RNA viruses.
- MeSH
- Bunyaviridae classification genetics isolation & purification MeSH
- Phylogeny MeSH
- Leishmania braziliensis * genetics isolation & purification MeSH
- Humans MeSH
- Orthobunyavirus genetics classification isolation & purification physiology MeSH
- RNA Viruses genetics classification isolation & purification MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
Trypanosomatids (Euglenozoa) are a diverse group of unicellular flagellates predominately infecting insects (monoxenous species) or circulating between insects and vertebrates or plants (dixenous species). Monoxenous trypanosomatids harbor a wide range of RNA viruses belonging to the families Narnaviridae, Totiviridae, Qinviridae, Leishbuviridae, and a putative group of tombus-like viruses. Here, we focus on the subfamily Blastocrithidiinae, a previously unexplored divergent group of monoxenous trypanosomatids comprising two related genera: Obscuromonas and Blastocrithidia. Members of the genus Blastocrithidia employ a unique genetic code, in which all three stop codons are repurposed to encode amino acids, with TAA also used to terminate translation. Obscuromonas isolates studied here bear viruses of three families: Narnaviridae, Qinviridae, and Mitoviridae. The latter viral group is documented in trypanosomatid flagellates for the first time. While other known mitoviruses replicate in the mitochondria, those of trypanosomatids appear to reside in the cytoplasm. Although no RNA viruses were detected in Blastocrithidia spp., we identified an endogenous viral element in the genome of B. triatomae indicating its past encounter(s) with tombus-like viruses.
- Keywords
- Blastocrithidia, Mitoviridae, Narnaviridae, Obscuromonas, Qin-like virus, dsRNA viruses,
- Publication type
- Journal Article MeSH
Trypanosomatids are obligate parasites of animals, predominantly insects and vertebrates, and flowering plants. Monoxenous species, representing the vast majority of trypanosomatid diversity, develop in a single host, whereas dixenous species cycle between two hosts, of which primarily insect serves as a vector. To explore in-depth the diversity of insect trypanosomatids including their co-infections, sequence profiling of their 18S rRNA gene was used for true bugs (Hemiptera; 18% infection rate) and flies (Diptera; 10%) in Cuba. Out of 48 species (molecular operational taxonomic units) belonging to the genera Vickermania (16 spp.), Blastocrithidia (7), Obscuromonas (4), Phytomonas (5), Leptomonas/Crithidia (5), Herpetomonas (5), Wallacemonas (2), Kentomonas (1), Angomonas (1) and two unnamed genera (1 + 1), 38 species have been encountered for the first time. The detected Wallacemonas and Angomonas species constitute the most basal lineages of their respective genera, while Vickermania emerged as the most diverse group. The finding of Leptomonas seymouri, which is known to rarely infect humans, confirms that Dysdercus bugs are its natural hosts. A clear association of Phytomonas with the heteropteran family Pentatomidae hints at its narrow host association with the insect rather than plant hosts. With a focus on multiple infections of a single fly host, using deep Nanopore sequencing of 18S rRNA, we have identified co-infections with up to 8 trypanosomatid species. The fly midgut was usually occupied by several Vickermania species, while Herpetomonas and/or Kentomonas species prevailed in the hindgut. Metabarcoding was instrumental for analysing extensive co-infections and also allowed the identification of trypanosomatid lineages and genera.
- Keywords
- biodiversity, diptera, heteroptera, host specificity, monoxenous trypanosomatids, multiple infections, nanopore sequencing, phylogeny, systematics,
- MeSH
- Diptera genetics MeSH
- Phylogeny * MeSH
- Hemiptera parasitology genetics MeSH
- Coinfection * parasitology MeSH
- DNA, Protozoan genetics analysis MeSH
- RNA, Ribosomal, 18S * genetics analysis MeSH
- Trypanosomatina * genetics classification isolation & purification MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Geographicals
- Cuba epidemiology MeSH
- Names of Substances
- DNA, Protozoan MeSH
- RNA, Ribosomal, 18S * MeSH
Isocitrate dehydrogenase is an enzyme converting isocitrate to α-ketoglutarate in the canonical tricarboxylic acid (TCA) cycle. There are three different types of isocitrate dehydrogenase documented in eukaryotes. Our study points out the complex evolutionary history of isocitrate dehydrogenases across kinetoplastids, where the common ancestor of Trypanosomatidae and Bodonidae was equipped with two isoforms of the isocitrate dehydrogenase enzyme: the NADP+-dependent isocitrate dehydrogenase 1 with possibly dual localization in the cytosol and mitochondrion and NADP+-dependent mitochondrial isocitrate dehydrogenase 2. In the extant trypanosomatids, isocitrate dehydrogenase 1 is present only in a few species suggesting that it was lost upon separation of Trypanosoma spp. and replaced by the mainly NADP+-dependent cytosolic isocitrate dehydrogenase 3 of bacterial origin in all the derived lineages. In this study, we experimentally demonstrate that the omnipresent isocitrate dehydrogenase 2 has a dual localization in both mitochondrion and cytosol in at least four species that possess only this isoform. The apparent lack of the NAD+-dependent isocitrate dehydrogenase activity in trypanosomatid mitochondrion provides further support to the existence of the noncanonical TCA cycle across trypanosomatids and the bidirectional activity of isocitrate dehydrogenase 3 when operating with NADP+ cofactor instead of NAD+. This observation can be extended to all 17 species analyzed in this study, except for Leishmania mexicana, which showed only low isocitrate dehydrogenase activity in the cytosol. The variability in isocitrate oxidation capacity among species may reflect the distinct metabolic strategies and needs for reduced cofactors in particular environments.
- Keywords
- Krebs cycle, NAD+, NADP+, TCA cycle, cofactor preference, isocitrate dehydrogenase,
- MeSH
- Isocitrate Dehydrogenase * genetics metabolism MeSH
- Isocitrates metabolism MeSH
- NAD * metabolism MeSH
- NADP metabolism MeSH
- Protein Isoforms MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Isocitrate Dehydrogenase * MeSH
- Isocitrates MeSH
- isocitric acid MeSH Browser
- NAD * MeSH
- NADP MeSH
- Protein Isoforms MeSH
BACKGROUND: Almost all extant organisms use the same, so-called canonical, genetic code with departures from it being very rare. Even more exceptional are the instances when a eukaryote with non-canonical code can be easily cultivated and has its whole genome and transcriptome sequenced. This is the case of Blastocrithidia nonstop, a trypanosomatid flagellate that reassigned all three stop codons to encode amino acids. RESULTS: We in silico predicted the metabolism of B. nonstop and compared it with that of the well-studied human parasites Trypanosoma brucei and Leishmania major. The mapped mitochondrial, glycosomal and cytosolic metabolism contains all typical features of these diverse and important parasites. We also provided experimental validation for some of the predicted observations, concerning, specifically presence of glycosomes, cellular respiration, and assembly of the respiratory complexes. CONCLUSIONS: In an unusual comparison of metabolism between a parasitic protist with a massively altered genetic code and its close relatives that rely on a canonical code we showed that the dramatic differences on the level of nucleic acids do not seem to be reflected in the metabolisms. Moreover, although the genome of B. nonstop is extremely AT-rich, we could not find any alterations of its pyrimidine synthesis pathway when compared to other trypanosomatids. Hence, we conclude that the dramatic alteration of the genetic code of B. nonstop has no significant repercussions on the metabolism of this flagellate.
- Keywords
- Blastocrithidia, In silico, Metabolic predictions, Non-canonical genetic code, Trypanosomatid,
- MeSH
- Eukaryota genetics MeSH
- Genetic Code MeSH
- Parasites * genetics MeSH
- Codon, Terminator MeSH
- Trypanosoma brucei brucei * genetics MeSH
- Trypanosomatina * genetics MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Codon, Terminator MeSH