Most cited article - PubMed ID 27593378
An Unprecedented Non-canonical Nuclear Genetic Code with All Three Termination Codons Reassigned as Sense Codons
UNLABELLED: Trypanosomatids are among the most extensively studied protists due to their parasitic interactions with insects, vertebrates, and plants. Recently, Blastocrithidia nonstop was found to depart from the canonical genetic code, with all three stop codons reassigned to encode amino acids (UAR for glutamate and UGA for tryptophan), and UAA having dual meaning also as a termination signal (glutamate and stop). To explore features linked to this phenomenon, we analyzed the genomes of four Blastocrithidia and four Obscuromonas species, the latter representing a sister group employing the canonical genetic code. We found that all Blastocrithidia species encode cognate tRNAs for UAR codons, possess a distinct 4 bp anticodon stem tRNATrpCCA decoding UGA, and utilize UAA as the only stop codon. The distribution of in-frame reassigned codons is consistently non-random, suggesting a translational burden avoided in highly expressed genes. Frame-specific enrichment of UAA codons immediately following the genuine UAA stop codon, not observed in Obscuromonas, points to a specific mode of termination. All Blastocrithidia species possess specific mutations in eukaryotic release factor 1 and a unique acidic region following the prion-like N-terminus of eukaryotic release factor 3 that may be associated with stop codon readthrough. We infer that the common ancestor of the genus Blastocrithidia already exhibited a GC-poor genome with the non-canonical genetic code. Our comparative analysis highlights features associated with this extensive stop codon reassignment. This cascade of mutually dependent adaptations, driven by increasing AU-richness in transcripts and frequent emergence of in-frame stops, underscores the dynamic interplay between genome composition and genetic code plasticity to maintain vital functionality. IMPORTANCE: The genetic code, assigning amino acids to codons, is almost universal, yet an increasing number of its alterations keep emerging, mostly in organelles and unicellular eukaryotes. One such case is the trypanosomatid genus Blastocrithidia, where all three stop codons were reassigned to amino acids, with UAA also serving as a sole termination signal. We conducted a comparative analysis of four Blastocrithidia species, all with the same non-canonical genetic code, and their close relatives of the genus Obscuromonas, which retain the canonical code. This across-genome comparison allowed the identification of key traits associated with genetic code reassignment in Blastocrithidia. This work provides insight into the evolutionary steps, facilitating an extensive departure from the canonical genetic code that occurred independently in several eukaryotic lineages.
- Keywords
- AT-rich genomes, eukaryotic release factors, nuclear genetic code, reassigned codon, tRNA structure, termination of translation,
- MeSH
- Cell Nucleus * genetics MeSH
- Phylogeny MeSH
- Genetic Code * MeSH
- Genome, Protozoan * MeSH
- Genomics MeSH
- Evolution, Molecular MeSH
- RNA, Transfer genetics MeSH
- Codon, Terminator genetics MeSH
- Trypanosomatina * genetics classification MeSH
- Publication type
- Journal Article MeSH
- Comparative Study MeSH
- Names of Substances
- RNA, Transfer MeSH
- Codon, Terminator MeSH
Blastocrithidia nonstop is a protist with a highly unusual nuclear genetic code, in which all three standard stop codons are reassigned to encode amino acids, with UAA also serving as a sole termination codon. In this study, we demonstrate that this parasitic flagellate is amenable to genetic manipulation, enabling gene ablation and protein tagging. Using preassembled Cas9 ribonucleoprotein complexes, we successfully disrupted and tagged the non-essential gene encoding catalase. These advances establish this single-celled eukaryote as a model organism for investigating the malleability and evolution of the genetic code in eukaryotes.
- Keywords
- CRISPR‐Cas9, codon reassignment, genetic code, model organism, trypanosomatids,
- MeSH
- Genetic Code * genetics MeSH
- Catalase genetics MeSH
- Protozoan Proteins genetics MeSH
- Codon, Terminator genetics MeSH
- Trypanosomatina * genetics MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Catalase MeSH
- Protozoan Proteins MeSH
- Codon, Terminator MeSH
Trypanosomatids (Euglenozoa) are a diverse group of unicellular flagellates predominately infecting insects (monoxenous species) or circulating between insects and vertebrates or plants (dixenous species). Monoxenous trypanosomatids harbor a wide range of RNA viruses belonging to the families Narnaviridae, Totiviridae, Qinviridae, Leishbuviridae, and a putative group of tombus-like viruses. Here, we focus on the subfamily Blastocrithidiinae, a previously unexplored divergent group of monoxenous trypanosomatids comprising two related genera: Obscuromonas and Blastocrithidia. Members of the genus Blastocrithidia employ a unique genetic code, in which all three stop codons are repurposed to encode amino acids, with TAA also used to terminate translation. Obscuromonas isolates studied here bear viruses of three families: Narnaviridae, Qinviridae, and Mitoviridae. The latter viral group is documented in trypanosomatid flagellates for the first time. While other known mitoviruses replicate in the mitochondria, those of trypanosomatids appear to reside in the cytoplasm. Although no RNA viruses were detected in Blastocrithidia spp., we identified an endogenous viral element in the genome of B. triatomae indicating its past encounter(s) with tombus-like viruses.
- Keywords
- Blastocrithidia, Mitoviridae, Narnaviridae, Obscuromonas, Qin-like virus, dsRNA viruses,
- Publication type
- Journal Article MeSH
Trypanosomatids are obligate parasites of animals, predominantly insects and vertebrates, and flowering plants. Monoxenous species, representing the vast majority of trypanosomatid diversity, develop in a single host, whereas dixenous species cycle between two hosts, of which primarily insect serves as a vector. To explore in-depth the diversity of insect trypanosomatids including their co-infections, sequence profiling of their 18S rRNA gene was used for true bugs (Hemiptera; 18% infection rate) and flies (Diptera; 10%) in Cuba. Out of 48 species (molecular operational taxonomic units) belonging to the genera Vickermania (16 spp.), Blastocrithidia (7), Obscuromonas (4), Phytomonas (5), Leptomonas/Crithidia (5), Herpetomonas (5), Wallacemonas (2), Kentomonas (1), Angomonas (1) and two unnamed genera (1 + 1), 38 species have been encountered for the first time. The detected Wallacemonas and Angomonas species constitute the most basal lineages of their respective genera, while Vickermania emerged as the most diverse group. The finding of Leptomonas seymouri, which is known to rarely infect humans, confirms that Dysdercus bugs are its natural hosts. A clear association of Phytomonas with the heteropteran family Pentatomidae hints at its narrow host association with the insect rather than plant hosts. With a focus on multiple infections of a single fly host, using deep Nanopore sequencing of 18S rRNA, we have identified co-infections with up to 8 trypanosomatid species. The fly midgut was usually occupied by several Vickermania species, while Herpetomonas and/or Kentomonas species prevailed in the hindgut. Metabarcoding was instrumental for analysing extensive co-infections and also allowed the identification of trypanosomatid lineages and genera.
- Keywords
- biodiversity, diptera, heteroptera, host specificity, monoxenous trypanosomatids, multiple infections, nanopore sequencing, phylogeny, systematics,
- MeSH
- Diptera genetics MeSH
- Phylogeny * MeSH
- Hemiptera parasitology genetics MeSH
- Coinfection * parasitology MeSH
- DNA, Protozoan genetics analysis MeSH
- RNA, Ribosomal, 18S * genetics analysis MeSH
- Trypanosomatina * genetics classification isolation & purification MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Geographicals
- Cuba epidemiology MeSH
- Names of Substances
- DNA, Protozoan MeSH
- RNA, Ribosomal, 18S * MeSH
The canonical stop codons of the nuclear genome of the trypanosomatid Blastocrithidia nonstop are recoded. Here, we investigated the effect of this recoding on the mitochondrial genome and gene expression. Trypanosomatids possess a single mitochondrion and protein-coding transcripts of this genome require RNA editing in order to generate open reading frames of many transcripts encoded as 'cryptogenes'. Small RNAs that can number in the hundreds direct editing and produce a mitochondrial transcriptome of unusual complexity. We find B. nonstop to have a typical trypanosomatid mitochondrial genetic code, which presumably requires the mitochondrion to disable utilization of the two nucleus-encoded suppressor tRNAs, which appear to be imported into the organelle. Alterations of the protein factors responsible for mRNA editing were also documented, but they have likely originated from sources other than B. nonstop nuclear genome recoding. The population of guide RNAs directing editing is minimal, yet virtually all genes for the plethora of known editing factors are still present. Most intriguingly, despite lacking complex I cryptogene guide RNAs, these cryptogene transcripts are stochastically edited to high levels.
- MeSH
- Cell Nucleus * genetics metabolism MeSH
- RNA Editing * MeSH
- Genetic Code MeSH
- Genome, Mitochondrial * MeSH
- RNA, Guide, Kinetoplastida genetics metabolism MeSH
- Codon genetics MeSH
- RNA, Messenger genetics metabolism MeSH
- Mitochondria genetics metabolism MeSH
- Open Reading Frames genetics MeSH
- Protozoan Proteins genetics metabolism MeSH
- RNA, Transfer * genetics metabolism MeSH
- Codon, Terminator genetics MeSH
- Trypanosomatina genetics metabolism MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- RNA, Guide, Kinetoplastida MeSH
- Codon MeSH
- RNA, Messenger MeSH
- Protozoan Proteins MeSH
- RNA, Transfer * MeSH
- Codon, Terminator MeSH
BACKGROUND: Almost all extant organisms use the same, so-called canonical, genetic code with departures from it being very rare. Even more exceptional are the instances when a eukaryote with non-canonical code can be easily cultivated and has its whole genome and transcriptome sequenced. This is the case of Blastocrithidia nonstop, a trypanosomatid flagellate that reassigned all three stop codons to encode amino acids. RESULTS: We in silico predicted the metabolism of B. nonstop and compared it with that of the well-studied human parasites Trypanosoma brucei and Leishmania major. The mapped mitochondrial, glycosomal and cytosolic metabolism contains all typical features of these diverse and important parasites. We also provided experimental validation for some of the predicted observations, concerning, specifically presence of glycosomes, cellular respiration, and assembly of the respiratory complexes. CONCLUSIONS: In an unusual comparison of metabolism between a parasitic protist with a massively altered genetic code and its close relatives that rely on a canonical code we showed that the dramatic differences on the level of nucleic acids do not seem to be reflected in the metabolisms. Moreover, although the genome of B. nonstop is extremely AT-rich, we could not find any alterations of its pyrimidine synthesis pathway when compared to other trypanosomatids. Hence, we conclude that the dramatic alteration of the genetic code of B. nonstop has no significant repercussions on the metabolism of this flagellate.
- Keywords
- Blastocrithidia, In silico, Metabolic predictions, Non-canonical genetic code, Trypanosomatid,
- MeSH
- Eukaryota genetics MeSH
- Genetic Code MeSH
- Parasites * genetics MeSH
- Codon, Terminator MeSH
- Trypanosoma brucei brucei * genetics MeSH
- Trypanosomatina * genetics MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Codon, Terminator MeSH
Under certain circumstances, any of the three termination codons can be read through by a near-cognate tRNA; i.e., a tRNA whose two out of three anticodon nucleotides base pair with those of the stop codon. Unless programed to synthetize C-terminally extended protein variants with expanded physiological roles, readthrough represents an undesirable translational error. On the other side of a coin, a significant number of human genetic diseases is associated with the introduction of nonsense mutations (premature termination codons [PTCs]) into coding sequences, where stopping is not desirable. Here, the tRNA's ability to induce readthrough opens up the intriguing possibility of mitigating the deleterious effects of PTCs on human health. In yeast, the UGA and UAR stop codons were described to be read through by four readthrough-inducing rti-tRNAs-tRNATrp and tRNACys, and tRNATyr and tRNAGln, respectively. The readthrough-inducing potential of tRNATrp and tRNATyr was also observed in human cell lines. Here, we investigated the readthrough-inducing potential of human tRNACys in the HEK293T cell line. The tRNACys family consists of two isoacceptors, one with ACA and the other with GCA anticodons. We selected nine representative tRNACys isodecoders (differing in primary sequence and expression level) and tested them using dual luciferase reporter assays. We found that at least two tRNACys can significantly elevate UGA readthrough when overexpressed. This indicates a mechanistically conserved nature of rti-tRNAs between yeast and human, supporting the idea that they could be used in the PTC-associated RNA therapies.
- Keywords
- cysteine tRNA, near-cognate tRNA, readthrough-inducing tRNA, stop codon readthrough, translation,
- MeSH
- Anticodon MeSH
- Cysteine * genetics metabolism MeSH
- HEK293 Cells MeSH
- Humans MeSH
- Codon, Nonsense genetics MeSH
- Protein Biosynthesis MeSH
- RNA, Transfer, Cys metabolism MeSH
- RNA, Transfer, Trp metabolism MeSH
- RNA, Transfer, Tyr MeSH
- RNA, Transfer genetics metabolism MeSH
- Saccharomyces cerevisiae * genetics MeSH
- Codon, Terminator genetics MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Anticodon MeSH
- Cysteine * MeSH
- Codon, Nonsense MeSH
- RNA, Transfer, Cys MeSH
- RNA, Transfer, Trp MeSH
- RNA, Transfer, Tyr MeSH
- RNA, Transfer MeSH
- Codon, Terminator MeSH
BACKGROUND: Protists of the family Trypanosomatidae (phylum Euglenozoa) have gained notoriety as parasites affecting humans, domestic animals, and agricultural plants. However, the true extent of the group's diversity spreads far beyond the medically and veterinary relevant species. We address several knowledge gaps in trypanosomatid research by undertaking sequencing, assembly, and analysis of genomes from previously overlooked representatives of this protistan group. RESULTS: We assembled genomes for twenty-one trypanosomatid species, with a primary focus on insect parasites and Trypanosoma spp. parasitizing non-human hosts. The assemblies exhibit sizes consistent with previously sequenced trypanosomatid genomes, ranging from approximately 18 Mb for Obscuromonas modryi to 35 Mb for Crithidia brevicula and Zelonia costaricensis. Despite being the smallest, the genome of O. modryi has the highest content of repetitive elements, contributing nearly half of its total size. Conversely, the highest proportion of unique DNA is found in the genomes of Wallacemonas spp., with repeats accounting for less than 8% of the assembly length. The majority of examined species exhibit varying degrees of aneuploidy, with trisomy being the most frequently observed condition after disomy. CONCLUSIONS: The genome of Obscuromonas modryi represents a very unusual, if not unique, example of evolution driven by two antidromous forces: i) increasing dependence on the host leading to genomic shrinkage and ii) expansion of repeats causing genome enlargement. The observed variation in somy within and between trypanosomatid genera suggests that these flagellates are largely predisposed to aneuploidy and, apparently, exploit it to gain a fitness advantage. High heterogeneity in the genome size, repeat content, and variation in chromosome copy numbers in the newly-sequenced species highlight the remarkable genome plasticity exhibited by trypanosomatid flagellates. These new genome assemblies are a robust foundation for future research on the genetic basis of life cycle changes and adaptation to different hosts in the family Trypanosomatidae.
- Keywords
- Dixenous, Genome assembly, Monoxenous, Parasite, Protist, Trypanosomatids, Whole-genome sequencing,
- MeSH
- Acclimatization MeSH
- Aneuploidy MeSH
- Genome Size MeSH
- Trypanosomatina * genetics MeSH
- Agriculture MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication type
- Journal Article MeSH
Cognate tRNAs deliver specific amino acids to translating ribosomes according to the standard genetic code, and three codons with no cognate tRNAs serve as stop codons. Some protists have reassigned all stop codons as sense codons, neglecting this fundamental principle1-4. Here we analyse the in-frame stop codons in 7,259 predicted protein-coding genes of a previously undescribed trypanosomatid, Blastocrithidia nonstop. We reveal that in this species in-frame stop codons are underrepresented in genes expressed at high levels and that UAA serves as the only termination codon. Whereas new tRNAsGlu fully cognate to UAG and UAA evolved to reassign these stop codons, the UGA reassignment followed a different path through shortening the anticodon stem of tRNATrpCCA from five to four base pairs (bp). The canonical 5-bp tRNATrp recognizes UGG as dictated by the genetic code, whereas its shortened 4-bp variant incorporates tryptophan also into in-frame UGA. Mimicking this evolutionary twist by engineering both variants from B. nonstop, Trypanosoma brucei and Saccharomyces cerevisiae and expressing them in the last two species, we recorded a significantly higher readthrough for all 4-bp variants. Furthermore, a gene encoding B. nonstop release factor 1 acquired a mutation that specifically restricts UGA recognition, robustly potentiating the UGA reassignment. Virtually the same strategy has been adopted by the ciliate Condylostoma magnum. Hence, we describe a previously unknown, universal mechanism that has been exploited in unrelated eukaryotes with reassigned stop codons.
- MeSH
- Anticodon * chemistry genetics metabolism MeSH
- Ciliophora genetics MeSH
- Eukaryotic Cells * MeSH
- Genetic Code * genetics MeSH
- Mutation * MeSH
- Peptide Termination Factors * genetics metabolism MeSH
- RNA, Transfer, Glu genetics MeSH
- RNA, Transfer, Trp genetics MeSH
- RNA, Transfer * genetics metabolism MeSH
- Saccharomyces cerevisiae genetics MeSH
- Codon, Terminator * genetics MeSH
- Trypanosoma brucei brucei genetics MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Anticodon * MeSH
- Peptide Termination Factors * MeSH
- RNA, Transfer, Glu MeSH
- RNA, Transfer, Trp MeSH
- RNA, Transfer * MeSH
- Codon, Terminator * MeSH