SARS-CoV-2 has accumulated many mutations since its emergence in late 2019. Nucleotide substitutions leading to amino acid replacements constitute the primary material for natural selection. Insertions, deletions, and substitutions appear to be critical for coronavirus's macro- and microevolution. Understanding the molecular mechanisms of mutations in the mutational hotspots (positions, loci with recurrent mutations, and nucleotide context) is important for disentangling roles of mutagenesis and selection. In the SARS-CoV-2 genome, deletions and insertions are frequently associated with repetitive sequences, whereas C>U substitutions are often surrounded by nucleotides resembling the APOBEC mutable motifs. We describe various approaches to mutation spectra analyses, including the context features of RNAs that are likely to be involved in the generation of recurrent mutations. We also discuss the interplay between mutations and natural selection as a complex evolutionary trend. The substantial variability and complexity of pipelines for the reconstruction of mutations and the huge number of genomic sequences are major problems for the analyses of mutations in the SARS-CoV-2 genome. As a solution, we advocate for the development of a centralized database of predicted mutations, which needs to be updated on a regular basis.
- MeSH
- COVID-19 * genetics MeSH
- Humans MeSH
- Mutation MeSH
- Mutagenesis MeSH
- Nucleotides MeSH
- SARS-CoV-2 genetics MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Review MeSH
BACKGROUND: Accessory proteins have diverse roles in coronavirus pathobiology. One of them in SARS-CoV (the causative agent of the severe acute respiratory syndrome outbreak in 2002-2003) is encoded by the open reading frame 8 (ORF8). Among the most dramatic genomic changes observed in SARS-CoV isolated from patients during the peak of the pandemic in 2003 was the acquisition of a characteristic 29-nucleotide deletion in ORF8. This deletion cause splitting of ORF8 into two smaller ORFs, namely ORF8a and ORF8b. Functional consequences of this event are not entirely clear. RESULTS: Here, we performed evolutionary analyses of ORF8a and ORF8b genes and documented that in both cases the frequency of synonymous mutations was greater than that of nonsynonymous ones. These results suggest that ORF8a and ORF8b are under purifying selection, thus proteins translated from these ORFs are likely to be functionally important. Comparisons with several other SARS-CoV genes revealed that another accessory gene, ORF7a, has a similar ratio of nonsynonymous to synonymous mutations suggesting that ORF8a, ORF8b, and ORF7a are under similar selection pressure. CONCLUSIONS: Our results for SARS-CoV echo the known excess of deletions in the ORF7a-ORF7b-ORF8 complex of accessory genes in SARS-CoV-2. A high frequency of deletions in this gene complex might reflect recurrent searches in "functional space" of various accessory protein combinations that may eventually produce more advantageous configurations of accessory proteins similar to the fixed deletion in the SARS-CoV ORF8 gene.
- MeSH
- Biological Evolution MeSH
- COVID-19 * MeSH
- Humans MeSH
- Nucleotides MeSH
- Open Reading Frames MeSH
- SARS-CoV-2 genetics MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
The analysis of deletions may reveal evolutionary trends and provide new insight into the surprising variability and rapidly spreading capability that SARS-CoV-2 has shown since its emergence. To understand the factors governing genomic stability, it is important to define the molecular mechanisms of deletions in the viral genome. In this work, we performed a statistical analysis of deletions. Specifically, we analyzed correlations between deletions in the SARS-CoV-2 genome and repetitive elements and documented a significant association of deletions with runs of identical (poly-) nucleotides and direct repeats. Our analyses of deletions in the accessory genes of SARS-CoV-2 suggested that there may be a hypervariability in ORF7A and ORF8 that is not associated with repetitive elements. Such recurrent search in a "sequence space" of accessory genes (that might be driven by natural selection) did not yet cause increased viability of the SARS-CoV-2 variants. However, deletions in the accessory genes may ultimately produce new variants that are more successful compared to the viral strains with the conventional architecture of the SARS-CoV-2 accessory genes.
- Publication type
- Journal Article MeSH
Leishmaniaviruses (LRVs) have been demonstrated to enhance progression of leishmaniasis, a vector-transmitted disease with a wide range of clinical manifestations that is caused by flagellates of the genus Leishmania. Here, we used two previously proposed strategies of the LRV ablation to shed light on the relationships of two Leishmania spp. with their respective viral species (L. guyanensis, LRV1 and L. major, LRV2) and demonstrated considerable difference between two studied systems. LRV1 could be easily eliminated by the expression of exogenous capsids regardless of their origin (the same or distantly related LRV1 strains, or even LRV2), while LRV2 was only partially depleted in the case of the native capsid overexpression. The striking differences were also observed in the effects of complete viral elimination with 2'C-methyladenosine (2-CMA) on the transcriptional profiles of these two Leishmania spp. While virtually no differentially expressed genes were detected after the LRV1 removal from L. guyanensis, the response of L. major after ablation of LRV2 involved 87 genes, the analysis of which suggested a considerable stress experienced even after several passages following the treatment. This effect on L. major was also reflected in a significant decrease of the proliferation rate, not documented in L. guyanensis and naturally virus-free strain of L. major. Our findings suggest that integration of L. major with LRV2 is deeper compared with that of L. guyanensis with LRV1. We presume this determines different effects of the viral presence on the Leishmania spp. infections. IMPORTANCELeishmania spp. represent human pathogens that cause leishmaniasis, a widespread parasitic disease with mild to fatal clinical manifestations. Some strains of leishmaniae bear leishmaniaviruses (LRVs), and this has been shown to aggravate disease course. We investigated the relationships of two distally related Leishmania spp. with their respective LRVs using different strategies of virus removal. Our results suggest the South American L. guyanensis easily loses its virus with no important consequences for the parasite in the laboratory culture. Conversely, the Old-World L. major is refractory to virus removal and experiences a prominent stress if this removal is nonetheless completed. The drastically different levels of integration between the studied Leishmania spp. and their viruses suggest distinct effects of the viral presence on infections in these species of parasites.
Leishmaniasis is a parasitic vector-borne disease caused by the protistan flagellates of the genus Leishmania. Leishmania (Viannia) guyanensis is one of the most common causative agents of the American tegumentary leishmaniasis. It has previously been shown that L. guyanensis strains that carry the endosymbiotic Leishmania RNA virus 1 (LRV1) cause more severe form of the disease in a mouse model than those that do not. The presence of the virus was implicated into the parasite's replication and spreading. In this respect, studying the molecular mechanisms of cellular control of viral infection is of great medical importance. Here, we report ~30.5 Mb high-quality genome assembly of the LRV1-positive L. guyanensis M4147. This strain was turned into a model by establishing the CRISPR-Cas9 system and ablating the gene encoding phosphatidate phosphatase 2-like (PAP2L) protein. The orthologue of this gene is conspicuously absent from the genome of an unusual member of the family Trypanosomatidae, Vickermania ingenoplastis, a species with mostly bi-flagellated cells. Our analysis of the PAP2L-null L. guyanensis showed an increase in the number of cells strikingly resembling the bi-flagellated V. ingenoplastis, likely as a result of the disruption of the cell cycle, significant accumulation of phosphatidic acid, and increased virulence compared to the wild type cells.
- MeSH
- Cell Cycle MeSH
- Phosphatidate Phosphatase genetics MeSH
- Leishmania guyanensis * MeSH
- Leishmaniavirus MeSH
- Leishmaniasis, Cutaneous * MeSH
- Lipids MeSH
- Mice MeSH
- Parasites * MeSH
- Animals MeSH
- Check Tag
- Mice MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Cancer genomes harbor numerous genomic alterations and many cancers accumulate thousands of nucleotide sequence variations. A prominent fraction of these mutations arises as a consequence of the off-target activity of DNA/RNA editing cytosine deaminases followed by the replication/repair of edited sites by DNA polymerases (pol), as deduced from the analysis of the DNA sequence context of mutations in different tumor tissues. We have used the weight matrix (sequence profile) approach to analyze mutagenesis due to Activation Induced Deaminase (AID) and two error-prone DNA polymerases. Control experiments using shuffled weight matrices and somatic mutations in immunoglobulin genes confirmed the power of this method. Analysis of somatic mutations in various cancers suggested that AID and DNA polymerases η and θ contribute to mutagenesis in contexts that almost universally correlate with the context of mutations in A:T and G:C sites during the affinity maturation of immunoglobulin genes. Previously, we demonstrated that AID contributes to mutagenesis in (de)methylated genomic DNA in various cancers. Our current analysis of methylation data from malignant lymphomas suggests that driver genes are subject to different (de)methylation processes than non-driver genes and, in addition to AID, the activity of pols η and θ contributes to the establishment of methylation-dependent mutation profiles. This may reflect the functional importance of interplay between mutagenesis in cancer and (de)methylation processes in different groups of genes. The resulting changes in CpG methylation levels and chromatin modifications are likely to cause changes in the expression levels of driver genes that may affect cancer initiation and/or progression.
- Publication type
- Journal Article MeSH
Catalase is one of the most abundant enzymes on Earth. It decomposes hydrogen peroxide, thus protecting cells from dangerous reactive oxygen species. The catalase-encoding gene is conspicuously absent from the genome of most representatives of the family Trypanosomatidae. Here, we expressed this protein from the Leishmania mexicana Β-TUBULIN locus using a novel bicistronic expression system, which relies on the 2A peptide of Teschovirus A. We demonstrated that catalase-expressing parasites are severely compromised in their ability to develop in insects, to be transmitted and to infect mice, and to cause clinical manifestation in their mammalian host. Taken together, our data support the hypothesis that the presence of catalase is not compatible with the dixenous life cycle of Leishmania, resulting in loss of this gene from the genome during the evolution of these parasites.
- MeSH
- Virulence Factors genetics metabolism MeSH
- Catalase genetics metabolism MeSH
- Cells, Cultured MeSH
- Leishmania mexicana genetics growth & development pathogenicity MeSH
- Mice, Inbred BALB C MeSH
- Mice MeSH
- Protozoan Proteins genetics MeSH
- Psychodidae parasitology MeSH
- Life Cycle Stages genetics MeSH
- Teschovirus genetics MeSH
- Virulence MeSH
- Animals MeSH
- Check Tag
- Mice MeSH
- Female MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
SUMOylation is a post-translational modification that positively regulates monoallelic expression of the trypanosome variant surface glycoprotein (VSG). The presence of a highly SUMOylated focus associated with the nuclear body, where the VSG gene is transcribed, further suggests an important role of SUMOylation in regulating VSG expression. Here, we show that SNF2PH, a SUMOylated plant homeodomain (PH)-transcription factor, is upregulated in the bloodstream form of the parasite and enriched at the active VSG telomere. SUMOylation promotes the recruitment of SNF2PH to the VSG promoter, where it is required to maintain RNA polymerase I and thus to regulate VSG transcript levels. Further, ectopic overexpression of SNF2PH in insect forms, but not of a mutant lacking the PH domain, induces the expression of bloodstream stage-specific surface proteins. These data suggest that SNF2PH SUMOylation positively regulates VSG monoallelic transcription, while the PH domain is required for the expression of bloodstream-specific surface proteins. Thus, SNF2PH functions as a positive activator, linking expression of infective form surface proteins and VSG regulation, thereby acting as a major regulator of pathogenicity.
- MeSH
- Epigenesis, Genetic MeSH
- Glycoproteins genetics metabolism MeSH
- Protozoan Proteins genetics metabolism MeSH
- Chromatin Assembly and Disassembly MeSH
- RNA Polymerase I metabolism MeSH
- Sumoylation * MeSH
- Transcription Factors genetics metabolism MeSH
- Trypanosoma brucei brucei genetics metabolism MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH