SARS-CoV-2 is a novel positive-sense single-stranded RNA virus from the Coronaviridae family (genus Betacoronavirus), which has been established as causing the COVID-19 pandemic. The genome of SARS-CoV-2 is one of the largest among known RNA viruses, comprising of at least 26 known protein-coding loci. Studies thus far have outlined the coding capacity of the positive-sense strand of the SARS-CoV-2 genome, which can be used directly for protein translation. However, it has been recently shown that transcribed negative-sense viral RNA intermediates that arise during viral genome replication from positive-sense viruses can also code for proteins. No studies have yet explored the potential for negative-sense SARS-CoV-2 RNA intermediates to contain protein-coding loci. Thus, using sequence and structure-based bioinformatics methodologies, we have investigated the presence and validity of putative negative-sense ORFs (nsORFs) in the SARS-CoV-2 genome. Nine nsORFs were discovered to contain strong eukaryotic translation initiation signals and high codon adaptability scores, and several of the nsORFs were predicted to interact with RNA-binding proteins. Evolutionary conservation analyses indicated that some of the nsORFs are deeply conserved among related coronaviruses. Three-dimensional protein modeling revealed the presence of higher order folding among all putative SARS-CoV-2 nsORFs, and subsequent structural mimicry analyses suggest similarity of the nsORFs to DNA/RNA-binding proteins and proteins involved in immune signaling pathways. Altogether, these results suggest the potential existence of still undescribed SARS-CoV-2 proteins, which may play an important role in the viral lifecycle and COVID-19 pathogenesis.
- MeSH
- COVID-19 * genetika MeSH
- genom virový MeSH
- lidé MeSH
- pandemie MeSH
- proteiny vázající RNA genetika MeSH
- RNA virová chemie genetika MeSH
- SARS-CoV-2 * genetika MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
SARS-CoV-2 is an intensively investigated virus from the order Nidovirales (Coronaviridae family) that causes COVID-19 disease in humans. Through enormous scientific effort, thousands of viral strains have been sequenced to date, thereby creating a strong background for deep bioinformatics studies of the SARS-CoV-2 genome. In this study, we inspected high-frequency mutations of SARS-CoV-2 and carried out systematic analyses of their overlay with inverted repeat (IR) loci and CpG islands. The main conclusion of our study is that SARS-CoV-2 hot-spot mutations are significantly enriched within both IRs and CpG island loci. This points to their role in genomic instability and may predict further mutational drive of the SARS-CoV-2 genome. Moreover, CpG islands are strongly enriched upstream from viral ORFs and thus could play important roles in transcription and the viral life cycle. We hypothesize that hypermethylation of these loci will decrease the transcription of viral ORFs and could therefore limit the progression of the disease.
- MeSH
- COVID-19 virologie MeSH
- CpG ostrůvky * MeSH
- genom virový MeSH
- lidé MeSH
- metylace DNA MeSH
- mutace * MeSH
- SARS-CoV-2 genetika MeSH
- vazba proteinů MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Non-canonical nucleic acid structures play important roles in the regulation of molecular processes. Considering the importance of the ongoing coronavirus crisis, we decided to evaluate genomes of all coronaviruses sequenced to date (stated more broadly, the order Nidovirales) to determine if they contain non-canonical nucleic acid structures. We discovered much evidence of putative G-quadruplex sites and even much more of inverted repeats (IRs) loci, which in fact are ubiquitous along the whole genomic sequence and indicate a possible mechanism for genomic RNA packaging. The most notable enrichment of IRs was found inside 5'UTR for IRs of size 12+ nucleotides, and the most notable enrichment of putative quadruplex sites (PQSs) was located before 3'UTR, inside 5'UTR, and before mRNA. This indicates crucial regulatory roles for both IRs and PQSs. Moreover, we found multiple G-quadruplex binding motifs in human proteins having potential for binding of SARS-CoV-2 RNA. Non-canonical nucleic acids structures in Nidovirales and in novel SARS-CoV-2 are therefore promising druggable structures that can be targeted and utilized in the future.
- Publikační typ
- časopisecké články MeSH
The p53 family of transcription factors plays key roles in development, genome stability, senescence and tumor development, and p53 is the most important tumor suppressor protein in humans. Although intensively investigated for many years, its initial evolutionary history is not yet fully elucidated. Using bioinformatic and structure prediction methods on current databases containing newly-sequenced genomes and transcriptomes, we present a detailed characterization of p53 family homologs in remote members of the Holozoa group, in the unicellular clades Filasterea, Ichthyosporea and Corallochytrea. Moreover, we show that these newly characterized homologous sequences contain domains that can form structures with high similarity to the human p53 family DNA-binding domain, and some also show similarities to the oligomerization and SAM domains. The presence of these remote homologs demonstrates an ancient origin of the p53 protein family.
- MeSH
- databáze genetické MeSH
- Eukaryota klasifikace genetika MeSH
- exony MeSH
- fylogeneze MeSH
- interakční proteinové domény a motivy MeSH
- introny MeSH
- konformace proteinů MeSH
- molekulární evoluce * MeSH
- molekulární modely MeSH
- multigenová rodina * MeSH
- nádorový supresorový protein p53 chemie genetika metabolismus MeSH
- sekvence aminokyselin MeSH
- sekvenční homologie aminokyselin * MeSH
- Publikační typ
- časopisecké články MeSH
The role of local DNA structures in the regulation of basic cellular processes is an emerging field of research. Amongst local non-B DNA structures, the significance of G-quadruplexes was demonstrated in the last decade, and their presence and functional relevance has been demonstrated in many genomes, including humans. In this study, we analyzed the presence and locations of G-quadruplex-forming sequences by G4Hunter in all complete bacterial genomes available in the NCBI database. G-quadruplex-forming sequences were identified in all species, however the frequency differed significantly across evolutionary groups. The highest frequency of G-quadruplex forming sequences was detected in the subgroup Deinococcus-Thermus, and the lowest frequency in Thermotogae. G-quadruplex forming sequences are non-randomly distributed and are favored in various evolutionary groups. G-quadruplex-forming sequences are enriched in ncRNA segments followed by mRNAs. Analyses of surrounding sequences showed G-quadruplex-forming sequences around tRNA and regulatory sequences. These data point to the unique and non-random localization of G-quadruplex-forming sequences in bacterial genomes.
- MeSH
- Bacteria genetika MeSH
- DNA bakterií chemie MeSH
- fylogeneze MeSH
- G-kvadruplexy * MeSH
- genom bakteriální MeSH
- konformace nukleové kyseliny MeSH
- lidé MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
The importance of local DNA structures in the regulation of basic cellular processes is an emerging field of research. Amongst local non-B DNA structures, G-quadruplexes are perhaps the most well-characterized to date, and their presence has been demonstrated in many genomes, including that of humans. G-quadruplexes are selectively bound by many regulatory proteins. In this paper, we have analyzed the amino acid composition of all seventy-seven described G-quadruplex binding proteins of Homo sapiens. Our comparison with amino acid frequencies in all human proteins and specific protein subsets (e.g., all nucleic acid binding) revealed unique features of quadruplex binding proteins, with prominent enrichment for glycine (G) and arginine (R). Cluster analysis with bootstrap resampling shows similarities and differences in amino acid composition of particular quadruplex binding proteins. Interestingly, we found that all characterized G-quadruplex binding proteins share a 20 amino acid long motif/domain (RGRGR GRGGG SGGSG GRGRG) which is similar to the previously described RG-rich domain (RRGDG RRRGG GGRGQ GGRGR GGGFKG) of the FRM1 G-quadruplex binding protein. Based on this protein fingerprint, we have predicted a new set of potential G-quadruplex binding proteins sharing this interesting domain rich in glycine and arginine residues.
Quadruplexes are noncanonical DNA structures that arise in guanine rich loci and have important biological functions. Classically, quadruplexes contain four stacked intramolecular G-tetrads. Surprisingly, although some algorithms allow searching for longer than 4G tracts for quadruplex formation, these have not yet been systematically studied. Therefore, we analyzed the human genome for sequences that are predicted to adopt stacked intramolecular G-tetrads with greater than four stacks. The data provide evidence for numerous G-quadruplexes that contain five or six stacked intramolecular G-tetrads. These sequences are predominantly found in known gene regulatory regions. Electrophoretic mobility assays and circular dichroism spectroscopy indicate that these sequences form quadruplex structures in vitro under physiological conditions. The localization and in vitro stability of these G-quadruplexes indicate their potentially important roles in gene regulation and their potential for therapeutic applications.
This present study investigated the impact of the application of stem cells to liver regeneration following the first stage of associating liver partition and portal vein ligation for staged hepatectomy (ALPPS). The experiment was conducted on a pig model (n=6, 3 that did not receive application of stem cells, 3 that received application stem cells). Collected samples of liver (day 0 and 9 following surgery) were subjected to complete transcriptome sequencing. In total, 39 differentially expressed genes were found in the group without the application of the stem cells (genes of unwanted processes such as fibrosis and inflammation). In the group that did receive application of stem cells, no significantly differentially expressed genes were found, indicating a properly regenerated liver remnant. The present study therefore demonstrated, to the best of our knowledge for the first time, the positive effect of stem cells application in the liver regeneration process during ALPPS procedure in the pig model.
- Publikační typ
- časopisecké články MeSH
Use of fungal organisms in rotating biological contactors (RBC) for bioremediation of liquid industrial wastes has so far been limited in spite of their significant biodegradation potential. The purpose was to investigate the power of RBC using Irpex lacteus for decolorization and detoxification of industrial dyes and dyeing textile liquors. Recalcitrant dye Methylene Blue (150 mg L(-1)) was decolorized within 70 days, its mutagenicity removed, and the biological toxicity decreased more than 10-fold. I. lacteus biofilm in the RBC completely decolorized within 26 and 47 days dyeing liquors containing disperse or reactive dyes adjusted to pH4.5 and 5-fold diluted with the growth medium, respectively. Their respective biological toxicity values were reduced 10- to 10(4)-fold in dependence of the test used. A battery of toxicity tests comprising Vibrio fisheri, Lemna minor and Sinapis alba was efficient to monitor the toxicity of textile dyes and wastewaters. Strong decolorization and detoxification power of RBC using I. lacteus biofilms was demonstrated.
- MeSH
- Aliivibrio fischeri účinky léků MeSH
- Araceae účinky léků MeSH
- barva MeSH
- barvicí látky metabolismus MeSH
- biodegradace MeSH
- bioreaktory MeSH
- chemické látky znečišťující vodu metabolismus MeSH
- Magnoliopsida účinky léků MeSH
- methylenová modř metabolismus MeSH
- odpad tekutý - odstraňování přístrojové vybavení metody MeSH
- odpadní voda * toxicita MeSH
- Polyporales metabolismus MeSH
- testy toxicity MeSH
- textilní průmysl MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH