Specialized or secondary metabolites are small molecules of biological origin, often showing potent biological activities with applications in agriculture, engineering and medicine. Usually, the biosynthesis of these natural products is governed by sets of co-regulated and physically clustered genes known as biosynthetic gene clusters (BGCs). To share information about BGCs in a standardized and machine-readable way, the Minimum Information about a Biosynthetic Gene cluster (MIBiG) data standard and repository was initiated in 2015. Since its conception, MIBiG has been regularly updated to expand data coverage and remain up to date with innovations in natural product research. Here, we describe MIBiG version 4.0, an extensive update to the data repository and the underlying data standard. In a massive community annotation effort, 267 contributors performed 8304 edits, creating 557 new entries and modifying 590 existing entries, resulting in a new total of 3059 curated entries in MIBiG. Particular attention was paid to ensuring high data quality, with automated data validation using a newly developed custom submission portal prototype, paired with a novel peer-reviewing model. MIBiG 4.0 also takes steps towards a rolling release model and a broader involvement of the scientific community. MIBiG 4.0 is accessible online at https://mibig.secondarymetabolites.org/.
BACKGROUND: The mammalian Natural Killer Complex (NKC) harbors genes and gene families encoding a variety of C-type lectin-like proteins expressed on various immune cells. The NKC is a complex genomic region well-characterized in mice, humans and domestic animals. The major limitations of automatic annotation of the NKC in non-model animals include short-read based sequencing, methods of assembling highly homologous and repetitive sequences, orthologues missing from reference databases and weak expression. In this situation, manual annotations of complex genomic regions are necessary. METHODS: This study presents a manual annotation of the genomic structure of the NKC region in a high-quality reference genome of the domestic cat and compares it with other felid species and with representatives of other carnivore families. Reference genomes of Carnivora, irrespective of sequencing and assembly methods, were screened by BLAST to retrieve information on their killer cell lectin-like receptor (KLR) gene content. Phylogenetic analysis of in silico translated proteins of expanded subfamilies was carried out. RESULTS: The overall genomic structure of the NKC in Carnivora is rather conservative in terms of its C-type lectin receptor gene content. A novel KLRH-like gene subfamily (KLRL) was identified in all Carnivora and a novel KLRJ-like gene was annotated in the Mustelidae. In all six families studied, one subfamily (KLRC) expanded and experienced pseudogenization. The KLRH gene subfamily expanded in all carnivore families except the Canidae. The KLRL gene subfamily expanded in carnivore families except the Felidae and Canidae, and in the Canidae it eroded to fragments. CONCLUSIONS: Knowledge of the genomic structure and gene content of the NKC region is a prerequisite for accurate annotations of newly sequenced genomes, especially of endangered wildlife species. Identification of expressed genes, pseudogenes and gene fragments in the context of expanded gene families would allow the assessment of functionally important variability in particular species.
- MeSH
- anotace sekvence MeSH
- buňky NK * imunologie metabolismus MeSH
- Carnivora * genetika MeSH
- fylogeneze * MeSH
- genom MeSH
- genomika * metody MeSH
- kočky genetika MeSH
- lektiny typu C genetika MeSH
- zvířata MeSH
- Check Tag
- kočky genetika MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- srovnávací studie MeSH
The mechanisms by which myelodysplastic syndrome (MDS) cells resist the effects of hypomethylating agents (HMA) are currently the subject of intensive research. A better understanding of mechanisms by which the MDS cell becomes to tolerate HMA and progresses to acute myeloid leukemia (AML) requires the development of new cellular models. From MDS/AML cell lines we developed a model of 5-azacytidine (AZA) resistance whose stability was validated by a transplantation approach into immunocompromised mice. When investigating mRNA expression and DNA variants of the AZA resistant phenotype we observed deregulation of several cancer-related pathways including the phosphatidylinosito-3 kinase signaling. We have further shown that these pathways can be modulated by specific inhibitors that, while blocking the proliferation of AZA resistant cells, are unable to increase their sensitivity to AZA. Our data reveal a set of molecular mechanisms that can be targeted to expand therapeutic options during progression on AZA therapy.
- MeSH
- anotace sekvence MeSH
- azacytidin farmakologie MeSH
- biologické modely * MeSH
- chemorezistence * účinky léků genetika MeSH
- DNA nádorová genetika MeSH
- fosfatidylinositol-3-kinasy metabolismus MeSH
- myši SCID MeSH
- myši MeSH
- protoonkogenní proteiny c-akt metabolismus MeSH
- reprodukovatelnost výsledků MeSH
- signální transdukce účinky léků MeSH
- transkriptom genetika MeSH
- zvířata MeSH
- Check Tag
- myši MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
N6-methyladenosine (m6A) and N6,2'-O-dimethyladenosine (m6Am) are two abundant modifications found in mRNAs and ncRNAs that can regulate multiple aspects of RNA biology. They function mainly by regulating interactions with specific RNA-binding proteins. Both modifications are linked to development, disease and stress response. To date, three methyltransferases and two demethylases have been identified that modify adenosines in mammalian mRNAs. Here, we present a comprehensive analysis of the interactomes of these enzymes. PCIF1 protein network comprises mostly factors involved in nascent RNA synthesis by RNA polymerase II, whereas ALKBH5 is closely linked with most aspects of pre-mRNA processing and mRNA export to the cytoplasm. METTL16 resides in subcellular compartments co-inhabited by several other RNA modifiers and processing factors. FTO interactome positions this demethylase at a crossroad between RNA transcription, RNA processing and DNA replication and repair. Altogether, these enzymes share limited spatial interactomes, pointing to specific molecular mechanisms of their regulation.
- MeSH
- adaptorové proteiny signální transdukční genetika metabolismus MeSH
- adenosin analogy a deriváty metabolismus MeSH
- alfa-ketoglutarát-dependentní dioxygenasa, AlkB homolog 5 genetika metabolismus MeSH
- anotace sekvence MeSH
- gen pro FTO genetika metabolismus MeSH
- genetická transkripce MeSH
- genová ontologie MeSH
- HEK293 buňky MeSH
- jaderné proteiny genetika metabolismus MeSH
- lidé MeSH
- mapování interakce mezi proteiny MeSH
- messenger RNA genetika metabolismus MeSH
- methyltransferasy genetika metabolismus MeSH
- N-demethylasy genetika metabolismus MeSH
- nekódující RNA genetika metabolismus MeSH
- oprava DNA MeSH
- protein - isoformy genetika metabolismus MeSH
- replikace DNA MeSH
- vazba proteinů MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
BACKGROUND: Prostate cancer is caused by genomic aberrations in normal epithelial cells, however clinical translation of findings from analyses of cancer cells alone has been very limited. A deeper understanding of the tumour microenvironment is needed to identify the key drivers of disease progression and reveal novel therapeutic opportunities. RESULTS: In this study, the experimental enrichment of selected cell-types, the development of a Bayesian inference model for continuous differential transcript abundance, and multiplex immunohistochemistry permitted us to define the transcriptional landscape of the prostate cancer microenvironment along the disease progression axis. An important role of monocytes and macrophages in prostate cancer progression and disease recurrence was uncovered, supported by both transcriptional landscape findings and by differential tissue composition analyses. These findings were corroborated and validated by spatial analyses at the single-cell level using multiplex immunohistochemistry. CONCLUSIONS: This study advances our knowledge concerning the role of monocyte-derived recruitment in primary prostate cancer, and supports their key role in disease progression, patient survival and prostate microenvironment immune modulation.
- MeSH
- anotace sekvence MeSH
- imunofenotypizace MeSH
- imunohistochemie MeSH
- Kaplanův-Meierův odhad MeSH
- lidé MeSH
- monocyty metabolismus patologie MeSH
- nádorové mikroprostředí genetika MeSH
- nádory prostaty diagnóza genetika metabolismus mortalita MeSH
- prognóza MeSH
- progrese nemoci MeSH
- stanovení celkové genové exprese * metody MeSH
- transkriptom * MeSH
- výpočetní biologie metody MeSH
- vysoce účinné nukleotidové sekvenování MeSH
- Check Tag
- lidé MeSH
- mužské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
BACKGROUND: The ATP-binding cassette (ABC) transporter superfamily is comprised predominantly of proteins which directly utilize energy from ATP to move molecules across the plasma membrane. Although they have been the subject of frequent investigation across many taxa, arthropod ABCs have been less well studied. While the manual annotation of ABC transporters has been performed in many arthropods, there has so far been no systematic comparison of the superfamily within this order using the increasing number of sequenced genomes. Furthermore, functional work on these genes is limited. RESULTS: Here, we developed a standardized pipeline to annotate ABCs from predicted proteomes and used it to perform comparative genomics on ABC families across arthropod lineages. Using Kruskal-Wallis tests and the Computational Analysis of gene Family Evolution (CAFE), we were able to observe significant expansions of the ABC-B full transporters (P-glycoproteins) in Lepidoptera and the ABC-H transporters in Hemiptera. RNA-sequencing of epithelia tissues in the Lepidoptera Helicoverpa armigera showed that the 7 P-glycoprotein paralogues differ substantially in their tissue distribution, suggesting a spatial division of labor. It also seems that functional redundancy is a feature of these transporters as RNAi knockdown showed that most transporters are dispensable with the exception of the highly conserved gene Snu, which is probably due to its role in cuticular formation. CONCLUSIONS: We have performed an annotation of the ABC superfamily across > 150 arthropod species for which good quality protein annotations exist. Our findings highlight specific expansions of ABC transporter families which suggest evolutionary adaptation. Future work will be able to use this analysis as a resource to provide a better understanding of the ABC superfamily in arthropods.
BACKGROUND: Ectoparasites from the family Diplozoidae (Platyhelminthes, Monogenea) belong to obligate haematophagous helminths of cyprinid fish. Current knowledge of these worms is for the most part limited to their morphological, phylogenetic, and population features. Information concerning the biochemical and molecular nature of physiological processes involved in host-parasite interaction, such as evasion of the immune system and its regulation, digestion of macromolecules, suppression of blood coagulation and inflammation, and effect on host tissue and physiology, is lacking. In this study, we report for the first time a comprehensive transcriptomic/secretome description of expressed genes and proteins secreted by the adult stage of Eudiplozoon nipponicum (Goto, 1891) Khotenovsky, 1985, an obligate sanguivorous monogenean which parasitises the gills of the common carp (Cyprinus carpio). RESULTS: RNA-seq raw reads (324,941 Roche 454 and 149,697,864 Illumina) were generated, de novo assembled, and filtered into 37,062 protein-coding transcripts. For 19,644 (53.0%) of them, we determined their sequential homologues. In silico functional analysis of E. nipponicum RNA-seq data revealed numerous transcripts, pathways, and GO terms responsible for immunomodulation (inhibitors of proteolytic enzymes, CD59-like proteins, fatty acid binding proteins), feeding (proteolytic enzymes cathepsins B, D, L1, and L3), and development (fructose 1,6-bisphosphatase, ferritin, and annexin). LC-MS/MS spectrometry analysis identified 721 proteins secreted by E. nipponicum with predominantly immunomodulatory and anti-inflammatory functions (peptidyl-prolyl cis-trans isomerase, homolog to SmKK7, tetraspanin) and ability to digest host macromolecules (cathepsins B, D, L1). CONCLUSIONS: In this study, we integrated two high-throughput sequencing techniques, mass spectrometry analysis, and comprehensive bioinformatics approach in order to arrive at the first comprehensive description of monogenean transcriptome and secretome. Exploration of E. nipponicum transcriptome-related nucleotide sequences and translated and secreted proteins offer a better understanding of molecular biology and biochemistry of these, often neglected, organisms. It enabled us to report the essential physiological pathways and protein molecules involved in their interactions with the fish hosts.
Bacterial pathogens sense specific cues associated with different host niches and integrate these signals to appropriately adjust the global gene expression. Bordetella pertussis is a Gram-negative, strictly human pathogen of the respiratory tract and the etiological agent of whooping cough (pertussis). Though B. pertussis does not cause invasive infections, previous results indicated that this reemerging pathogen responds to blood exposure. Here, omics RNA-seq and LC-MS/MS techniques were applied to determine the blood-responsive regulon of B. pertussis. These analyses revealed that direct contact with blood rewired global gene expression profiles in B. pertussis as the expression of almost 20% of all genes was significantly modulated. However, upon loss of contact with blood, the majority of blood-specific effects vanished, with the exception of several genes encoding the T3SS-secreted substrates. For the first time, the T3SS regulator BtrA was identified in culture supernatants of B. pertussis. Furthermore, proteomic analysis identified BP2259 protein as a novel secreted T3SS substrate, which is required for T3SS functionality. Collectively, presented data indicate that contact with blood represents an important cue for B. pertussis cells.
- MeSH
- anotace sekvence MeSH
- bakteriální proteiny metabolismus MeSH
- Bordetella pertussis fyziologie MeSH
- chromatografie kapalinová MeSH
- faktory virulence MeSH
- genomika * metody MeSH
- lidé MeSH
- proteomika * metody MeSH
- regulace genové exprese u bakterií MeSH
- sekreční systém typu III genetika metabolismus MeSH
- stanovení celkové genové exprese MeSH
- tandemová hmotnostní spektrometrie MeSH
- transkriptom MeSH
- virulence MeSH
- výpočetní biologie metody MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
Deregulation of microRNA (miRNA) expression plays a critical role in the transition from a physiological to a pathological state. The accurate miRNA promoter identification in multiple cell types is a fundamental endeavor towards understanding and characterizing the underlying mechanisms of both physiological as well as pathological conditions. DIANA-miRGen v4 (www.microrna.gr/mirgenv4) provides cell type specific miRNA transcription start sites (TSSs) for over 1500 miRNAs retrieved from the analysis of >1000 cap analysis of gene expression (CAGE) samples corresponding to 133 tissues, cell lines and primary cells available in FANTOM repository. MiRNA TSS locations were associated with transcription factor binding site (TFBSs) annotation, for >280 TFs, derived from analyzing the majority of ENCODE ChIP-Seq datasets. For the first time, clusters of cell types having common miRNA TSSs are characterized and provided through a user friendly interface with multiple layers of customization. DIANA-miRGen v4 significantly improves our understanding of miRNA biogenesis regulation at the transcriptional level by providing a unique integration of high-quality annotations for hundreds of cell specific miRNA promoters with experimentally derived TFBSs.
- MeSH
- anotace sekvence MeSH
- buněčné linie MeSH
- databáze nukleových kyselin * MeSH
- genetická transkripce MeSH
- genom * MeSH
- internet MeSH
- lidé MeSH
- mikro RNA genetika metabolismus MeSH
- počátek transkripce MeSH
- primární buněčná kultura MeSH
- promotorové oblasti (genetika) * MeSH
- sekvence nukleotidů MeSH
- software * MeSH
- transkripční faktory genetika metabolismus MeSH
- vazba proteinů MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments, and CATH+, with additional derived data, such as predicted sequence domains, and functionally coherent sequence subsets (Functional Families or FunFams). The latest CATH+ release, version 4.3, significantly increases coverage of structural and sequence data, with an addition of 65,351 fully-classified domains structures (+15%), providing 500 238 structural domains, and 151 million predicted sequence domains (+59%) assigned to 5481 superfamilies. The FunFam generation pipeline has been re-engineered to cope with the increased influx of data. Three times more sequences are captured in FunFams, with a concomitant increase in functional purity, information content and structural coverage. FunFam expansion increases the structural annotations provided for experimental GO terms (+59%). We also present CATH-FunVar web-pages displaying variations in protein sequences and their proximity to known or predicted functional sites. We present two case studies (1) putative cancer drivers and (2) SARS-CoV-2 proteins. Finally, we have improved links to and from CATH including SCOP, InterPro, Aquaria and 2DProt.
- MeSH
- anotace sekvence MeSH
- COVID-19 epidemiologie prevence a kontrola virologie MeSH
- databáze proteinů statistika a číselné údaje MeSH
- epidemie MeSH
- internet MeSH
- lidé MeSH
- proteinové domény * MeSH
- proteiny chemie genetika metabolismus MeSH
- SARS-CoV-2 genetika metabolismus fyziologie MeSH
- sekvence aminokyselin MeSH
- sekvenční analýza proteinů metody MeSH
- sekvenční homologie aminokyselin MeSH
- virové proteiny chemie genetika metabolismus MeSH
- výpočetní biologie metody statistika a číselné údaje MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH