protein database Dotaz Zobrazit nápovědu
Proteins are the most abundant component of the cell nucleus, where they perform a plethora of functions, including the assembly of long DNA molecules into condensed chromatin, DNA replication and repair, regulation of gene expression, synthesis of RNA molecules and their modification. Proteins are important components of nuclear bodies and are involved in the maintenance of the nuclear architecture, transport across the nuclear envelope and cell division. Given their importance, the current poor knowledge of plant nuclear proteins and their dynamics during the cell's life and division is striking. Several factors hamper the analysis of the plant nuclear proteome, but the most critical seems to be the contamination of nuclei by cytosolic material during their isolation. With the availability of an efficient protocol for the purification of plant nuclei, based on flow cytometric sorting, contamination by cytoplasmic remnants can be minimized. Moreover, flow cytometry allows the separation of nuclei in different stages of the cell cycle (G1, S, and G2). This strategy has led to the identification of large number of nuclear proteins from barley (Hordeum vulgare), thus triggering the creation of a dedicated database called UNcleProt, http://barley.gambrinus.ueb.cas.cz/ .
- Klíčová slova
- barley, cell cycle, database, flow-cytometry, localization, mass spectrometry, nuclear proteome, nucleus,
- MeSH
- buněčný cyklus * MeSH
- data mining MeSH
- databáze proteinů * MeSH
- jaderné proteiny klasifikace metabolismus MeSH
- ječmen (rod) cytologie MeSH
- rostlinné proteiny klasifikace metabolismus MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- jaderné proteiny MeSH
- rostlinné proteiny MeSH
Genetic variation occurring within conserved functional protein domains warrants special attention when examining DNA variation in the context of disease causation. Here we introduce a resource, freely available at www.prot2hg.com, that addresses the question of whether a particular variant falls onto an annotated protein domain and directly translates chromosomal coordinates onto protein residues. The tool can perform a multiple-site query in a simple way, and the whole dataset is available for download as well as incorporated into our own accessible pipeline. To create this resource, National Center for Biotechnology Information protein data were retrieved using the Entrez Programming Utilities. After processing all human protein domains, residue positions were reverse translated and mapped to the reference genome hg19 and stored in a MySQL database. In total, 760 487 protein domains from 42 371 protein models were mapped to hg19 coordinates and made publicly available for search or download (www.prot2hg.com). In addition, this annotation was implemented into the genomics research platform GENESIS in order to query nearly 8000 exomes and genomes of families with rare Mendelian disorders (tgp-foundation.org). When applied to patient genetic data, we found that rare (<1%) variants in the Genome Aggregation Database were significantly more annotated onto a protein domain in comparison to common (>1%) variants. Similarly, variants described as pathogenic or likely pathogenic in ClinVar were more likely to be annotated onto a domain. In addition, we tested a dataset consisting of 60 causal variants in a cohort of patients with epileptic encephalopathy and found that 71% of them (43 variants) were propagated onto protein domains. In summary, we developed a resource that annotates variants in the coding part of the genome onto conserved protein domains in order to increase variant prioritization efficiency. Database URL: www.prot2hg.com.
- MeSH
- anotace sekvence metody MeSH
- data mining metody MeSH
- databáze genetické * MeSH
- datové kurátorství metody MeSH
- genetická variace * MeSH
- genom lidský genetika MeSH
- genomika metody MeSH
- internet MeSH
- lidé MeSH
- proteinové domény genetika MeSH
- proteiny chemie genetika metabolismus MeSH
- výpočetní biologie metody MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- proteiny MeSH
The majority of naturally occurring proteins have evolved to function under mild conditions inside the living organisms. One of the critical obstacles for the use of proteins in biotechnological applications is their insufficient stability at elevated temperatures or in the presence of salts. Since experimental screening for stabilizing mutations is typically laborious and expensive, in silico predictors are often used for narrowing down the mutational landscape. The recent advances in machine learning and artificial intelligence further facilitate the development of such computational tools. However, the accuracy of these predictors strongly depends on the quality and amount of data used for training and testing, which have often been reported as the current bottleneck of the approach. To address this problem, we present a novel database of experimental thermostability data for single-point mutants FireProtDB. The database combines the published datasets, data extracted manually from the recent literature, and the data collected in our laboratory. Its user interface is designed to facilitate both types of the expected use: (i) the interactive explorations of individual entries on the level of a protein or mutation and (ii) the construction of highly customized and machine learning-friendly datasets using advanced searching and filtering. The database is freely available at https://loschmidt.chemi.muni.cz/fireprotdb.
- MeSH
- anotace sekvence MeSH
- bodová mutace * MeSH
- databáze proteinů * MeSH
- datové soubory jako téma MeSH
- internet MeSH
- molekulární modely MeSH
- proteiny chemie genetika MeSH
- software MeSH
- stabilita proteinů MeSH
- strojové učení statistika a číselné údaje MeSH
- výpočetní biologie metody MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- proteiny MeSH
ChannelsDB 2.0 is an updated database providing structural information about the position, geometry and physicochemical properties of protein channels-tunnels and pores-within deposited biomacromolecular structures from PDB and AlphaFoldDB databases. The newly deposited information originated from several sources. Firstly, we included data calculated using a popular CAVER tool to complement the data obtained using original MOLE tool for detection and analysis of protein tunnels and pores. Secondly, we added tunnels starting from cofactors within the AlphaFill database to enlarge the scope of the database to protein models based on Uniprot. This has enlarged available channel annotations ∼4.6 times as of 1 September 2023. The database stores information about geometrical features, e.g. length and radius, and physico-chemical properties based on channel-lining amino acids. The stored data are interlinked with the available UniProt mutation annotation data. ChannelsDB 2.0 provides an excellent resource for deep analysis of the role of biomacromolecular tunnels and pores. The database is available free of charge: https://channelsdb2.biodata.ceitec.cz.
- MeSH
- aminokyseliny MeSH
- databáze proteinů * MeSH
- konformace proteinů MeSH
- proteiny * chemie MeSH
- software * MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- aminokyseliny MeSH
- proteiny * MeSH
Morphine is considered a gold standard in pain treatment. Nevertheless, its use could be associated with severe side effects, including drug addiction. Thus, it is very important to understand the molecular mechanism of morphine action in order to develop new methods of pain therapy, or at least to attenuate the side effects of opioids usage. Proteomics allows for the indication of proteins involved in certain biological processes, but the number of items identified in a single study is usually overwhelming. Thus, researchers face the difficult problem of choosing the proteins which are really important for the investigated processes and worth further studies. Therefore, based on the 29 published articles, we created a database of proteins regulated by morphine administration - The Morphinome Database (addiction-proteomics.org). This web tool allows for indicating proteins that were identified during different proteomics studies. Moreover, the collection and organization of such a vast amount of data allows us to find the same proteins that were identified in various studies and to create their ranking, based on the frequency of their identification. STRING and KEGG databases indicated metabolic pathways which those molecules are involved in. This means that those molecular pathways seem to be strongly affected by morphine administration and could be important targets for further investigations. SIGNIFICANCE: The data about proteins identified by different proteomics studies of molecular changes caused by morphine administration (29 published articles) were gathered in the Morphinome Database. Unification of those data allowed for the identification of proteins that were indicated several times by distinct proteomics studies, which means that they seem to be very well verified and important for the entire process. Those proteins might be now considered promising aims for more detailed studies of their role in the molecular mechanism of morphine action.
- Klíčová slova
- Bioinformatics, Database, Meta-analysis, Metabolic pathways, Morphine, Morphinome, Proteins, Proteome,
- MeSH
- databáze faktografické * MeSH
- databáze jako téma MeSH
- internet MeSH
- lidé MeSH
- metabolické sítě a dráhy MeSH
- morfin aplikace a dávkování MeSH
- proteiny účinky léků MeSH
- proteomika metody MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- morfin MeSH
- proteiny MeSH
Protein solubility is an attractive engineering target primarily due to its relation to yields in protein production and manufacturing. Moreover, better knowledge of the mutational effects on protein solubility could connect several serious human diseases with protein aggregation. However, we have limited understanding of the protein structural determinants of solubility, and the available data have mostly been scattered in the literature. Here, we present SoluProtMutDB - the first database containing data on protein solubility changes upon mutations. Our database accommodates 33 000 measurements of 17 000 protein variants in 103 different proteins. The database can serve as an essential source of information for the researchers designing improved protein variants or those developing machine learning tools to predict the effects of mutations on solubility. The database comprises all the previously published solubility datasets and thousands of new data points from recent publications, including deep mutational scanning experiments. Moreover, it features many available experimental conditions known to affect protein solubility. The datasets have been manually curated with substantial corrections, improving suitability for machine learning applications. The database is available at loschmidt.chemi.muni.cz/soluprotmutdb.
- Klíčová slova
- Machine learning, Mutational database, Protein aggregation, Protein engineering, Protein yield, Soluble expression,
- Publikační typ
- časopisecké články MeSH
SUMMARY: Secondary structures provide a deep insight into the protein architecture. They can serve for comparison between individual protein family members. The most straightforward way how to deal with protein secondary structure is its visualization using 2D diagrams. Several software tools for the generation of 2D diagrams were developed. Unfortunately, they create 2D diagrams based on only a single protein. Therefore, 2D diagrams of two proteins from one family markedly differ. For this reason, we developed the 2DProts database, which contains secondary structure 2D diagrams for all domains from the CATH and all proteins from PDB databases. These 2D diagrams are generated based on a whole protein family, and they also consider information about the 3D arrangement of secondary structure elements. Moreover, 2DProts database contains multiple 2D diagrams, which provide an overview of a whole protein family's secondary structures. 2DProts is updated weekly and is integrated into CATH. AVAILABILITY AND IMPLEMENTATION: Freely accessible at https://2dprots.ncbr.muni.cz. The web interface was implemented in JavaScript. The database was implemented in Python. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
- MeSH
- databáze faktografické MeSH
- proteiny * chemie MeSH
- sekundární struktura proteinů MeSH
- software * MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- proteiny * MeSH
We have started the construction of a two-dimensional database of the proteome of Francisella tularensis, a bacterium that is responsible for the highly pathogenic disease tularemia. The genome of this intracellular pathogen is not completely sequenced yet and, currently, information about only 66 proteins is available from NCBI database. We have analyzed the F. tularensis live vaccine strain by two-dimensional gel electrophoresis with immobilized pH 3-10 gradient in the first dimension and 9-16% gradient or tricine SDS-PAGE in the second dimension. In both cases about 2000 spots were detected. Furthermore, we compared the protein pattern of the nonvirulent F. tularensis live vaccine strain with protein profiles of two wild type clinical isolates and more than 50 differentially expressed proteins were counted. The separated proteins are going to be identified by peptide mass fingerprinting. However, due to the lack of complete genome sequence data only eight proteins were unambiguously identified. Among them, acid phosphatase and the most basic isoform of a hypothetical 23 kDa protein are characteristic only for virulent strains.
- MeSH
- 2D gelová elektroforéza MeSH
- bakteriální proteiny chemie genetika izolace a purifikace MeSH
- databáze proteinů * MeSH
- Francisella tularensis chemie genetika patogenita MeSH
- peptidové mapování MeSH
- proteom chemie genetika izolace a purifikace MeSH
- spektrometrie hmotnostní - ionizace laserem za účasti matrice MeSH
- techniky in vitro MeSH
- virulence MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- bakteriální proteiny MeSH
- proteom MeSH
Following the discovery of serious errors in the structure of biomacromolecules, structure validation has become a key topic of research, especially for ligands and non-standard residues. ValidatorDB (freely available at http://ncbr.muni.cz/ValidatorDB) offers a new step in this direction, in the form of a database of validation results for all ligands and non-standard residues from the Protein Data Bank (all molecules with seven or more heavy atoms). Model molecules from the wwPDB Chemical Component Dictionary are used as reference during validation. ValidatorDB covers the main aspects of validation of annotation, and additionally introduces several useful validation analyses. The most significant is the classification of chirality errors, allowing the user to distinguish between serious issues and minor inconsistencies. Other such analyses are able to report, for example, completely erroneous ligands, alternate conformations or complete identity with the model molecules. All results are systematically classified into categories, and statistical evaluations are performed. In addition to detailed validation reports for each molecule, ValidatorDB provides summaries of the validation results for the entire PDB, for sets of molecules sharing the same annotation (three-letter code) or the same PDB entry, and for user-defined selections of annotations or PDB entries.
- MeSH
- aminokyseliny chemie MeSH
- anotace sekvence MeSH
- databáze proteinů * MeSH
- internet MeSH
- konformace proteinů MeSH
- ligandy MeSH
- molekulární modely MeSH
- proteiny chemie MeSH
- reprodukovatelnost výsledků MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- aminokyseliny MeSH
- ligandy MeSH
- proteiny MeSH
The accessibility of the partial genome sequence of Francisella tularensis strain Schu 4 was the starting point for a comprehensive proteome analysis of the intracellular pathogen F. tularensis. The main goal of this study is identification of protein candidates of value for the development of diagnostics, therapeutics and vaccines. In this review, the current status of 2-DE F. tularensis database building, approaches used for identification of biologically important subsets of F. tularensis proteins, and functional and topological assignments of identified proteins using various prediction programs and database homology searches are presented.
- MeSH
- bakteriální proteiny chemie genetika MeSH
- databáze proteinů * MeSH
- Francisella tularensis chemie MeSH
- proteom * MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- přehledy MeSH
- Názvy látek
- bakteriální proteiny MeSH
- proteom * MeSH