Simulation annotations
Dotaz
Zobrazit nápovědu
MOTIVATION: Objective assessment of bioimage analysis methods is an essential step towards understanding their robustness and parameter sensitivity, calling for the availability of heterogeneous bioimage datasets accompanied by their reference annotations. Because manual annotations are known to be arduous, highly subjective and barely reproducible, numerous simulators have emerged over past decades, generating synthetic bioimage datasets complemented with inherent reference annotations. However, the installation and configuration of these tools generally constitutes a barrier to their widespread use. RESULTS: We present a modern, modular web-interface, CytoPacq, to facilitate the generation of synthetic benchmark datasets relevant for multi-dimensional cell imaging. CytoPacq poses a user-friendly graphical interface with contextual tooltips and currently allows a comfortable access to various cell simulation systems of fluorescence microscopy, which have already been recognized and used by the scientific community, in a straightforward and self-contained form. AVAILABILITY AND IMPLEMENTATION: CytoPacq is a publicly available online service running at https://cbia.fi.muni.cz/simulator. More information about it as well as examples of generated bioimage datasets are available directly through the web-interface. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
- MeSH
- počítačová simulace MeSH
- software * MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Background: Developmental coordination disorder (DCD) is described as a motor skill disorder characterized by a marked impairment in the development of motor coordination abilities that significantly interferes with performance of daily activities and/or academic achievement. Since some electrophysiological studies suggest differences between children with/without motor development problems, we prepared an experimental protocol and performed electrophysiological experiments with the aim of making a step toward a possible diagnosis of this disorder using the event-related potentials (ERP) technique. The second aim is to properly annotate the obtained raw data with relevant metadata and promote their long-term sustainability. Results: The data from 32 school children (16 with possible DCD and 16 in the control group) were collected. Each dataset contains raw electroencephalography (EEG) data in the BrainVision format and provides sufficient metadata (such as age, gender, results of the motor test, and hearing thresholds) to allow other researchers to perform analysis. For each experiment, the percentage of ERP trials damaged by blinking artifacts was estimated. Furthermore, ERP trials were averaged across different participants and conditions, and the resulting plots are included in the manuscript. This should help researchers to estimate the usability of individual datasets for analysis. Conclusions: The aim of the whole project is to find out if it is possible to make any conclusions about DCD from EEG data obtained. For the purpose of further analysis, the data were collected and annotated respecting the current outcomes of the International Neuroinformatics Coordinating Facility Program on Standards for Data Sharing, the Task Force on Electrophysiology, and the group developing the Ontology for Experimental Neurophysiology. The data with metadata are stored in the EEG/ERP Portal.
- MeSH
- akustická stimulace MeSH
- datové kurátorství MeSH
- dítě MeSH
- elektroencefalografie MeSH
- evokované potenciály MeSH
- komorbidita MeSH
- kvantitativní znak dědičný MeSH
- lidé MeSH
- počítačová simulace MeSH
- poruchy motorických dovedností diagnóza MeSH
- reakční čas MeSH
- reprodukovatelnost výsledků MeSH
- software MeSH
- světelná stimulace MeSH
- Check Tag
- dítě MeSH
- lidé MeSH
- mužské pohlaví MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
E-photosynthesis framework is a web-based platform for modeling and analysis of photosynthetic processes. Compared to its earlier version, the present platform employs advanced software methods and technologies to support an effective implementation of vastly diverse kinetic models of photosynthesis. We report on the first phase implementation of the tool new version and demonstrate the functionalities of model visualization, presentation of model components, rate constants, initial conditions and of model annotation. The demonstration also includes export of a model to the Systems Biology Markup Language format and remote numerical simulation of the model.
A computational model of gene expression was applied to a novel test set of microarray time series measurements to reveal regulatory interactions between transcriptional regulators represented by 45 sigma factors and the genes expressed during germination of a prokaryote Streptomyces coelicolor. Using microarrays, the first 5.5 h of the process was recorded in 13 time points, which provided a database of gene expression time series on genome-wide scale. The computational modeling of the kinetic relations between the sigma factors, individual genes and genes clustered according to the similarity of their expression kinetics identified kinetically plausible sigma factor-controlled networks. Using genome sequence annotations, functional groups of genes that were predominantly controlled by specific sigma factors were identified. Using external binding data complementing the modeling approach, specific genes involved in the control of the studied process were identified and their function suggested.
- MeSH
- genetická transkripce MeSH
- genové regulační sítě * MeSH
- kinetika MeSH
- modely genetické * MeSH
- počítačová simulace MeSH
- regulace genové exprese u bakterií * MeSH
- sekvenční analýza hybridizací s uspořádaným souborem oligonukleotidů MeSH
- sigma faktor metabolismus MeSH
- spory bakteriální genetika růst a vývoj metabolismus MeSH
- stanovení celkové genové exprese * MeSH
- Streptomyces coelicolor genetika metabolismus fyziologie MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Protein structural families are groups of homologous proteins defined by the organization of secondary structure elements (SSEs). Nowadays, many families contain vast numbers of structures, and the SSEs can help to orient within them. Communities around specific protein families have even developed specialized SSE annotations, always assigning the same name to the equivalent SSEs in homologous proteins. A detailed analysis of the groups of equivalent SSEs provides an overview of the studied family and enriches the analysis of any particular protein at hand. We developed a workflow for the analysis of the secondary structure anatomy of a protein family. We applied this analysis to the model family of cytochromes P450 (CYPs)-a family of important biotransformation enzymes with a community-wide used SSE annotation. We report the occurrence, typical length and amino acid sequence for the equivalent SSE groups, the conservation/variability of these properties and relationship to the substrate recognition sites. We also suggest a generic residue numbering scheme for the CYP family. Comparing the bacterial and eukaryotic part of the family highlights the significant differences and reveals a well-known anomalous group of bacterial CYPs with some typically eukaryotic features. Our workflow for SSE annotation for CYP and other families can be freely used at address https://sestra.ncbr.muni.cz .
Computational systems biology provides multiple formalisms for modelling of biochemical processes among which the rule-based approach is one of the most suitable. Its main advantage is a compact and precise mechanistic description of complex processes. However, state-of-the-art rule-based languages still suffer several shortcomings that limit their use in practice. In particular, the elementary (low-level) syntax and semantics of rule-based languages complicate model construction and maintenance for users outside computer science. On the other hand, mathematical models based on differential equations (ODEs) still make the most typical used modelling framework. In consequence, robust re-interpretation and integration of models are difficult, thus making the systems biology paradigm technically challenging. Though several high-level languages have been developed at the top of rule-based principles, none of them provides a satisfactory and complete solution for semi-automated description and annotation of heterogeneous biophysical processes integrated at the cellular level. We present the second generation of a rule-based language called Biochemical Space Language (BCSL) that combines the advantages of different approaches and thus makes an effort to overcome several problems of existing solutions. BCSL relies on the formal basis of the rule-based methodology while preserving user-friendly syntax of plain chemical equations. BCSL combines the following aspects: the level of abstraction that hides structural and quantitative details but yet gives a precise mechanistic view of systems dynamics; executable semantics allowing formal analysis and consistency checking at the level of the language; universality allowing the integration of different biochemical mechanisms; scalability and compactness of the specification; hierarchical specification and composability of chemical entities; and support for genome-scale annotation.
- MeSH
- algoritmy MeSH
- biochemické jevy * MeSH
- biologické modely * MeSH
- jazyk (prostředek komunikace) MeSH
- lidé MeSH
- počítačová simulace * MeSH
- programovací jazyk MeSH
- proteiny chemie klasifikace metabolismus MeSH
- software * MeSH
- systémová biologie * MeSH
- teoretické modely MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Single nucleotide variants represent a prevalent form of genetic variation. Mutations in the coding regions are frequently associated with the development of various genetic diseases. Computational tools for the prediction of the effects of mutations on protein function are very important for analysis of single nucleotide variants and their prioritization for experimental characterization. Many computational tools are already widely employed for this purpose. Unfortunately, their comparison and further improvement is hindered by large overlaps between the training datasets and benchmark datasets, which lead to biased and overly optimistic reported performances. In this study, we have constructed three independent datasets by removing all duplicities, inconsistencies and mutations previously used in the training of evaluated tools. The benchmark dataset containing over 43,000 mutations was employed for the unbiased evaluation of eight established prediction tools: MAPP, nsSNPAnalyzer, PANTHER, PhD-SNP, PolyPhen-1, PolyPhen-2, SIFT and SNAP. The six best performing tools were combined into a consensus classifier PredictSNP, resulting into significantly improved prediction performance, and at the same time returned results for all mutations, confirming that consensus prediction represents an accurate and robust alternative to the predictions delivered by individual tools. A user-friendly web interface enables easy access to all eight prediction tools, the consensus classifier PredictSNP and annotations from the Protein Mutant Database and the UniProt database. The web server and the datasets are freely available to the academic community at http://loschmidt.chemi.muni.cz/predictsnp.
- MeSH
- algoritmy MeSH
- databáze proteinů MeSH
- fylogeneze MeSH
- genetická variace MeSH
- genetické nemoci vrozené genetika MeSH
- genom lidský MeSH
- internet MeSH
- jednonukleotidový polymorfismus * MeSH
- lidé MeSH
- mutace * MeSH
- počítačová simulace MeSH
- software MeSH
- výpočetní biologie metody MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Tunnels in enzymes with buried active sites are key structural features allowing the entry of substrates and the release of products, thus contributing to the catalytic efficiency. Targeting the bottlenecks of protein tunnels is also a powerful protein engineering strategy. However, the identification of functional tunnels in multiple protein structures is a non-trivial task that can only be addressed computationally. We present a pipeline integrating automated structural analysis with an in-house machine-learning predictor for the annotation of protein pockets, followed by the calculation of the energetics of ligand transport via biochemically relevant tunnels. A thorough validation using eight distinct molecular systems revealed that CaverDock analysis of ligand un/binding is on par with time-consuming molecular dynamics simulations, but much faster. The optimized and validated pipeline was applied to annotate more than 17,000 cognate enzyme-ligand complexes. Analysis of ligand un/binding energetics indicates that the top priority tunnel has the most favourable energies in 75% of cases. Moreover, energy profiles of cognate ligands revealed that a simple geometry analysis can correctly identify tunnel bottlenecks only in 50% of cases. Our study provides essential information for the interpretation of results from tunnel calculation and energy profiling in mechanistic enzymology and protein engineering. We formulated several simple rules allowing identification of biochemically relevant tunnels based on the binding pockets, tunnel geometry, and ligand transport energy profiles.Scientific contributionsThe pipeline introduced in this work allows for the detailed analysis of a large set of protein-ligand complexes, focusing on transport pathways. We are introducing a novel predictor for determining the relevance of binding pockets for tunnel calculation. For the first time in the field, we present a high-throughput energetic analysis of ligand binding and unbinding, showing that approximate methods for these simulations can identify additional mutagenesis hotspots in enzymes compared to purely geometrical methods. The predictor is included in the supplementary material and can also be accessed at https://github.com/Faranehhad/Large-Scale-Pocket-Tunnel-Annotation.git . The tunnel data calculated in this study has been made publicly available as part of the ChannelsDB 2.0 database, accessible at https://channelsdb2.biodata.ceitec.cz/ .
- Publikační typ
- časopisecké články MeSH
Anterior gradient 2 (AGR2) is an endoplasmic reticulum (ER)-resident protein disulfide isomerase (PDI) known to be overexpressed in many human epithelial cancers and is involved in cell migration, cellular transformation, angiogenesis, and metastasis. This protein inhibits the activity of the tumor suppressor p53, and its expression levels can be used to predict cancer patient outcome. However, the precise network of AGR2-interacting partners and clients remains to be fully characterized. Herein, we used label-free quantification and also stable isotope labeling with amino acids in cell culture-based LC-MS/MS analyses to identify proteins interacting with AGR2. Functional annotation confirmed that AGR2 and its interaction partners are associated with processes in the ER that maintain intracellular metabolic homeostasis and participate in the unfolded protein response, including those associated with changes in cellular metabolism, energy, and redox states in response to ER stress. As a proof of concept, the interaction between AGR2 and PDIA3, another ER-resident PDI, was studied in more detail. Pathway analysis revealed that AGR2 and PDIA3 play roles in protein folding in ER, including post-translational modification and in cellular response to stress. We confirmed the AGR2-PDIA3 complex formation in cancer cells, which was enhanced in response to ER stress. Accordingly, molecular docking characterized potential quaternary structure of this complex; however, it remains to be elucidated whether AGR2 rather contributes to PDIA3 maturation in ER, the complex directly acts in cellular signaling, or mediates AGR2 secretion. Our study provides a comprehensive insight into the protein-protein interaction network of AGR2 by identifying functionally relevant proteins and related cellular and biochemical pathways associated with the role of AGR2 in cancer cells.
- MeSH
- chromatografie kapalinová MeSH
- lidé MeSH
- mapy interakcí proteinů MeSH
- mukoproteiny * metabolismus MeSH
- nádory * MeSH
- onkogenní proteiny * metabolismus MeSH
- proteindisulfidisomerasy * MeSH
- simulace molekulového dockingu MeSH
- tandemová hmotnostní spektrometrie MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
There is a lack of experimental electron ionization high-resolution mass spectra available to assist compound identification. The in silico generation of mass spectra by quantum chemistry can aid annotation workflows, in particular to support the identification of compounds that lack experimental reference spectra, such as environmental chemicals. We present an open-source, semiautomated workflow for the in silico prediction of electron ionization high-resolution mass spectra at 70 eV based on the QCxMS software. The workflow was applied to predict the spectra of 367 environmental chemicals, and the accuracy was evaluated by comparison to experimental reference spectra acquired. The molecular flexibility, number of rotatable bonds, and number of electronegative atoms of a compound were negatively correlated with prediction accuracy. Few analytes are predicted to sufficient accuracy for the direct application of predicted spectra in spectral matching workflows (overall average score 428). The m/z values of the top 5 most abundant ions of predicted spectra rarely match ions in experimental spectra, evidencing the disconnect between simulated fragmentation pathways and empirical reaction mechanisms.
- Publikační typ
- časopisecké články MeSH