JavaScript NENÍ povolen !

Prosím povolte JavaScript.

* Zobrazit nápovědu

Reset

Nejvíce citované: 30398643

13 citací v PubMed Filtry

Nejvíce citovaný článek - PubMed ID 30398643

Záznam nenalezen v Medvik. Navštivte PubMed 30398643

Článek

Generate what you can make: achieving in-house synthesizability with readily available resources in de novo drug design

Hassen, Alan Kai
Autor Hassen, Alan Kai ORCID Leiden Institute of Advanced Computer Science, Leiden University, Leiden, The Netherlands. a.k.hassen@liacs.leidenuniv.nl Machine Learning Research, Pfizer Research and Development, Berlin, Germany. a.k.hassen@liacs.leidenuniv.nl
Šícho, Martin
Autor Autorita ORCID Leiden Academic Centre of Drug Research, Leiden University, Leiden, The Netherlands CZ-OPENSCREEN: National Infrastructure for Chemical Biology, Department of Informatics and Chemistry, Faculty of Chemical Technolog, University of Chemistry and Technology Prague, Prague, Czech Republic
van Aalst, Yorick J
Autor van Aalst, Yorick J ORCID Leiden Academic Centre of Drug Research, Leiden University, Leiden, The Netherlands
Huizenga, Mirjam C W
Autor Huizenga, Mirjam C W ORCID Leiden Institute of Chemistry, Leiden University, Leiden, The Netherlands
Reynolds, Darcy N R
Autor Reynolds, Darcy N R Leiden Institute of Chemistry, Leiden University, Leiden, The Netherlands
Luukkonen, Sohvi
Autor Luukkonen, Sohvi ORCID Leiden Academic Centre of Drug Research, Leiden University, Leiden, The Netherlands
Bernatavicius, Andrius
Autor Bernatavicius, Andrius Leiden Institute of Advanced Computer Science, Leiden University, Leiden, The Netherlands Leiden Academic Centre of Drug Research, Leiden University, Leiden, The Netherlands
Clevert, Djork-Arné
Autor Clevert, Djork-Arné ORCID Machine Learning Research, Pfizer Research and Development, Berlin, Germany
Janssen, Antonius P A
Autor Janssen, Antonius P A ORCID Leiden Institute of Chemistry, Leiden University, Leiden, The Netherlands. a.p.a.janssen@lic.leidenuniv.nl
van Westen, Gerard J P
Autor van Westen, Gerard J P ORCID Leiden Academic Centre of Drug Research, Leiden University, Leiden, The Netherlands. gerard@lacdr.leidenuniv.nl

Journal of cheminformatics. 2025 Mar 28 ; 17 (1) : 41. [epub] 20250328

J Cheminform
ISSN 1758-2946
Zdroj

Computer-Aided Synthesis Planning (CASP) and CASP-based approximated synthesizability scores have rarely been used as generation objectives in Computer-Aided Drug Design despite facilitating the in-silico generation of synthesizable molecules. However, these synthesizability approaches are disconnected from the reality of small laboratory drug design, where building block resources are limited, thus making the notion of in-house synthesizability with already available resources highly desirable. In this work, we show a successful in-house de novo drug design workflow generating active and in-house synthesizable ligands of monoglyceride lipase (MGLL). First, we demonstrate the successful transfer of CASP from 17.4 million commercial building blocks to a small laboratory setting of roughly 6000 building blocks with only a decrease of -12% in CASP success when accepting two reaction-steps longer synthesis routes on average. Next, we present a rapidly retrainable in-house synthesizability score, successfully capturing our in-house synthesizability without relying on external building block resources. We show that including our in-house synthesizability score in a multi-objective de novo drug design workflow, alongside a simple QSAR model, provides thousands of potentially active and easily in-house synthesizable molecules. Finally, we experimentally evaluate the synthesis and biochemical activity of three de novo candidates using their CASP-suggested synthesis routes employing only in-house building blocks. We find one candidate with evident activity, suggesting potential new ligand ideas for MGLL inhibitors while showcasing the usefulness of our in-house synthesizability score for de novo drug design.Scientific contribution Our core scientific contribution is the introduction of in-house de novo drug design, which enables the practical application of generative methods in small laboratories by utilizing a limited stock of available building blocks. Our fast-to-adapt workflow for in-house synthesizability scoring requires minimal computational retraining costs while supporting a high diversity of generated structures. We highlight the practicality of our approach through a comprehensive in-vitro case study that relies entirely on in-house resources, including in-silico generation, synthesis planning, and activity evaluation.

Klíčová slova
Casp, Computer-aided synthesis planning, De novo drug design, In vitro, Medicinal chemistry, Retrosynthesis, Synthesizability, Synthesizability score, Virtual screening,
Publikační typ
časopisecké články MeSH

Článek

Genomics yields biological and phenotypic insights into bipolar disorder

Nature. 2025 Mar ; 639 (8056) : 968-975. [epub] 20250122

ISSN 1476-4687 | 0028-0836
Zdroj

Bipolar disorder is a leading contributor to the global burden of disease1. Despite high heritability (60-80%), the majority of the underlying genetic determinants remain unknown2. We analysed data from participants of European, East Asian, African American and Latino ancestries (n = 158,036 cases with bipolar disorder, 2.8 million controls), combining clinical, community and self-reported samples. We identified 298 genome-wide significant loci in the multi-ancestry meta-analysis, a fourfold increase over previous findings3, and identified an ancestry-specific association in the East Asian cohort. Integrating results from fine-mapping and other variant-to-gene mapping approaches identified 36 credible genes in the aetiology of bipolar disorder. Genes prioritized through fine-mapping were enriched for ultra-rare damaging missense and protein-truncating variations in cases with bipolar disorder4, highlighting convergence of common and rare variant signals. We report differences in the genetic architecture of bipolar disorder depending on the source of patient ascertainment and on bipolar disorder subtype (type I or type II). Several analyses implicate specific cell types in the pathophysiology of bipolar disorder, including GABAergic interneurons and medium spiny neurons. Together, these analyses provide additional insights into the genetic architecture and biological underpinnings of bipolar disorder.

MeSH
Asijci genetika MeSH
běloch MeSH
běloši genetika MeSH
bipolární porucha * genetika MeSH
celogenomová asociační studie * MeSH
černoši nebo Afroameričané genetika MeSH
fenotyp * MeSH
GABAergní neurony metabolismus MeSH
genetická predispozice k nemoci MeSH
genomika * MeSH
Hispánci a Latinoameričané genetika MeSH
lidé MeSH
Check Tag
lidé MeSH
mužské pohlaví MeSH
ženské pohlaví MeSH
Publikační typ
časopisecké články MeSH

Článek

QSPRpred: a Flexible Open-Source Quantitative Structure-Property Relationship Modelling Tool

Journal of cheminformatics. 2024 Nov 14 ; 16 (1) : 128. [epub] 20241114

J Cheminform
ISSN 1758-2946
Zdroj

Building reliable and robust quantitative structure-property relationship (QSPR) models is a challenging task. First, the experimental data needs to be obtained, analyzed and curated. Second, the number of available methods is continuously growing and evaluating different algorithms and methodologies can be arduous. Finally, the last hurdle that researchers face is to ensure the reproducibility of their models and facilitate their transferability into practice. In this work, we introduce QSPRpred, a toolkit for analysis of bioactivity data sets and QSPR modelling, which attempts to address the aforementioned challenges. QSPRpred's modular Python API enables users to intuitively describe different parts of a modelling workflow using a plethora of pre-implemented components, but also integrates customized implementations in a "plug-and-play" manner. QSPRpred data sets and models are directly serializable, which means they can be readily reproduced and put into operation after training as the models are saved with all required data pre-processing steps to make predictions on new compounds directly from SMILES strings. The general-purpose character of QSPRpred is also demonstrated by inclusion of support for multi-task and proteochemometric modelling. The package is extensively documented and comes with a large collection of tutorials to help new users. In this paper, we describe all of QSPRpred's functionalities and also conduct a small benchmarking case study to illustrate how different components can be leveraged to compare a diverse set of models. QSPRpred is fully open-source and available at https://github.com/CDDLeiden/QSPRpred .Scientific ContributionQSPRpred aims to provide a complex, but comprehensive Python API to conduct all tasks encountered in QSPR modelling from data preparation and analysis to model creation and model deployment. In contrast to similar packages, QSPRpred offers a wider and more exhaustive range of capabilities and integrations with many popular packages that also go beyond QSPR modelling. A significant contribution of QSPRpred is also in its automated and highly standardized serialization scheme, which significantly improves reproducibility and transferability of models.

Klíčová slova
Cheminformatics, Machine learning, Proteochemometrics, QSAR modelling, QSPR modelling, Software,
Publikační typ
časopisecké články MeSH

Článek

Crystal Clear: Decoding Isocyanide Intermolecular Interactions through Crystallography

The Journal of organic chemistry. 2024 Jan 19 ; 89 (2) : 957-974. [epub] 20240104

J Org Chem
ISSN 1520-6904 | 0022-3263
Zdroj

The isocyanide group is the chameleon among the functional groups in organic chemistry. Unlike other multiatom functional groups, where the electrophilic and nucleophilic moieties are typically separated, isocyanides combine both functionalities in the terminal carbon. This unique feature can be rationalized using the frontier orbital concept and has significant implications for its intermolecular interactions and the reactivity of the functional group. In this study, we perform a Cambridge Crystallographic Database-supported analysis of isocyanide intramolecular interactions to investigate the intramolecular interactions of isocyanides in the solid state, excluding isocyanide-metal complexes. We discuss examples of different interaction classes, including the isocyanide as a hydrogen bond acceptor (RNC···HX), halogen bonding (RNC···X), and interactions involving the isocyanide and carbon atoms (RNC···C). The latter interaction serves as an intriguing illustration of a Bürgi-Dunitz trajectory and represents a crucial experimental detail in the well-known multicomponent reactions such as the Ugi- and Passerini-type mechanisms. Understanding the spectrum of intramolecular interactions that isocyanides can undergo holds significant implications in fields such as medicinal chemistry, materials science, and asymmetric catalysis.

Publikační typ
časopisecké články MeSH

Článek

Machine Learning-Assisted Recurrence Prediction for Patients With Early-Stage Non-Small-Cell Lung Cancer

JCO clinical cancer informatics. 2023 Jul ; 7 () : e2200062.

JCO Clin Cancer Inform
ISSN 2473-4276
Zdroj

PURPOSE: Stratifying patients with cancer according to risk of relapse can personalize their care. In this work, we provide an answer to the following research question: How to use machine learning to estimate probability of relapse in patients with early-stage non-small-cell lung cancer (NSCLC)? MATERIALS AND METHODS: For predicting relapse in 1,387 patients with early-stage (I-II) NSCLC from the Spanish Lung Cancer Group data (average age 65.7 years, female 24.8%, male 75.2%), we train tabular and graph machine learning models. We generate automatic explanations for the predictions of such models. For models trained on tabular data, we adopt SHapley Additive exPlanations local explanations to gauge how each patient feature contributes to the predicted outcome. We explain graph machine learning predictions with an example-based method that highlights influential past patients. RESULTS: Machine learning models trained on tabular data exhibit a 76% accuracy for the random forest model at predicting relapse evaluated with a 10-fold cross-validation (the model was trained 10 times with different independent sets of patients in test, train, and validation sets, and the reported metrics are averaged over these 10 test sets). Graph machine learning reaches 68% accuracy over a held-out test set of 200 patients, calibrated on a held-out set of 100 patients. CONCLUSION: Our results show that machine learning models trained on tabular and graph data can enable objective, personalized, and reproducible prediction of relapse and, therefore, disease outcome in patients with early-stage NSCLC. With further prospective and multisite validation, and additional radiological and molecular data, this prognostic model could potentially serve as a predictive decision support tool for deciding the use of adjuvant treatments in early-stage lung cancer.

MeSH
lidé MeSH
lokální recidiva nádoru diagnóza MeSH
nádory plic * diagnóza terapie MeSH
nemalobuněčný karcinom plic * diagnóza terapie MeSH
prognóza MeSH
senioři MeSH
strojové učení MeSH
Check Tag
lidé MeSH
mužské pohlaví MeSH
senioři MeSH
ženské pohlaví MeSH
Publikační typ
časopisecké články MeSH
práce podpořená grantem MeSH

Článek

Tetracycline-modifying enzyme SmTetX from Stenotrophomonas maltophilia

Acta crystallographica. Section F, Structural biology communications. 2023 Jul 01 ; 79 (Pt 7) : 180-192. [epub] 20230705

Acta Crystallogr F Struct Biol Commun
ISSN 2053-230X
Zdroj

The resistance of the emerging human pathogen Stenotrophomonas maltophilia to tetracycline antibiotics mainly depends on multidrug efflux pumps and ribosomal protection enzymes. However, the genomes of several strains of this Gram-negative bacterium code for a FAD-dependent monooxygenase (SmTetX) homologous to tetracycline destructases. This protein was recombinantly produced and its structure and function were investigated. Activity assays using SmTetX showed its ability to modify oxytetracycline with a catalytic rate comparable to those of other destructases. SmTetX shares its fold with the tetracycline destructase TetX from Bacteroides thetaiotaomicron; however, its active site possesses an aromatic region that is unique in this enzyme family. A docking study confirmed tetracycline and its analogues to be the preferred binders amongst various classes of antibiotics.

Klíčová slova
FAD-dependent monooxygenases, antibiotic resistance, tetracycline,
MeSH
antibakteriální látky farmakologie chemie MeSH
krystalografie rentgenová MeSH
lidé MeSH
mikrobiální testy citlivosti MeSH
oxytetracyklin * metabolismus MeSH
Stenotrophomonas maltophilia * genetika metabolismus MeSH
tetracyklin farmakologie metabolismus MeSH
Check Tag
lidé MeSH
Publikační typ
časopisecké články MeSH
Názvy látek
antibakteriální látky MeSH
oxytetracyklin * MeSH
tetracyklin MeSH

Článek

A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL

Journal of cheminformatics. 2023 Jun 20 ; 15 (1) : 61. [epub] 20230620

J Cheminform
ISSN 1758-2946
Zdroj

Current biological and chemical research is increasingly dependent on the reusability of previously acquired data, which typically come from various sources. Consequently, there is a growing need for database systems and databases stored in them to be interoperable with each other. One of the possible solutions to address this issue is to use systems based on Semantic Web technologies, namely on the Resource Description Framework (RDF) to express data and on the SPARQL query language to retrieve the data. Many existing biological and chemical databases are stored in the form of a relational database (RDB). Converting a relational database into the RDF form and storing it in a native RDF database system may not be desirable in many cases. It may be necessary to preserve the original database form, and having two versions of the same data may not be convenient. A solution may be to use a system mapping the relational database to the RDF form. Such a system keeps data in their original relational form and translates incoming SPARQL queries to equivalent SQL queries, which are evaluated by a relational-database system. This review compares different RDB-to-RDF mapping systems with a primary focus on those that can be used free of charge. In addition, it compares different approaches to expressing RDB-to-RDF mappings. The review shows that these systems represent a viable method providing sufficient performance. Their real-life performance is demonstrated on data and queries coming from the neXtProt project.

Klíčová slova
RDB-to-RDF mapping, Relational database, Resource Description Framework, SPARQL,
Publikační typ
časopisecké články MeSH
přehledy MeSH

Článek

Minimal information for chemosensitivity assays (MICHA): a next-generation pipeline to enable the FAIRification of drug screening experiments

Briefings in bioinformatics. 2022 Jan 17 ; 23 (1) : .

Brief Bioinform
ISSN 1477-4054 | 1467-5463
Zdroj

Chemosensitivity assays are commonly used for preclinical drug discovery and clinical trial optimization. However, data from independent assays are often discordant, largely attributed to uncharacterized variation in the experimental materials and protocols. We report here the launching of Minimal Information for Chemosensitivity Assays (MICHA), accessed via https://micha-protocol.org. Distinguished from existing efforts that are often lacking support from data integration tools, MICHA can automatically extract publicly available information to facilitate the assay annotation including: 1) compounds, 2) samples, 3) reagents and 4) data processing methods. For example, MICHA provides an integrative web server and database to obtain compound annotation including chemical structures, targets and disease indications. In addition, the annotation of cell line samples, assay protocols and literature references can be greatly eased by retrieving manually curated catalogues. Once the annotation is complete, MICHA can export a report that conforms to the FAIR principle (Findable, Accessible, Interoperable and Reusable) of drug screening studies. To consolidate the utility of MICHA, we provide FAIRified protocols from five major cancer drug screening studies as well as six recently conducted COVID-19 studies. With the MICHA web server and database, we envisage a wider adoption of a community-driven effort to improve the open access of drug sensitivity assays.

Klíčová slova
FAIR research data, data integration tools, drug discovery, drug sensitivity assays,
Publikační typ
časopisecké články MeSH

Článek

GenUI: interactive and extensible open source software platform for de novo molecular generation and cheminformatics

Journal of cheminformatics. 2021 Sep 25 ; 13 (1) : 73. [epub] 20210925

J Cheminform
ISSN 1758-2946
Zdroj

Many contemporary cheminformatics methods, including computer-aided de novo drug design, hold promise to significantly accelerate and reduce the cost of drug discovery. Thanks to this attractive outlook, the field has thrived and in the past few years has seen an especially significant growth, mainly due to the emergence of novel methods based on deep neural networks. This growth is also apparent in the development of novel de novo drug design methods with many new generative algorithms now available. However, widespread adoption of new generative techniques in the fields like medicinal chemistry or chemical biology is still lagging behind the most recent developments. Upon taking a closer look, this fact is not surprising since in order to successfully integrate the most recent de novo drug design methods in existing processes and pipelines, a close collaboration between diverse groups of experimental and theoretical scientists needs to be established. Therefore, to accelerate the adoption of both modern and traditional de novo molecular generators, we developed Generator User Interface (GenUI), a software platform that makes it possible to integrate molecular generators within a feature-rich graphical user interface that is easy to use by experts of diverse backgrounds. GenUI is implemented as a web service and its interfaces offer access to cheminformatics tools for data preprocessing, model building, molecule generation, and interactive chemical space visualization. Moreover, the platform is easy to extend with customizable frontend React.js components and backend Python extensions. GenUI is open source and a recently developed de novo molecular generator, DrugEx, was integrated as a proof of principle. In this work, we present the architecture and implementation details of GenUI and discuss how it can facilitate collaboration in the disparate communities interested in de novo molecular generation and computer-aided drug discovery.

Klíčová slova
De novo drug design, Deep learning, Graphical user interface, Molecule generation, Web application,
Publikační typ
časopisecké články MeSH

Článek

Will the chemical probes please stand up?

RSC medicinal chemistry. 2021 Aug 18 ; 12 (8) : 1428-1441. [epub] 20210716

RSC Med Chem
ISSN 2632-8682
Zdroj

In 2005, the NIH Molecular Libraries Program (MLP) undertook the identification of tool compounds to expand biological insights, now termed small-molecule chemical probes. This inspired other organisations to initiate similar efforts from 2010 onwards. As a central focus of the Probes & Drugs portal (P&D), we have standardised, integrated and compared sets of declared probe compounds harvested from 12 different sources. This turned out to be challenging and revealed unexpected anomalies. Results in this work address key questions including; a) individual and total structure counts, b) overlaps between sources, c) comparisons with selected PubChem sources and d) investigating the probe coverage of druggable targets. In addition, we developed new high-level scoring schemes to filter collections down to probes of higher quality. This generated 548 high-quality chemical probes (HQCP) covering 447 distinct protein targets. This HQCP collection has been added to the P&D portal and will be regularly updated as established sources expand and new ones release data.

Publikační typ
časopisecké články MeSH

* Zobrazit nápovědu

Upřesnit dle MeSH