JavaScript NENÍ povolen !

Prosím povolte JavaScript.

29285887 OR Hit Dexter A Machine-Learning Model for the Prediction of Frequent Hitters Dotaz Zobrazit nápovědu

Přesná shoda Sémantické

Reset

2 záznamů v BMČ

Článek online

Hit Dexter: A Machine-Learning Model for the Prediction of Frequent Hitters

... False-positive assay readouts caused by badly behaving compounds-frequent hitters, pan-assay interference ...

Stork, Conrad
Autor Stork, Conrad Center for Bioinformatics, Universität Hamburg, Bundesstraße 43, 20146, Hamburg, Germany
Wagner, Johannes
Autor Wagner, Johannes Center for Bioinformatics, Universität Hamburg, Bundesstraße 43, 20146, Hamburg, Germany
Friedrich, Nils-Ole
Autor Friedrich, Nils-Ole Center for Bioinformatics, Universität Hamburg, Bundesstraße 43, 20146, Hamburg, Germany
de Bruyn Kops, Christina
Autor de Bruyn Kops, Christina Center for Bioinformatics, Universität Hamburg, Bundesstraße 43, 20146, Hamburg, Germany
Šícho, Martin
Autor Šícho, Martin Center for Bioinformatics, Universität Hamburg, Bundesstraße 43, 20146, Hamburg, Germany. National Infrastructure for Chemical Biology, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague, 166 28, Prague 6, Czech Republic
Kirchmair, Johannes
Autor Kirchmair, Johannes Center for Bioinformatics, Universität Hamburg, Bundesstraße 43, 20146, Hamburg, Germany

ChemMedChem. 2018 ; 13 (6) : 564-571. [pub] 20180201

ISSN 1860-7187
Medvik
Zdroj

False-positive assay readouts caused by badly behaving compounds-frequent hitters, pan-assay interference compounds (PAINS), aggregators, and others-continue to pose a major challenge to experimental screening. There are only a few in silico methods that allow the prediction of such problematic compounds. We report the development of Hit Dexter, two extremely randomized trees classifiers for the prediction of compounds likely to trigger positive assay readouts either by true promiscuity or by assay interference. The models were trained on a well-prepared dataset extracted from the PubChem Bioassay database, consisting of approximately 311 000 compounds tested for activity on at least 50 proteins. Hit Dexter reached MCC and AUC values of up to 0.67 and 0.96 on an independent test set, respectively. The models are expected to be of high value, in particular to medicinal chemists and biochemists who can use Hit Dexter to identify compounds for which extra caution should be exercised with positive assay readouts. Hit Dexter is available as a free web service at http://hitdexter.zbh. uni-hamburg.de.

MeSH
databáze faktografické MeSH
falešně pozitivní reakce MeSH
knihovny malých molekul chemie farmakologie MeSH
počítačová simulace MeSH
rychlé screeningové testy metody MeSH
strojové učení * MeSH
Publikační typ
časopisecké články MeSH
práce podpořená grantem MeSH

Článek

Hit Dexter 2.0: Machine-Learning Models for the Prediction of Frequent Hitters

... Hit Dexter is a recently introduced machine learning approach that predicts frequent hitters independent ...

Journal of chemical information and modeling. 2019 ; 59 (3) : 1030-1043. [pub] 20190125

J Chem Inf Model
ISSN 1549-960X
Medvik
Zdroj

Assay interference caused by small molecules continues to pose a significant challenge for early drug discovery. A number of rule-based and similarity-based approaches have been derived that allow the flagging of potentially "badly behaving compounds", "bad actors", or "nuisance compounds". These compounds are typically aggregators, reactive compounds, and/or pan-assay interference compounds (PAINS), and many of them are frequent hitters. Hit Dexter is a recently introduced machine learning approach that predicts frequent hitters independent of the underlying physicochemical mechanisms (including also the binding of compounds based on "privileged scaffolds" to multiple binding sites). Here we report on the development of a second generation of machine learning models which now covers both primary screening assays and confirmatory dose-response assays. Protein sequence clustering was newly introduced to minimize the overrepresentation of structurally and functionally related proteins. The models correctly classified compounds of large independent test sets as (highly) promiscuous or nonpromiscuous with Matthews correlation coefficient (MCC) values of up to 0.64 and area under the receiver operating characteristic curve (AUC) values of up to 0.96. The models were also utilized to characterize sets of compounds with specific biological and physicochemical properties, such as dark chemical matter, aggregators, compounds from a high-throughput screening library, drug-like compounds, approved drugs, potential PAINS, and natural products. Among the most interesting outcomes is that the new Hit Dexter models predict the presence of large fractions of (highly) promiscuous compounds among approved drugs. Importantly, predictions of the individual Hit Dexter models are generally in good agreement and consistent with those of Badapple, an established statistical model for the prediction of frequent hitters. The new Hit Dexter 2.0 web service, available at http://hitdexter2.zbh.uni-hamburg.de , not only provides user-friendly access to all machine learning models presented in this work but also to similarity-based methods for the prediction of aggregators and dark chemical matter as well as a comprehensive collection of available rule sets for flagging frequent hitters and compounds including undesired substructures.

Kolekce

Publikováno

Filtry

29285887 OR Hit Dexter A Machine-Learning Model for the Prediction of Frequent Hitters Dotaz Zobrazit nápovědu

Přesná shoda Sémantické

29285887 OR Hit Dexter A Machine-Learning Model for the Prediction of Frequent Hitters Dotaz Zobrazit nápovědu Přesná shoda Sémantické

Upřesnit dle MeSH

29285887 OR Hit Dexter A Machine-Learning Model for the Prediction of Frequent Hitters Dotaz Zobrazit nápovědu

Přesná shoda Sémantické