-
Something wrong with this record ?
FAME 2: Simple and Effective Machine Learning Model of Cytochrome P450 Regioselectivity
M. Šícho, C. de Bruyn Kops, C. Stork, D. Svozil, J. Kirchmair,
Language English Country United States
Document type Journal Article, Research Support, Non-U.S. Gov't
- MeSH
- Software MeSH
- Stereoisomerism MeSH
- Machine Learning * MeSH
- Substrate Specificity MeSH
- Cytochrome P-450 Enzyme System metabolism MeSH
- Computational Biology methods MeSH
- Xenobiotics chemistry metabolism MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
We report on the further development of FAst MEtabolizer (FAME; J. Chem. Inf. MODEL: 2013, 53, 2896-2907), a collection of random forest models for the prediction of sites of metabolism (SoMs) of xenobiotics. A broad set of descriptors was explored, from simple 2D descriptors such as those used in FAME, to quantum chemical descriptors employed in some of the most accurate models for SoM prediction currently available. In line with the original FAME approach, our objective was to keep things simple and to come up with accurate and robust models that are based on a small number of 2D descriptors. We found that circular descriptions of atoms and their environments with such descriptors in combination with an extremely randomized trees algorithm can yield models that perform equally well compared to more complex approaches. Thorough evaluation experiments on an independent test set showed that the best of these models obtained a Matthews correlation coefficient, area under the receiver operating characteristic curve, and Top-2 accuracy of 0.57, 0.91 and 94.1%, respectively. Models for the prediction of isoform-specific regioselectivity of CYP 3A4, 2D6, and 2C9 were also developed and showed competitive performance. The best models have been integrated into a newly developed software package (FAME 2), which is available free of charge from the authors.
References provided by Crossref.org
- 000
- 00000naa a2200000 a 4500
- 001
- bmc18016440
- 003
- CZ-PrNML
- 005
- 20180518110500.0
- 007
- ta
- 008
- 180515s2017 xxu f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1021/acs.jcim.7b00250 $2 doi
- 035 __
- $a (PubMed)28782945
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a xxu
- 100 1_
- $a Šícho, Martin $u Faculty of Mathematics, Informatics and Natural Sciences, Department of Computer Science, Center for Bioinformatics, Universität Hamburg , Hamburg, 20146, Germany. CZ-OPENSCREEN: National Infrastructure for Chemical Biology, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague , 166 28 Prague 6, Czech Republic.
- 245 10
- $a FAME 2: Simple and Effective Machine Learning Model of Cytochrome P450 Regioselectivity / $c M. Šícho, C. de Bruyn Kops, C. Stork, D. Svozil, J. Kirchmair,
- 520 9_
- $a We report on the further development of FAst MEtabolizer (FAME; J. Chem. Inf. MODEL: 2013, 53, 2896-2907), a collection of random forest models for the prediction of sites of metabolism (SoMs) of xenobiotics. A broad set of descriptors was explored, from simple 2D descriptors such as those used in FAME, to quantum chemical descriptors employed in some of the most accurate models for SoM prediction currently available. In line with the original FAME approach, our objective was to keep things simple and to come up with accurate and robust models that are based on a small number of 2D descriptors. We found that circular descriptions of atoms and their environments with such descriptors in combination with an extremely randomized trees algorithm can yield models that perform equally well compared to more complex approaches. Thorough evaluation experiments on an independent test set showed that the best of these models obtained a Matthews correlation coefficient, area under the receiver operating characteristic curve, and Top-2 accuracy of 0.57, 0.91 and 94.1%, respectively. Models for the prediction of isoform-specific regioselectivity of CYP 3A4, 2D6, and 2C9 were also developed and showed competitive performance. The best models have been integrated into a newly developed software package (FAME 2), which is available free of charge from the authors.
- 650 _2
- $a výpočetní biologie $x metody $7 D019295
- 650 _2
- $a systém (enzymů) cytochromů P-450 $x metabolismus $7 D003577
- 650 12
- $a strojové učení $7 D000069550
- 650 _2
- $a software $7 D012984
- 650 _2
- $a stereoizomerie $7 D013237
- 650 _2
- $a substrátová specifita $7 D013379
- 650 _2
- $a xenobiotika $x chemie $x metabolismus $7 D015262
- 655 _2
- $a časopisecké články $7 D016428
- 655 _2
- $a práce podpořená grantem $7 D013485
- 700 1_
- $a de Bruyn Kops, Christina $u Faculty of Mathematics, Informatics and Natural Sciences, Department of Computer Science, Center for Bioinformatics, Universität Hamburg , Hamburg, 20146, Germany.
- 700 1_
- $a Stork, Conrad $u Faculty of Mathematics, Informatics and Natural Sciences, Department of Computer Science, Center for Bioinformatics, Universität Hamburg , Hamburg, 20146, Germany.
- 700 1_
- $a Svozil, Daniel $u CZ-OPENSCREEN: National Infrastructure for Chemical Biology, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague , 166 28 Prague 6, Czech Republic.
- 700 1_
- $a Kirchmair, Johannes $u Faculty of Mathematics, Informatics and Natural Sciences, Department of Computer Science, Center for Bioinformatics, Universität Hamburg , Hamburg, 20146, Germany.
- 773 0_
- $w MED00008945 $t Journal of chemical information and modeling $x 1549-960X $g Roč. 57, č. 8 (2017), s. 1832-1846
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/28782945 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y a $z 0
- 990 __
- $a 20180515 $b ABA008
- 991 __
- $a 20180518110638 $b ABA008
- 999 __
- $a ok $b bmc $g 1300064 $s 1013280
- BAS __
- $a 3
- BAS __
- $a PreBMC
- BMC __
- $a 2017 $b 57 $c 8 $d 1832-1846 $e 20170807 $i 1549-960X $m Journal of chemical information and modeling $n J Chem Inf Model $x MED00008945
- LZP __
- $a Pubmed-20180515