FAME 2: Simple and Effective Machine Learning Model of Cytochrome P450 Regioselectivity

M. Šícho, C. de Bruyn Kops, C. Stork, D. Svozil, J. Kirchmair

Journal of Chemical Information and Modeling. 2017; 57 (8): 1832-1846. [pub] 20170807

Language English Country United States

Document type Journal Article, Research Support, Non-U.S. Gov't

We report on the further development of FAst MEtabolizer (FAME; J. Chem. Inf. Model. 2013, 53, 2896-2907), a collection of random forest models for the prediction of sites of metabolism (SoMs) of xenobiotics. A broad set of descriptors was explored, from simple 2D descriptors such as those used in FAME, to quantum chemical descriptors employed in some of the most accurate models for SoM prediction currently available. In line with the original FAME approach, our objective was to keep things simple and to come up with accurate and robust models that are based on a small number of 2D descriptors. We found that circular descriptions of atoms and their environments with such descriptors in combination with an extremely randomized trees algorithm can yield models that perform equally well compared to more complex approaches. Thorough evaluation experiments on an independent test set showed that the best of these models obtained a Matthews correlation coefficient, area under the receiver operating characteristic curve, and Top-2 accuracy of 0.57, 0.91 and 94.1%, respectively. Models for the prediction of isoform-specific regioselectivity of CYP 3A4, 2D6, and 2C9 were also developed and showed competitive performance. The best models have been integrated into a newly developed software package (FAME 2), which is available free of charge from the authors.

000      
00000naa a2200000 a 4500
001      
bmc18016440
003      
CZ-PrNML
005      
20180518110500.0
007      
ta
008      
180515s2017 xxu f 000 0|eng||
009      
AR
024    7_
$a 10.1021/acs.jcim.7b00250 $2 doi
035    __
$a (PubMed)28782945
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a xxu
100    1_
$a Šícho, Martin $u Faculty of Mathematics, Informatics and Natural Sciences, Department of Computer Science, Center for Bioinformatics, Universität Hamburg, Hamburg, 20146, Germany. CZ-OPENSCREEN: National Infrastructure for Chemical Biology, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague, 166 28 Prague 6, Czech Republic.
245    10
$a FAME 2: Simple and Effective Machine Learning Model of Cytochrome P450 Regioselectivity / $c M. Šícho, C. de Bruyn Kops, C. Stork, D. Svozil, J. Kirchmair
520    9_
$a We report on the further development of FAst MEtabolizer (FAME; J. Chem. Inf. Model. 2013, 53, 2896-2907), a collection of random forest models for the prediction of sites of metabolism (SoMs) of xenobiotics. A broad set of descriptors was explored, from simple 2D descriptors such as those used in FAME, to quantum chemical descriptors employed in some of the most accurate models for SoM prediction currently available. In line with the original FAME approach, our objective was to keep things simple and to come up with accurate and robust models that are based on a small number of 2D descriptors. We found that circular descriptions of atoms and their environments with such descriptors in combination with an extremely randomized trees algorithm can yield models that perform equally well compared to more complex approaches. Thorough evaluation experiments on an independent test set showed that the best of these models obtained a Matthews correlation coefficient, area under the receiver operating characteristic curve, and Top-2 accuracy of 0.57, 0.91 and 94.1%, respectively. Models for the prediction of isoform-specific regioselectivity of CYP 3A4, 2D6, and 2C9 were also developed and showed competitive performance. The best models have been integrated into a newly developed software package (FAME 2), which is available free of charge from the authors.
650    _2
$a Computational Biology $x methods $7 D019295
650    _2
$a Cytochrome P-450 Enzyme System $x metabolism $7 D003577
650    12
$a Machine Learning $7 D000069550
650    _2
$a Software $7 D012984
650    _2
$a Stereoisomerism $7 D013237
650    _2
$a Substrate Specificity $7 D013379
650    _2
$a Xenobiotics $x chemistry $x metabolism $7 D015262
655    _2
$a Journal Article $7 D016428
655    _2
$a Research Support, Non-U.S. Gov't $7 D013485
700    1_
$a de Bruyn Kops, Christina $u Faculty of Mathematics, Informatics and Natural Sciences, Department of Computer Science, Center for Bioinformatics, Universität Hamburg, Hamburg, 20146, Germany.
700    1_
$a Stork, Conrad $u Faculty of Mathematics, Informatics and Natural Sciences, Department of Computer Science, Center for Bioinformatics, Universität Hamburg, Hamburg, 20146, Germany.
700    1_
$a Svozil, Daniel $u CZ-OPENSCREEN: National Infrastructure for Chemical Biology, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague, 166 28 Prague 6, Czech Republic.
700    1_
$a Kirchmair, Johannes $u Faculty of Mathematics, Informatics and Natural Sciences, Department of Computer Science, Center for Bioinformatics, Universität Hamburg, Hamburg, 20146, Germany.
773    0_
$w MED00008945 $t Journal of chemical information and modeling $x 1549-960X $g Vol. 57, No. 8 (2017), pp. 1832-1846
856    41
$u https://pubmed.ncbi.nlm.nih.gov/28782945 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y a $z 0
990    __
$a 20180515 $b ABA008
991    __
$a 20180518110638 $b ABA008
999    __
$a ok $b bmc $g 1300064 $s 1013280
BAS    __
$a 3
BAS    __
$a PreBMC
BMC    __
$a 2017 $b 57 $c 8 $d 1832-1846 $e 20170807 $i 1549-960X $m Journal of chemical information and modeling $n J Chem Inf Model $x MED00008945
LZP    __
$a Pubmed-20180515
