• Je něco špatně v tomto záznamu ?

How Does the Methodology of 3D Structure Preparation Influence the Quality of pKa Prediction

S. Geidl, R. Svobodová Vařeková, V. Bendová, L. Petrusek, CM. Ionescu, Z. Jurka, R. Abagyan, J. Koča,

. 2015 ; 55 (6) : 1088-97. [pub] 20150611

Jazyk angličtina Země Spojené státy americké

Typ dokumentu časopisecké články, Research Support, N.I.H., Extramural, práce podpořená grantem

Perzistentní odkaz   https://www.medvik.cz/link/bmc16010155

The acid dissociation constant is an important molecular property, and it can be successfully predicted by Quantitative Structure-Property Relationship (QSPR) models, even for in silico designed molecules. We analyzed how the methodology of in silico 3D structure preparation influences the quality of QSPR models. Specifically, we evaluated and compared QSPR models based on six different 3D structure sources (DTP NCI, Pubchem, Balloon, Frog2, OpenBabel, and RDKit) combined with four different types of optimization. These analyses were performed for three classes of molecules (phenols, carboxylic acids, anilines), and the QSPR model descriptors were quantum mechanical (QM) and empirical partial atomic charges. Specifically, we developed 516 QSPR models and afterward systematically analyzed the influence of the 3D structure source and other factors on their quality. Our results confirmed that QSPR models based on partial atomic charges are able to predict pKa with high accuracy. We also confirmed that ab initio and semiempirical QM charges provide very accurate QSPR models and using empirical charges based on electronegativity equalization is also acceptable, as well as advantageous, because their calculation is very fast. On the other hand, Gasteiger-Marsili empirical charges are not applicable for pKa prediction. We later found that QSPR models for some classes of molecules (carboxylic acids) are less accurate. In this context, we compared the influence of different 3D structure sources. We found that an appropriate selection of 3D structure source and optimization method is essential for the successful QSPR modeling of pKa. Specifically, the 3D structures from the DTP NCI and Pubchem databases performed the best, as they provided very accurate QSPR models for all the tested molecular classes and charge calculation approaches, and they do not require optimization. Also, Frog2 performed very well. Other 3D structure sources can also be used but are not so robust, and an unfortunate combination of molecular class and charge calculation approach can produce weak QSPR models. Additionally, these 3D structures generally need optimization in order to produce good quality QSPR models.

Citace poskytuje Crossref.org

000      
00000naa a2200000 a 4500
001      
bmc16010155
003      
CZ-PrNML
005      
20160414123928.0
007      
ta
008      
160408s2015 xxu f 000 0|eng||
009      
AR
024    7_
$a 10.1021/ci500758w $2 doi
024    7_
$a 10.1021/ci500758w $2 doi
035    __
$a (PubMed)26010215
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a xxu
100    1_
$a Geidl, Stanislav $u †National Centre for Biomolecular Research, Faculty of Science, and CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic.
245    10
$a How Does the Methodology of 3D Structure Preparation Influence the Quality of pKa Prediction / $c S. Geidl, R. Svobodová Vařeková, V. Bendová, L. Petrusek, CM. Ionescu, Z. Jurka, R. Abagyan, J. Koča,
520    9_
$a The acid dissociation constant is an important molecular property, and it can be successfully predicted by Quantitative Structure-Property Relationship (QSPR) models, even for in silico designed molecules. We analyzed how the methodology of in silico 3D structure preparation influences the quality of QSPR models. Specifically, we evaluated and compared QSPR models based on six different 3D structure sources (DTP NCI, Pubchem, Balloon, Frog2, OpenBabel, and RDKit) combined with four different types of optimization. These analyses were performed for three classes of molecules (phenols, carboxylic acids, anilines), and the QSPR model descriptors were quantum mechanical (QM) and empirical partial atomic charges. Specifically, we developed 516 QSPR models and afterward systematically analyzed the influence of the 3D structure source and other factors on their quality. Our results confirmed that QSPR models based on partial atomic charges are able to predict pKa with high accuracy. We also confirmed that ab initio and semiempirical QM charges provide very accurate QSPR models and using empirical charges based on electronegativity equalization is also acceptable, as well as advantageous, because their calculation is very fast. On the other hand, Gasteiger-Marsili empirical charges are not applicable for pKa prediction. We later found that QSPR models for some classes of molecules (carboxylic acids) are less accurate. In this context, we compared the influence of different 3D structure sources. We found that an appropriate selection of 3D structure source and optimization method is essential for the successful QSPR modeling of pKa. Specifically, the 3D structures from the DTP NCI and Pubchem databases performed the best, as they provided very accurate QSPR models for all the tested molecular classes and charge calculation approaches, and they do not require optimization. Also, Frog2 performed very well. Other 3D structure sources can also be used but are not so robust, and an unfortunate combination of molecular class and charge calculation approach can produce weak QSPR models. Additionally, these 3D structures generally need optimization in order to produce good quality QSPR models.
650    _2
$a počítačová simulace $7 D003198
650    _2
$a racionální návrh léčiv $7 D015195
650    12
$a molekulární modely $7 D008958
650    12
$a molekulární konformace $7 D008968
650    12
$a chemické jevy $7 D055598
650    12
$a kvantitativní vztahy mezi strukturou a aktivitou $7 D021281
650    _2
$a kvantová teorie $7 D011789
655    _2
$a časopisecké články $7 D016428
655    _2
$a Research Support, N.I.H., Extramural $7 D052061
655    _2
$a práce podpořená grantem $7 D013485
700    1_
$a Svobodová Vařeková, Radka $u †National Centre for Biomolecular Research, Faculty of Science, and CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic.
700    1_
$a Bendová, Veronika $u †National Centre for Biomolecular Research, Faculty of Science, and CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic.
700    1_
$a Petrusek, Lukáš $u †National Centre for Biomolecular Research, Faculty of Science, and CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic.
700    1_
$a Ionescu, Crina-Maria $u †National Centre for Biomolecular Research, Faculty of Science, and CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic.
700    1_
$a Jurka, Zdeněk $u †National Centre for Biomolecular Research, Faculty of Science, and CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic.
700    1_
$a Abagyan, Ruben $u ‡Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, 9500 Gilman Drive, MC 0657, San Diego, California 92161, United States. $7 gn_A_00000163
700    1_
$a Koča, Jaroslav $u †National Centre for Biomolecular Research, Faculty of Science, and CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic.
773    0_
$w MED00008945 $t Journal of chemical information and modeling $x 1549-960X $g Roč. 55, č. 6 (2015), s. 1088-97
856    41
$u https://pubmed.ncbi.nlm.nih.gov/26010215 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y a $z 0
990    __
$a 20160408 $b ABA008
991    __
$a 20160414124013 $b ABA008
999    __
$a ok $b bmc $g 1113584 $s 934523
BAS    __
$a 3
BAS    __
$a PreBMC
BMC    __
$a 2015 $b 55 $c 6 $d 1088-97 $e 20150611 $i 1549-960X $m Journal of chemical information and modeling $n J Chem Inf Model $x MED00008945
LZP    __
$a Pubmed-20160408

Najít záznam

Citační ukazatele

Nahrávání dat ...

    Možnosti archivace