• Something wrong with this record ?

Predicting pK(a) values of substituted phenols from atomic charges: comparison of different quantum mechanical methods and charge distribution schemes

R. Svobodová Vareková, S. Geidl, CM. Ionescu, O. Skrehota, M. Kudera, D. Sehnal, T. Bouchal, R. Abagyan, HJ. Huber, J. Koca,

. 2011 ; 51 (8) : 1795-1806. [pub] 20110801

Language English Country United States

Document type Journal Article

The acid dissociation (ionization) constant pK(a) is one of the fundamental properties of organic molecules. We have evaluated different computational strategies and models to predict the pK(a) values of substituted phenols using partial atomic charges. Partial atomic charges for 124 phenol molecules were calculated using 83 approaches containing seven theory levels (MP2, HF, B3LYP, BLYP, BP86, AM1, and PM3), three basis sets (6-31G*, 6-311G, STO-3G), and five population analyses (MPA, NPA, Hirshfeld, MK, and Löwdin). The correlations between pK(a) and various atomic charge descriptors were examined, and the best descriptors were selected for preparing the quantitative structure-property relationship (QSPR) models. One QSPR model was created for each of the 83 approaches to charge calculation, and then the accuracy of all these models was analyzed and compared. The pK(a)s predicted by most of the models correlate strongly with experimental pK(a) values. For example, more than 25% of the models have correlation coefficients (R²) greater than 0.95 and root-mean-square errors smaller than 0.49. All seven examined theory levels are applicable for pK(a) prediction from charges. The best results were obtained for the MP2 and HF level of theory. The most suitable basis set was found to be 6-31G*. The 6-311G basis set provided slightly weaker correlations, and unexpectedly also, the STO-3G basis set is applicable for the QSPR modeling of pK(a). The Mulliken, natural, and Löwdin population analyses provide accurate models for all tested theory levels and basis sets. The results provided by the Hirshfeld population analysis were also acceptable, but the QSPR models based on MK charges show only weak correlations.

References provided by Crossref.org

000      
00000naa a2200000 a 4500
001      
bmc12022354
003      
CZ-PrNML
005      
20170410095208.0
007      
ta
008      
120806s2011 xxu f 000 0#eng||
009      
AR
024    7_
$a 10.1021/ci200133w $2 doi
035    __
$a (PubMed)21761919
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a xxu
100    1_
$a Svobodová Vareková, Radka $u National Centre for Biomolecular Research, Faculty of Science and CEITEC - Central European Institute of Technology, Masaryk University Brno, Brno-Bohunice, Czech Republic.
245    10
$a Predicting pK(a) values of substituted phenols from atomic charges: comparison of different quantum mechanical methods and charge distribution schemes / $c R. Svobodová Vareková, S. Geidl, CM. Ionescu, O. Skrehota, M. Kudera, D. Sehnal, T. Bouchal, R. Abagyan, HJ. Huber, J. Koca,
520    9_
$a The acid dissociation (ionization) constant pK(a) is one of the fundamental properties of organic molecules. We have evaluated different computational strategies and models to predict the pK(a) values of substituted phenols using partial atomic charges. Partial atomic charges for 124 phenol molecules were calculated using 83 approaches containing seven theory levels (MP2, HF, B3LYP, BLYP, BP86, AM1, and PM3), three basis sets (6-31G*, 6-311G, STO-3G), and five population analyses (MPA, NPA, Hirshfeld, MK, and Löwdin). The correlations between pK(a) and various atomic charge descriptors were examined, and the best descriptors were selected for preparing the quantitative structure-property relationship (QSPR) models. One QSPR model was created for each of the 83 approaches to charge calculation, and then the accuracy of all these models was analyzed and compared. The pK(a)s predicted by most of the models correlate strongly with experimental pK(a) values. For example, more than 25% of the models have correlation coefficients (R²) greater than 0.95 and root-mean-square errors smaller than 0.49. All seven examined theory levels are applicable for pK(a) prediction from charges. The best results were obtained for the MP2 and HF level of theory. The most suitable basis set was found to be 6-31G*. The 6-311G basis set provided slightly weaker correlations, and unexpectedly also, the STO-3G basis set is applicable for the QSPR modeling of pK(a). The Mulliken, natural, and Löwdin population analyses provide accurate models for all tested theory levels and basis sets. The results provided by the Hirshfeld population analysis were also acceptable, but the QSPR models based on MK charges show only weak correlations.
650    _2
$a farmaceutická chemie $x metody $x statistika a číselné údaje $7 D002626
650    _2
$a počítačová simulace $7 D003198
650    _2
$a kinetika $7 D007700
650    _2
$a chemické modely $7 D008956
650    _2
$a statistické modely $7 D015233
650    _2
$a molekulární konformace $7 D008968
650    _2
$a léčivé přípravky $x analýza $x chemie $7 D004364
650    _2
$a fenoly $x analýza $x chemie $7 D010636
650    _2
$a kvantitativní vztahy mezi strukturou a aktivitou $7 D021281
650    _2
$a kvantová teorie $7 D011789
650    _2
$a statická elektřina $7 D055672
655    _2
$a časopisecké články $7 D016428
700    1_
$a Geidl, Stanislav
700    1_
$a Ionescu, Crina-Maria
700    1_
$a Skrehota, Ondrej
700    1_
$a Kudera, Michal
700    1_
$a Sehnal, David
700    1_
$a Bouchal, Tomás
700    1_
$a Abagyan, Ruben $7 gn_A_00000163
700    1_
$a Huber, Heinrich J
700    1_
$a Koca, Jaroslav
773    0_
$w MED00008945 $t Journal of chemical information and modeling $x 1549-960X $g Roč. 51, č. 8 (2011), s. 1795-1806
856    41
$u https://pubmed.ncbi.nlm.nih.gov/21761919 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y m $z 0
990    __
$a 20120806 $b ABA008
991    __
$a 20170410095505 $b ABA008
999    __
$a ok $b bmc $g 944267 $s 779651
BAS    __
$a 3
BAS    __
$a PreBMC
BMC    __
$a 2011 $b 51 $c 8 $d 1795-1806 $e 20110801 $i 1549-960X $m Journal of chemical information and modeling $n J Chem Inf Model $x MED00008945
LZP    __
$a Pubmed-20120806/12/01

Find record

Citation metrics

Loading data ...

Archiving options

Loading data ...