Predicting pK(a) values of substituted phenols from atomic charges: comparison of different quantum mechanical methods and charge distribution schemes
Language: English; Country: United States; Media: print-electronic
Document type: Journal Article
PubMed: 21761919
DOI: 10.1021/ci200133w
MeSH
- Models, Chemical
- Chemistry, Pharmaceutical: methods; statistics & numerical data
- Phenols: analysis; chemistry
- Kinetics
- Quantitative Structure-Activity Relationship
- Quantum Theory
- Pharmaceutical Preparations: analysis; chemistry
- Molecular Conformation
- Computer Simulation
- Static Electricity
- Models, Statistical
Publication type
- Journal Article
Names of Substances
- Phenols
- Pharmaceutical Preparations
The acid dissociation (ionization) constant pK(a) is one of the fundamental properties of organic molecules. We evaluated different computational strategies and models for predicting the pK(a) values of substituted phenols from partial atomic charges. Partial atomic charges for 124 phenol molecules were calculated using 83 approaches that combine seven theory levels (MP2, HF, B3LYP, BLYP, BP86, AM1, and PM3), three basis sets (6-31G*, 6-311G, and STO-3G), and five population analyses (MPA, NPA, Hirshfeld, MK, and Löwdin). The correlations between pK(a) and various atomic charge descriptors were examined, and the best descriptors were selected for building quantitative structure-property relationship (QSPR) models. One QSPR model was created for each of the 83 charge calculation approaches, and the accuracy of all these models was then analyzed and compared. The pK(a) values predicted by most of the models correlate strongly with experimental pK(a) values. For example, more than 25% of the models have squared correlation coefficients (R²) greater than 0.95 and root-mean-square errors smaller than 0.49 pK(a) units. All seven examined theory levels are applicable for pK(a) prediction from charges. The best results were obtained at the MP2 and HF levels of theory. The most suitable basis set was 6-31G*. The 6-311G basis set provided slightly weaker correlations, and, unexpectedly, even the STO-3G basis set is applicable for QSPR modeling of pK(a). The Mulliken (MPA), natural (NPA), and Löwdin population analyses provide accurate models for all tested theory levels and basis sets. The results provided by the Hirshfeld population analysis were also acceptable, but the QSPR models based on MK charges show only weak correlations.
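As an illustration of the kind of QSPR model the abstract describes, the following minimal sketch fits a multilinear regression of pK(a) on atomic charge descriptors and reports R² and the root-mean-square error. Everything specific in it is an assumption made for the example, not taken from the record: the choice of three descriptors (charges on the hydroxyl hydrogen, the phenolic oxygen, and the attached carbon), the function names, the coefficients, and the charge values, which are made-up placeholders rather than computed or experimental data. The target values are generated from the placeholder coefficients, so the fit is exact by construction and says nothing about real accuracy.

```python
import math

def fit_linear(X, y):
    """Ordinary least squares via the normal equations.
    Returns [intercept, coef_1, ..., coef_k]."""
    rows = [[1.0] + list(x) for x in X]          # prepend intercept column
    n = len(rows[0])
    # Augmented normal-equation matrix [A^T A | A^T y]
    M = [[sum(r[i] * r[j] for r in rows) for j in range(n)]
         + [sum(r[i] * yi for r, yi in zip(rows, y))]
         for i in range(n)]
    # Gauss-Jordan elimination with partial pivoting
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("descriptors are linearly dependent")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [a - f * b for a, b in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]

def predict(coefs, x):
    return coefs[0] + sum(c * v for c, v in zip(coefs[1:], x))

def r_squared(y, y_hat):
    mean_y = sum(y) / len(y)
    ss_res = sum((a - b) ** 2 for a, b in zip(y, y_hat))
    ss_tot = sum((a - mean_y) ** 2 for a in y)
    return 1.0 - ss_res / ss_tot

def rmse(y, y_hat):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y, y_hat)) / len(y))

# Placeholder descriptors (q_H, q_O, q_C1) for six hypothetical phenols.
# These numbers are invented for illustration only.
charges = [
    (0.40, -0.70, 0.30),
    (0.45, -0.60, 0.35),
    (0.50, -0.65, 0.25),
    (0.42, -0.75, 0.40),
    (0.48, -0.55, 0.28),
    (0.55, -0.62, 0.33),
]

# Hypothetical model coefficients used only to generate synthetic targets.
true_coefs = [5.0, -20.0, -10.0, 8.0]
pka_synthetic = [predict(true_coefs, q) for q in charges]

coefs = fit_linear(charges, pka_synthetic)
pka_fitted = [predict(coefs, q) for q in charges]
r2 = r_squared(pka_synthetic, pka_fitted)
err = rmse(pka_synthetic, pka_fitted)

print("fitted coefficients:", [round(c, 4) for c in coefs])
print("R^2:", round(r2, 6), "RMSE:", round(err, 6))
```

In a study of the kind summarized above, one such model would be fitted per charge calculation approach, with the synthetic targets replaced by experimental pK(a) values and the placeholder charges by those computed with the given theory level, basis set, and population analysis; the resulting R² and RMSE values would then be compared across approaches.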