Most cited article - PubMed ID 35687357
Non-covalent interactions atlas benchmark data sets 4: σ-hole interactions
Machine learning (ML) methods offer a promising route to the construction of universal molecular potentials with high accuracy and low computational cost. It is becoming evident that integrating physical principles into these models, or utilizing them in a Δ-ML scheme, significantly enhances their robustness and transferability. This paper introduces PM6-ML, a Δ-ML method that synergizes the semiempirical quantum-mechanical (SQM) method PM6 with a state-of-the-art ML potential applied as a universal correction. The method demonstrates superior performance over standalone SQM and ML approaches and covers a broader chemical space than its predecessors. It is scalable to systems with thousands of atoms, which makes it applicable to large biomolecular systems. Extensive benchmarking confirms PM6-ML's accuracy and robustness. Its practical application is facilitated by a direct interface to MOPAC. The code and parameters are available at https://github.com/Honza-R/mopac-ml.
- Publication type
- Journal Article MeSH
The use of quantum mechanical potentials in protein-ligand affinity prediction is becoming increasingly feasible with growing computational power. To move forward, validation of such potentials on real-world challenges is necessary. To this end, we have collated an extensive set of over a thousand galectin inhibitors with known affinities and docked them into galectin-3. The docked poses were then used to systematically evaluate several modern force fields and semiempirical quantum mechanical (SQM) methods up to the tight-binding level under consistent computational workflow. Implicit solvation models available with the tested methods were used to simulate solvation effects. Overall, the best methods in this study achieved a Pearson correlation of 0.7-0.8 between the computed and experimental affinities. There were differences between the tested methods in their ability to rank ligands across the entire ligand set as well as within subsets of structurally similar ligands. A major discrepancy was observed for a subset of ligands that bind to the protein via a halogen bond, which was clearly challenging for all the tested methods. The inclusion of an entropic term calculated by the rigid-rotor-harmonic-oscillator approximation at SQM level slightly worsened correlation with experiment but brought the calculated affinities closer to experimental values. We also found that the success of the prediction strongly depended on the solvation model. Furthermore, we provide an in-depth analysis of the individual energy terms and their effect on the overall prediction accuracy.
- MeSH
- Galectins * chemistry metabolism MeSH
- Protein Conformation MeSH
- Quantum Theory * MeSH
- Ligands MeSH
- Molecular Docking Simulation MeSH
- Thermodynamics MeSH
- Protein Binding MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Galectins * MeSH
- Ligands MeSH
Accurate estimation of protein-ligand binding affinity is the cornerstone of computer-aided drug design. We present a universal physics-based scoring function, named SQM2.20, addressing key terms of binding free energy using semiempirical quantum-mechanical computational methods. SQM2.20 incorporates the latest methodological advances while remaining computationally efficient even for systems with thousands of atoms. To validate it rigorously, we have compiled and made available the PL-REX benchmark dataset consisting of high-resolution crystal structures and reliable experimental affinities for ten diverse protein targets. Comparative assessments demonstrate that SQM2.20 outperforms other scoring methods and reaches a level of accuracy similar to much more expensive DFT calculations. In the PL-REX dataset, it achieves excellent correlation with experimental data (average R2 = 0.69) and exhibits consistent performance across all targets. In contrast to DFT, SQM2.20 provides affinity predictions in minutes, making it suitable for practical applications in hit identification or lead optimization.
- MeSH
- Ligands MeSH
- Proteins * metabolism MeSH
- Drug Design * MeSH
- Thermodynamics MeSH
- Protein Binding MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Ligands MeSH
- Proteins * MeSH
Accurate estimates of intermolecular interaction energy, ΔE, are crucial for modeling the properties of organic electronic materials and many other systems. For a diverse set of 50 dimers comprising up to 50 atoms (Set50-50, with 7 of its members being models of single-stacking junctions), benchmark ΔE data were compiled. They were obtained by the focal-point strategy, which involves computations using the canonical variant of the coupled cluster theory with singles, doubles, and perturbative triples [CCSD(T)] performed while applying a large basis set, along with extrapolations of the respective energy components to the complete basis set (CBS) limit. The resulting ΔE data were used to gauge the performance for the Set50-50 of several density-functional theory (DFT)-based approaches, and of one of the localized variants of the CCSD(T) method. This evaluation revealed that (1) the proposed "silver standard" approach, which employs the localized CCSD(T) method and CBS extrapolations, can be expected to provide accuracy better than two kJ/mol for absolute values of ΔE, and (2) from among the DFT techniques, computationally by far the cheapest approach (termed "ωB97X-3c/vDZP" by its authors) performed remarkably well. These findings are directly applicable in cost-effective yet reliable searches of the potential energy surfaces of noncovalent complexes.
- Keywords
- CCSD(T), DFT, interaction energy, noncovalent interactions, supramolecular junctions,
- MeSH
- Dimerization MeSH
- Electronics * MeSH
- Physical Phenomena MeSH
- Polymers MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Polymers MeSH
In this theoretical study, we set out to demonstrate the substitution effect of PEDOT analogues on planarity as an intrinsic indicator for electronic performance. We perform a quantum mechanical (DFT) study of PEDOT and analogous model systems and demonstrate the usefulness of the ωB97X-V functional to simulate chalcogen bonds and other noncovalent interactions. We confirm that the chalcogen bond stabilizes the planar conformation and further visualize its presence via the electrostatic potential surface. In comparison to the prevalent B3LYP, we gain 4-fold savings in computational time and simulate model systems of up to a dodecamer. Implications for design of conductive polymers can be drawn from the results, and an example for self-doped polymers is presented where modulation of the strength of the chalcogen bond plays a significant role.
- Publication type
- Journal Article MeSH
There has been a growing interest in quantitative predictions of the intermolecular binding energy of large complexes. One of the most important quantum chemical techniques capable of such predictions is the domain-based local pair natural orbital (DLPNO) scheme for the coupled cluster theory with singles, doubles, and iterative triples [CCSD(T)], whose results are extrapolated to the complete basis set (CBS) limit. Here, the DLPNO-based focal-point method is devised with the aim of obtaining CBS-extrapolated values that are very close to their canonical CCSD(T)/CBS counterparts, and thus may serve for routinely checking a performance of less expensive computational methods, for example, those based on the density-functional theory (DFT). The efficacy of this method is demonstrated for several sets of noncovalent complexes with varying amounts of the electrostatics, induction, and dispersion contributions to binding (as revealed by accurate DFT-based symmetry-adapted perturbation theory (SAPT) calculations). It is shown that when applied to dimeric models of poly(3-hydroxybutyrate) chains in its two polymorphic forms, the DLPNO-CCSD(T) and DFT-SAPT computational schemes agree to within about 2 kJ/mol of an absolute value of the interaction energy. These computational schemes thus should be useful for a reliable description of factors leading to the enthalpic stabilization of extended systems.
- Keywords
- CCSD(T), DFT-SAPT, DLPNO, intermolecular binding, noncovalent interactions,
- MeSH
- Cost-Benefit Analysis MeSH
- Quantum Theory * MeSH
- Static Electricity MeSH
- Density Functional Theory MeSH
- Thermodynamics MeSH
- Publication type
- Journal Article MeSH