Nejvíce citovaný článek - PubMed ID 33620192
Non-Covalent Interactions Atlas Benchmark Data Sets 3: Repulsive Contacts
Single-structure scoring functions have been considered inferior to expensive ensemble free energy methods in predicting protein-ligand affinities. We are revisiting this dogma with the recently developed semiempirical quantum-mechanical (SQM)-based scoring function, SQM2.20, comparing its performance to the standard scoring functions on one hand and state-of-the-art molecular dynamics (MD)-based free-energy methods on the other hand. The comparison is conducted on a well-established Wang data set comprising eight protein targets with 200 ligands. The initial low correlation of SQM2.20 scores with the experimental binding affinities of R2 = 0.21 was improved to R2 = 0.47 by a systematic refinement of the input structures and omission of the ligand deformation energy. Consequently, SQM2.20 representing accurate single-structure scoring functions, exhibited an average performance comparable to that of MD-based methods (R2 = 0.52) and surpassed the performance of standard scoring functions (R2 = 0.26). The per-target analysis highlighted the pivotal role of high-quality input structures on the outcomes of single-structure methods. In the instances where such structures are available, SQM2.20 scoring has been shown to rival or even exceed MD-based methods in predicting protein-ligand binding affinities, while exhibiting significantly reduced computation time.
The quantitative characterization of residue contributions to protein-protein binding across extensive flexible interfaces poses a significant challenge for biophysical computations. It is attributable to the inherent imperfections in the experimental structures themselves, as well as to the lack of reliable computational tools for the evaluation of all types of noncovalent interactions. This study leverages recent advancements in semiempirical quantum-mechanical and implicit solvent approaches embodied in the PM6-D3H4S/COSMO2 method for the development of a hierarchical computational protocols encompassing molecular dynamics, fragmentation, and virtual glycine scan techniques for the investigation of flexible protein-protein interactions. As a model, the binding of insulin to its receptor is selected, a complex and dynamic process that has been extensively studied experimentally. The interaction energies calculated at the PM6-D3H4S/COSMO2 level in ten molecular dynamics snapshots did not correlate with molecular mechanics/generalized Born interaction energies because only the former method is able to describe nonadditive effects. This became evident by the examination of the energetics in small-model dimers featuring all the present types of noncovalent interactions with respect to DFT-D3 calculations. The virtual glycine scan has identified 15 hotspot residues on insulin and 15 on the insulin receptor, and their contributions have been quantified using PM6-D3H4S/COSMO2. The accuracy and credibility of the approach are further supported by the fact that all the insulin hotspots have previously been detected by biochemical and structural methods. The modular nature of the protocol has enabled the formulation of several variants, each tailored to specific accuracy and efficiency requirements. The developed computational strategy is firmly rooted in general biophysical chemistry and is thus offered as a general tool for the quantification of interactions across relevant flexible protein-protein interfaces.
- MeSH
- inzulin metabolismus chemie MeSH
- konformace proteinů MeSH
- lidé MeSH
- receptor inzulinu * chemie metabolismus MeSH
- simulace molekulární dynamiky * MeSH
- termodynamika MeSH
- vazba proteinů MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- inzulin MeSH
- receptor inzulinu * MeSH
Machine learning (ML) methods offer a promising route to the construction of universal molecular potentials with high accuracy and low computational cost. It is becoming evident that integrating physical principles into these models, or utilizing them in a Δ-ML scheme, significantly enhances their robustness and transferability. This paper introduces PM6-ML, a Δ-ML method that synergizes the semiempirical quantum-mechanical (SQM) method PM6 with a state-of-the-art ML potential applied as a universal correction. The method demonstrates superior performance over standalone SQM and ML approaches and covers a broader chemical space than its predecessors. It is scalable to systems with thousands of atoms, which makes it applicable to large biomolecular systems. Extensive benchmarking confirms PM6-ML's accuracy and robustness. Its practical application is facilitated by a direct interface to MOPAC. The code and parameters are available at https://github.com/Honza-R/mopac-ml.
- Publikační typ
- časopisecké články MeSH
Accurate estimation of protein-ligand binding affinity is the cornerstone of computer-aided drug design. We present a universal physics-based scoring function, named SQM2.20, addressing key terms of binding free energy using semiempirical quantum-mechanical computational methods. SQM2.20 incorporates the latest methodological advances while remaining computationally efficient even for systems with thousands of atoms. To validate it rigorously, we have compiled and made available the PL-REX benchmark dataset consisting of high-resolution crystal structures and reliable experimental affinities for ten diverse protein targets. Comparative assessments demonstrate that SQM2.20 outperforms other scoring methods and reaches a level of accuracy similar to much more expensive DFT calculations. In the PL-REX dataset, it achieves excellent correlation with experimental data (average R2 = 0.69) and exhibits consistent performance across all targets. In contrast to DFT, SQM2.20 provides affinity predictions in minutes, making it suitable for practical applications in hit identification or lead optimization.
- MeSH
- ligandy MeSH
- proteiny * metabolismus MeSH
- racionální návrh léčiv * MeSH
- termodynamika MeSH
- vazba proteinů MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- ligandy MeSH
- proteiny * MeSH
Accurate estimates of intermolecular interaction energy, ΔE, are crucial for modeling the properties of organic electronic materials and many other systems. For a diverse set of 50 dimers comprising up to 50 atoms (Set50-50, with 7 of its members being models of single-stacking junctions), benchmark ΔE data were compiled. They were obtained by the focal-point strategy, which involves computations using the canonical variant of the coupled cluster theory with singles, doubles, and perturbative triples [CCSD(T)] performed while applying a large basis set, along with extrapolations of the respective energy components to the complete basis set (CBS) limit. The resulting ΔE data were used to gauge the performance for the Set50-50 of several density-functional theory (DFT)-based approaches, and of one of the localized variants of the CCSD(T) method. This evaluation revealed that (1) the proposed "silver standard" approach, which employs the localized CCSD(T) method and CBS extrapolations, can be expected to provide accuracy better than two kJ/mol for absolute values of ΔE, and (2) from among the DFT techniques, computationally by far the cheapest approach (termed "ωB97X-3c/vDZP" by its authors) performed remarkably well. These findings are directly applicable in cost-effective yet reliable searches of the potential energy surfaces of noncovalent complexes.
- Klíčová slova
- CCSD(T), DFT, interaction energy, noncovalent interactions, supramolecular junctions,
- MeSH
- dimerizace MeSH
- elektronika * MeSH
- fyzikální jevy MeSH
- polymery MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- polymery MeSH
In this theoretical study, we set out to demonstrate the substitution effect of PEDOT analogues on planarity as an intrinsic indicator for electronic performance. We perform a quantum mechanical (DFT) study of PEDOT and analogous model systems and demonstrate the usefulness of the ωB97X-V functional to simulate chalcogen bonds and other noncovalent interactions. We confirm that the chalcogen bond stabilizes the planar conformation and further visualize its presence via the electrostatic potential surface. In comparison to the prevalent B3LYP, we gain 4-fold savings in computational time and simulate model systems of up to a dodecamer. Implications for design of conductive polymers can be drawn from the results, and an example for self-doped polymers is presented where modulation of the strength of the chalcogen bond plays a significant role.
- Publikační typ
- časopisecké články MeSH
There has been a growing interest in quantitative predictions of the intermolecular binding energy of large complexes. One of the most important quantum chemical techniques capable of such predictions is the domain-based local pair natural orbital (DLPNO) scheme for the coupled cluster theory with singles, doubles, and iterative triples [CCSD(T)], whose results are extrapolated to the complete basis set (CBS) limit. Here, the DLPNO-based focal-point method is devised with the aim of obtaining CBS-extrapolated values that are very close to their canonical CCSD(T)/CBS counterparts, and thus may serve for routinely checking a performance of less expensive computational methods, for example, those based on the density-functional theory (DFT). The efficacy of this method is demonstrated for several sets of noncovalent complexes with varying amounts of the electrostatics, induction, and dispersion contributions to binding (as revealed by accurate DFT-based symmetry-adapted perturbation theory (SAPT) calculations). It is shown that when applied to dimeric models of poly(3-hydroxybutyrate) chains in its two polymorphic forms, the DLPNO-CCSD(T) and DFT-SAPT computational schemes agree to within about 2 kJ/mol of an absolute value of the interaction energy. These computational schemes thus should be useful for a reliable description of factors leading to the enthalpic stabilization of extended systems.
- Klíčová slova
- CCSD(T), DFT-SAPT, DLPNO, intermolecular binding, noncovalent interactions,
- MeSH
- analýza nákladů a výnosů MeSH
- kvantová teorie * MeSH
- statická elektřina MeSH
- teorie funkcionálu hustoty MeSH
- termodynamika MeSH
- Publikační typ
- časopisecké články MeSH