Most cited article - PubMed ID 28949517
Empirical Self-Consistent Correction for the Description of Hydrogen Bonds in DFTB3
Machine learning (ML) methods offer a promising route to the construction of universal molecular potentials with high accuracy and low computational cost. It is becoming evident that integrating physical principles into these models, or utilizing them in a Δ-ML scheme, significantly enhances their robustness and transferability. This paper introduces PM6-ML, a Δ-ML method that synergizes the semiempirical quantum-mechanical (SQM) method PM6 with a state-of-the-art ML potential applied as a universal correction. The method demonstrates superior performance over standalone SQM and ML approaches and covers a broader chemical space than its predecessors. It is scalable to systems with thousands of atoms, which makes it applicable to large biomolecular systems. Extensive benchmarking confirms PM6-ML's accuracy and robustness. Its practical application is facilitated by a direct interface to MOPAC. The code and parameters are available at https://github.com/Honza-R/mopac-ml.
- Publication type
- Journal Article MeSH
The use of quantum mechanical potentials in protein-ligand affinity prediction is becoming increasingly feasible with growing computational power. To move forward, validation of such potentials on real-world challenges is necessary. To this end, we have collated an extensive set of over a thousand galectin inhibitors with known affinities and docked them into galectin-3. The docked poses were then used to systematically evaluate several modern force fields and semiempirical quantum mechanical (SQM) methods up to the tight-binding level under consistent computational workflow. Implicit solvation models available with the tested methods were used to simulate solvation effects. Overall, the best methods in this study achieved a Pearson correlation of 0.7-0.8 between the computed and experimental affinities. There were differences between the tested methods in their ability to rank ligands across the entire ligand set as well as within subsets of structurally similar ligands. A major discrepancy was observed for a subset of ligands that bind to the protein via a halogen bond, which was clearly challenging for all the tested methods. The inclusion of an entropic term calculated by the rigid-rotor-harmonic-oscillator approximation at SQM level slightly worsened correlation with experiment but brought the calculated affinities closer to experimental values. We also found that the success of the prediction strongly depended on the solvation model. Furthermore, we provide an in-depth analysis of the individual energy terms and their effect on the overall prediction accuracy.
- MeSH
- Galectins * metabolism chemistry antagonists & inhibitors MeSH
- Quantum Theory * MeSH
- Ligands MeSH
- Molecular Docking Simulation MeSH
- Protein Binding MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Galectins * MeSH
- Ligands MeSH
A seventh blind test of crystal structure prediction has been organized by the Cambridge Crystallographic Data Centre. The results are presented in two parts, with this second part focusing on methods for ranking crystal structures in order of stability. The exercise involved standardized sets of structures seeded from a range of structure generation methods. Participants from 22 groups applied several periodic DFT-D methods, machine learned potentials, force fields derived from empirical data or quantum chemical calculations, and various combinations of the above. In addition, one non-energy-based scoring function was used. Results showed that periodic DFT-D methods overall agreed with experimental data within expected error margins, while one machine learned model, applying system-specific AIMnet potentials, agreed with experiment in many cases demonstrating promise as an efficient alternative to DFT-based methods. For target XXXII, a consensus was reached across periodic DFT methods, with consistently high predicted energies of experimental forms relative to the global minimum (above 4 kJ mol-1 at both low and ambient temperatures) suggesting a more stable polymorph is likely not yet observed. The calculation of free energies at ambient temperatures offered improvement of predictions only in some cases (for targets XXVII and XXXI). Several avenues for future research have been suggested, highlighting the need for greater efficiency considering the vast amounts of resources utilized in many cases.
- Keywords
- Cambridge Structural Database, blind test, crystal structure prediction, lattice energy, polymorphism,
- Publication type
- Journal Article MeSH
Molecular electronics promises the ultimate level of miniaturization of computers and other machines as organic molecules are the smallest known physical objects with nontrivial structure and function. But despite the plethora of molecular switches, memories, and motors developed during the almost 50-years long history of molecular electronics, mass production of molecular computers is still an elusive goal. This is mostly due to the lack of scalable nanofabrication methods capable of rapidly producing complex structures (similar to silicon chips or living cells) with atomic precision and a small number of defects. Living nature solves this problem by using linear polymer templates encoding large volumes of structural information into sequence of hydrogen bonded end groups which can be efficiently replicated and which can drive assembly of other molecular components into complex supramolecular structures. In this paper, we propose a nanofabrication method based on a class of photosensitive polymers inspired by these natural principles, which can operate in concert with UV photolithography used for fabrication of current microelectronic processors. We believe that such a method will enable a smooth transition from silicon toward molecular nanoelectronics and photonics. To demonstrate its feasibility, we performed a computational screening of candidate molecules that can selectively bind and therefore allow the deterministic assembly of molecular components. In the process, we unearthed trends and design principles applicable beyond the immediate scope of our proposed nanofabrication method, e.g., to biologically relevant DNA analogues and molecular recognition within hydrogen-bonded systems.
- Keywords
- DNA analogue, ab initio calculations, computational screening, hydrogen bonded system, molecular electronics, nanofabrication, self-assembly,
- Publication type
- Journal Article MeSH