Nejvíce citovaný článek - PubMed ID 26587619
Machine learning (ML) methods offer a promising route to the construction of universal molecular potentials with high accuracy and low computational cost. It is becoming evident that integrating physical principles into these models, or utilizing them in a Δ-ML scheme, significantly enhances their robustness and transferability. This paper introduces PM6-ML, a Δ-ML method that synergizes the semiempirical quantum-mechanical (SQM) method PM6 with a state-of-the-art ML potential applied as a universal correction. The method demonstrates superior performance over standalone SQM and ML approaches and covers a broader chemical space than its predecessors. It is scalable to systems with thousands of atoms, which makes it applicable to large biomolecular systems. Extensive benchmarking confirms PM6-ML's accuracy and robustness. Its practical application is facilitated by a direct interface to MOPAC. The code and parameters are available at https://github.com/Honza-R/mopac-ml.
- Publikační typ
- časopisecké články MeSH
The ISWI family protein SMARCA5 contains the ATP-binding pocket that coordinates the catalytic Mg2+ ion and water molecules for ATP hydrolysis. In this study, we demonstrate that SMARCA5 can also possess an alternative metal-binding ability. First, we isolated SMARCA5 on the cobalt column (IMAC) to near homogeneity. Examination of the interactions of SMARCA5 with metal-chelating supports showed that, apart from Co2+, it binds to Cu2+, Zn2+ and Ni2+. The efficiency of the binding to the last-listed metal was influenced by the chelating ligand, resulting in a strong preference for Ni-NTA over the Ni-CM-Asp equivalent. To gain insight in the preferential affinity for the Ni-NTA ligand, QM calculations were performed on model systems and metal-ligand complexes with a limited protein fragment of SMARCA5 containing the double-histidine (dHis) motif. The calculations correlated the observed affinity with the relative stability of the d-block metals to tetradentate ligand coordination over tridentate, as well as their overall octahedral coordination capacity. Likewise, binding free energies derived from model imidazole complexes mirrored the observed Ni-NTA/Ni-CM-Asp preferential affinity. Finally, similar calculations on complexes with a SMARCA5 peptide fragment derived from the AlphaFold structural prediction, captured almost accurately the expected relative stability of the TM complexes, and produced a large energetic separation (~10 kcal∙mol-1) between Ni-NTA and Ni-CM-Asp in favour of the former.
- MeSH
- adenosintrifosfatasy MeSH
- chromozomální proteiny, nehistonové metabolismus chemie MeSH
- kovy chemie metabolismus MeSH
- kvantová teorie MeSH
- lidé MeSH
- ligandy MeSH
- molekulární modely MeSH
- restrukturace chromatinu MeSH
- vazba proteinů * MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- adenosintrifosfatasy MeSH
- chromozomální proteiny, nehistonové MeSH
- kovy MeSH
- ligandy MeSH
- SMARCA5 protein, human MeSH Prohlížeč
Long-range non-covalent interactions play a key role in the chemistry of natural polyphenols. We have previously proposed a description of supramolecular polyphenol complexes by the B3P86 density functional coupled with some corrections for dispersion. We couple here the B3P86 functional with the D3 correction for dispersion, assessing systematically the accuracy of the new B3P86-D3 model using for that the well-known S66, HB23, NCCE31, and S12L datasets for non-covalent interactions. Furthermore, the association energies of these complexes were carefully compared to those obtained by other dispersion-corrected functionals, such as B(3)LYP-D3, BP86-D3 or B3P86-NL. Finally, this set of models were also applied to a database composed of seven non-covalent polyphenol complexes of the most interest. Graphical abstract Weakly bound natural polyphenolsᅟ.
- Klíčová slova
- DFT-D, Natural polyphenols, Non-covalent interactions,
- MeSH
- chemické databáze MeSH
- molekulární modely MeSH
- molekulární struktura MeSH
- polyfenoly chemie MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- polyfenoly MeSH
We evaluate the performance of the most widely used wavefunction, density functional theory, and semiempirical methods for the description of noncovalent interactions in a set of larger, mostly dispersion-stabilized noncovalent complexes (the L7 data set). The methods tested include MP2, MP3, SCS-MP2, SCS(MI)-MP2, MP2.5, MP2.X, MP2C, DFT-D, DFT-D3 (B3-LYP-D3, B-LYP-D3, TPSS-D3, PW6B95-D3, M06-2X-D3) and M06-2X, and semiempirical methods augmented with dispersion and hydrogen bonding corrections: SCC-DFTB-D, PM6-D, PM6-DH2 and PM6-D3H4. The test complexes are the octadecane dimer, the guanine trimer, the circumcoronene…adenine dimer, the coronene dimer, the guanine-cytosine dimer, the circumcoronene…guanine-cytosine dimer, and an amyloid fragment trimer containing phenylalanine residues. The best performing method is MP2.5 with relative root mean square deviation (rRMSD) of 4 %. It can thus be recommended as an alternative to the CCSD(T)/CBS (alternatively QCISD(T)/CBS) benchmark for molecular systems which exceed current computational capacity. The second best non-DFT method is MP2C with rRMSD of 8 %. A method with the most favorable "accuracy/cost" ratio belongs to the DFT family: BLYP-D3, with an rRMSD of 8 %. Semiempirical methods deliver less accurate results (the rRMSD exceeds 25 %). Nevertheless, their absolute errors are close to some much more expensive methods such as M06-2X, MP2 or SCS(MI)-MP2, and thus their price/performance ratio is excellent.
- Publikační typ
- časopisecké články MeSH