Most cited article - PubMed ID 11749516
Structure, energetics, and dynamics of the nucleic Acid base pairs: nonempirical ab initio calculations
Tautomerism of nucleic acid (NA) bases is a crucial factor for the maintenance and translation of genetic information in organisms. Only canonical tautomers of NA bases can form hydrogen-bonded complexes with their natural counterparts. On the other hand, rare tautomers of nucleobases have been proposed to be involved in processes catalysed by NA enzymes. Isocytosine, which can be considered as a structural fragment of guanine, is known to have two stable tautomers both in solution and solid states. The tautomer equilibrium of isocytosine contrasts with the remarkable stability of the canonical tautomer of guanine. This paper investigates the factors contributing to the stability of the canonical tautomer of guanine by a combination of NMR experiments and theoretical calculations. The electronic effects of substituents on the stability of the rare tautomers of isocytosine and guanine derivatives are studied by density functional theory (DFT) calculations. Selected derivatives are studied by variable-temperature NMR spectroscopy. Rare tautomers can be stabilised in solution by intermolecular hydrogen-bonding interactions with suitable partners. These intermolecular interactions give rise to characteristic signals in proton NMR spectra, which make it possible to undoubtedly confirm the presence of a rare tautomer.
- Keywords
- DFT calculations, NMR spectroscopy, nucleic acids, tautomerism,
- MeSH
- Cytosine analogs & derivatives chemistry MeSH
- Dimerization MeSH
- DNA chemistry MeSH
- Electrons MeSH
- Guanine chemistry MeSH
- Quantum Theory MeSH
- Magnetic Resonance Spectroscopy methods MeSH
- Macromolecular Substances MeSH
- Normal Distribution MeSH
- Reproducibility of Results MeSH
- Stereoisomerism MeSH
- Thermodynamics MeSH
- Hydrogen chemistry MeSH
- Hydrogen Bonding MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Cytosine MeSH
- DNA MeSH
- Guanine MeSH
- isocytosine MeSH Browser
- Macromolecular Substances MeSH
- Hydrogen MeSH
We evaluate the performance of the most widely used wavefunction, density functional theory, and semiempirical methods for the description of noncovalent interactions in a set of larger, mostly dispersion-stabilized noncovalent complexes (the L7 data set). The methods tested include MP2, MP3, SCS-MP2, SCS(MI)-MP2, MP2.5, MP2.X, MP2C, DFT-D, DFT-D3 (B3-LYP-D3, B-LYP-D3, TPSS-D3, PW6B95-D3, M06-2X-D3) and M06-2X, and semiempirical methods augmented with dispersion and hydrogen bonding corrections: SCC-DFTB-D, PM6-D, PM6-DH2 and PM6-D3H4. The test complexes are the octadecane dimer, the guanine trimer, the circumcoronene…adenine dimer, the coronene dimer, the guanine-cytosine dimer, the circumcoronene…guanine-cytosine dimer, and an amyloid fragment trimer containing phenylalanine residues. The best performing method is MP2.5 with relative root mean square deviation (rRMSD) of 4 %. It can thus be recommended as an alternative to the CCSD(T)/CBS (alternatively QCISD(T)/CBS) benchmark for molecular systems which exceed current computational capacity. The second best non-DFT method is MP2C with rRMSD of 8 %. A method with the most favorable "accuracy/cost" ratio belongs to the DFT family: BLYP-D3, with an rRMSD of 8 %. Semiempirical methods deliver less accurate results (the rRMSD exceeds 25 %). Nevertheless, their absolute errors are close to some much more expensive methods such as M06-2X, MP2 or SCS(MI)-MP2, and thus their price/performance ratio is excellent.
- Publication type
- Journal Article MeSH
The hairpin ribozyme is a prominent member of small ribozymes since it does not require metal ions to achieve catalysis. Guanine 8 (G8) and adenine 38 (A38) have been identified as key participants in self-cleavage and -ligation. We have carried out hybrid quantum-mechanical/molecular mechanical (QM/MM) calculations to evaluate the energy along several putative reaction pathways. The error of our DFT description of the QM region was tested and shown to be ~1 kcal/mol. We find that self-cleavage of the hairpin ribozyme may follow several competing microscopic reaction mechanisms, all with calculated activation barriers in good agreement with those from experiment (20-21 kcal/mol). The initial nucleophilic attack of the A-1(2'-OH) group on the scissile phosphate is predicted to be rate-limiting in all these mechanisms. An unprotonated G8(-) (together with A38H(+)) yields a feasible activation barrier (20.4 kcal/mol). Proton transfer to a nonbridging phosphate oxygen also leads to feasible reaction pathways. Finally, our calculations consider thio-substitutions of one or both nonbridging oxygens of the scissile phosphate and predict that they have only a negligible effect on the reaction barrier, as observed experimentally.
- MeSH
- Catalysis MeSH
- Quantum Theory * MeSH
- Oxygen chemistry MeSH
- Protons MeSH
- RNA, Catalytic chemistry metabolism MeSH
- Molecular Dynamics Simulation * MeSH
- Thermodynamics MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH
- Names of Substances
- hairpin ribozyme MeSH Browser
- Oxygen MeSH
- Protons MeSH
- RNA, Catalytic MeSH
In this feature article, we provide a side-by-side introduction for two research fields: quantum chemical calculations of molecular interaction in nucleic acids and RNA structural bioinformatics. Our main aim is to demonstrate that these research areas, while largely separated in contemporary literature, have substantial potential to complement each other that could significantly contribute to our understanding of the exciting world of nucleic acids. We identify research questions amenable to the combined application of modern ab initio methods and bioinformatics analysis of experimental structures while also assessing the limitations of these approaches. The ultimate aim is to attain valuable physicochemical insights regarding the nature of the fundamental molecular interactions and how they shape RNA structures, dynamics, function, and evolution.
- MeSH
- Nucleic Acid Conformation MeSH
- Quantum Theory * MeSH
- Nucleic Acids chemistry MeSH
- RNA chemistry MeSH
- Computational Biology * MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, N.I.H., Extramural MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH
- Names of Substances
- Nucleic Acids MeSH
- RNA MeSH
We describe a novel, fundamental property of nucleobase structure, namely, pyramidilization at the N1/9 sites of purine and pyrimidine bases. Through a combined analyses of ultra-high-resolution X-ray structures of both oligonucleotides extracted from the Nucleic Acid Database and isolated nucleotides and nucleosides from the Cambridge Structural Database, together with a series of quantum chemical calculations, molecular dynamics (MD) simulations, and published solution nuclear magnetic resonance (NMR) data, we show that pyramidilization at the glycosidic nitrogen is an intrinsic property. This property is common to isolated nucleosides and nucleotides as well as oligonucleotides-it is also common to both RNA and DNA. Our analysis suggests that pyramidilization at N1/9 sites depends in a systematic way on the local structure of the nucleoside. Of note, the pyramidilization undergoes stereo-inversion upon reorientation of the glycosidic bond. The extent of the pyramidilization is further modulated by the conformation of the sugar ring. The observed pyramidilization is more pronounced for purine bases, while for pyrimidines it is negligible. We discuss how the assumption of nucleic acid base planarity can lead to systematic errors in determining the conformation of nucleotides from experimental data and from unconstrained MD simulations.
- MeSH
- Deoxyadenosines chemistry MeSH
- Deoxycytidine chemistry MeSH
- Nitrogen chemistry MeSH
- Crystallography, X-Ray MeSH
- Nuclear Magnetic Resonance, Biomolecular MeSH
- Oligonucleotides chemistry MeSH
- Computer Simulation MeSH
- Purine Nucleosides chemistry MeSH
- Purine Nucleotides chemistry MeSH
- Purines chemistry MeSH
- Pyrimidine Nucleosides chemistry MeSH
- Pyrimidine Nucleotides chemistry MeSH
- Pyrimidines chemistry MeSH
- Carbohydrates chemistry MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- 2'-deoxyadenosine MeSH Browser
- Deoxyadenosines MeSH
- Deoxycytidine MeSH
- Nitrogen MeSH
- Oligonucleotides MeSH
- Purine Nucleosides MeSH
- Purine Nucleotides MeSH
- Purines MeSH
- Pyrimidine Nucleosides MeSH
- Pyrimidine Nucleotides MeSH
- Pyrimidines MeSH
- Carbohydrates MeSH
Guanine-adenine (GA) base pairs play important roles in determining the structure, dynamics, and stability of RNA. In RNA internal loops, GA base pairs often occur in tandem arrangements and their structure is context and sequence dependent. Calculations reported here test the thermodynamic integration (TI) approach with the amber99 force field by comparing computational predictions of free energy differences with the free energy differences expected on the basis of NMR determined structures of the RNA motifs (5'-GCGGACGC-3')(2), (5'-GCiGGAiCGC-3')(2), (5'-GGCGAGCC-3')(2), and (5'-GGiCGAiGCC-3')(2). Here, iG and iC denote isoguanosine and isocytidine, which have amino and carbonyl groups transposed relative to guanosine and cytidine. The NMR structures show that the GA base pairs adopt either imino (cis Watson-Crick/Watson-Crick A-G) or sheared (trans Hoogsteen/Sugar edge A-G) conformations depending on the identity and orientation of the adjacent base pair. A new mixing function for the TI method is developed that allows alchemical transitions in which atoms can disappear in both the initial and final states. Unrestrained calculations gave DeltaG degrees values 2-4 kcal/mol different from expectations based on NMR data. Restraining the structures with hydrogen bond restraints did not improve the predictions. Agreement with NMR data was improved by 0.7 to 1.5 kcal/mol, however, when structures were restrained with weak positional restraints to sample around the experimentally determined NMR structures. The amber99 force field was modified to partially include pyramidalization effects of the unpaired amino group of guanosine in imino GA base pairs. This provided little or no improvement in comparisons with experiment. The marginal improvement is observed when the structure has potential cross-strand out-of-plane hydrogen bonding with the G amino group. The calculations using positional restraints and a nonplanar amino group reproduce the signs of DeltaG degrees from the experimental results and are, thus, capable of providing useful qualitative insights complementing the NMR experiments. Decomposition of the terms in the calculations reveals that the dominant terms are from electrostatic and interstrand interactions other than hydrogen bonds in the base pairs. The results suggest that a better description of the backbone is key to reproducing the experimental free energy results with computational free energy predictions.
- Publication type
- Journal Article MeSH
The formation of a cation-stabilized guanine quadruplex (G-DNA) stem is an exceptionally slow process involving complex kinetics that has not yet been characterized at atomic resolution. Here, we investigate the formation of a parallel stranded G-DNA stem consisting of four strands of d(GGGG) using molecular dynamics simulations with explicit inclusion of counterions and solvent. Due to the limitations imposed by the nanosecond timescale of the simulations, rather than watching for the spontaneous formation of G-DNA, our approach probes the stability of possible supramolecular intermediates (including two-, three-, and four-stranded assemblies with out-of-register base pairing between guanines) on the formation pathway. The simulations suggest that "cross-like" two-stranded assemblies may serve as nucleation centers in the initial formation of parallel stranded G-DNA quadruplexes, proceeding through a series of rearrangements involving trapping of cations, association of additional strands, and progressive slippage of strands toward the full stem. To supplement the analysis, approximate free energies of the models are obtained with explicit consideration of the integral cations. The approach applied here serves as a prototype for qualitatively investigating other G-DNA molecules using molecular dynamics simulation and free-energy analysis.
- MeSH
- Time Factors MeSH
- DNA chemistry MeSH
- G-Quadruplexes MeSH
- Guanine chemistry MeSH
- Ions MeSH
- Cations MeSH
- Kinetics MeSH
- Nucleic Acid Conformation MeSH
- Molecular Conformation MeSH
- Models, Molecular MeSH
- Oligonucleotides chemistry MeSH
- Sodium chemistry MeSH
- Software MeSH
- Temperature MeSH
- Thermodynamics MeSH
- Hydrogen Bonding MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH
- Names of Substances
- DNA MeSH
- Guanine MeSH
- Ions MeSH
- Cations MeSH
- Oligonucleotides MeSH
- Sodium MeSH
Unrestrained 5-20-ns explicit-solvent molecular dynamics simulations using the Cornell et al. force field have been carried out for d[GCG(N)11GCG]2 (N, purine base) considering guanine*cytosine (G*C), adenine*thymine (A*T), inosine*5-methyl-cytosine (I*mC), and 2-amino-adenine*thymine (D*T) basepairs. The simulations unambiguously show that the structure and elasticity of N-tracts is primarily determined by the presence of the amino group in the minor groove. Simulated A-, I-, and AI-tracts show almost identical structures, with high propeller twist and minor groove narrowing. G- and D-tracts have small propeller twisting and are partly shifted toward the A-form. The elastic properties also differ between the two groups. The sequence-dependent electrostatic component of base stacking seems to play a minor role. Our conclusions are entirely consistent with available experimental data. Nevertheless, the propeller twist and helical twist in the simulated A-tract appear to be underestimated compared to crystallographic studies. To obtain further insight into the possible force field deficiencies, additional multiple simulations have been made for d(A)10, systematically comparing four major force fields currently used in DNA simulations and utilizing B and A-DNA forms as the starting structure. This comparison shows that the conclusions of the present work are not influenced by the force field choice.
- MeSH
- DNA chemistry MeSH
- Nucleic Acid Conformation MeSH
- Models, Molecular MeSH
- Base Pairing MeSH
- Polydeoxyribonucleotides chemistry MeSH
- Elasticity MeSH
- Purines chemistry MeSH
- Hydrogen Bonding MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- DNA MeSH
- Polydeoxyribonucleotides MeSH
- Purines MeSH
The ability of the four-stranded guanine (G)-DNA motif to incorporate nonstandard guanine analogue bases 6-oxopurine (inosine, I), 6-thioguanine (tG), and 6-thiopurine (tI) has been investigated using large-scale molecular dynamics simulations. The simulations suggest that a G-DNA stem can incorporate inosines without any marked effect on its structure and dynamics. The all-inosine quadruplex stem d(IIII)(4) shows identical dynamical properties as d(GGGG)(4) on the nanosecond time scale, with both molecular assemblies being stabilized by monovalent cations residing in the channel of the stem. However, simulations carried out in the absence of these cations show dramatic differences in the behavior of d(GGGG)(4) and d(IIII)(4). Whereas vacant d(GGGG)(4) shows large fluctuations but does not disintegrate, vacant d(IIII)(4) is completely disrupted within the first nanosecond. This is a consequence of the lack of the H-bonds involving the N2 amino group that is not present in inosine. This indicates that formation of the inosine quadruplex could involve entirely different intermediate structures than formation of the guanosine quadruplex, and early association of cations in this process appears to be inevitable. In the simulations, the incorporation of 6-thioguanine and 6-thiopurine sharply destabilizes four-stranded G-DNA structures, in close agreement with experimental data. The main reason is the size of the thiogroup leading to considerable steric conflicts and expelling the cations out of the channel of the quadruplex stem. The G-DNA stem can accommodate a single thioguanine base with minor perturbations. Incorporation of a thioguanine quartet layer is associated with a large destabilization of the G-DNA stem whereas the all-thioguanine quadruplex immediately collapses.
- MeSH
- Biophysics MeSH
- Biophysical Phenomena MeSH
- DNA chemistry MeSH
- Inosine chemistry MeSH
- Ion Channels chemistry MeSH
- Nucleic Acid Conformation * MeSH
- Mercaptopurine chemistry MeSH
- Models, Molecular MeSH
- Sodium chemistry MeSH
- Thermodynamics MeSH
- Thioguanine chemistry MeSH
- Hydrogen Bonding MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- DNA MeSH
- Inosine MeSH
- Ion Channels MeSH
- Mercaptopurine MeSH
- Sodium MeSH
- Thioguanine MeSH
The crystal structure of d(CATGGGCCCATG)(2) shows unique stacking patterns of a stable B<-->A-DNA intermediate. We evaluated intrinsic base stacking energies in this crystal structure using an ab initio quantum mechanical method. We found that all crystal base pair steps have stacking energies close to their values in the standard and crystal B-DNA geometries. Thus, naturally occurring stacking geometries were essentially isoenergetic while individual base pair steps differed substantially in the balance of intra-strand and inter-strand stacking terms. Also, relative dispersion, electrostatic and polarization contributions to the stability of different base pair steps were very sensitive to base composition and sequence context. A large stacking flexibility is most apparent for the CpA step, while the GpG step is characterized by weak intra-strand stacking. Hydration effects were estimated using the Langevin dipoles solvation model. These calculations showed that an aqueous environment efficiently compensates for electrostatic stacking contributions. Finally, we have carried out explicit solvent molecular dynamics simulation of the d(CATGGGCCCATG)(2) duplex in water. Here the DNA conformation did not retain the initial crystal geometry, but moved from the B<-->A intermediate towards the B-DNA structure. The base stacking energy improved in the course of this simulation. Our findings indicate that intrinsic base stacking interactions are not sufficient to stabilize the local conformational variations in crystals.
- MeSH
- DNA chemistry genetics metabolism MeSH
- Crystallization MeSH
- Models, Molecular MeSH
- Pliability MeSH
- Base Pairing * MeSH
- Computer Simulation * MeSH
- Solvents MeSH
- Base Sequence MeSH
- Static Electricity MeSH
- Thermodynamics MeSH
- Water chemistry metabolism MeSH
- Base Composition MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- DNA MeSH
- Solvents MeSH
- Water MeSH