Most cited article - PubMed ID 27863061
How to understand atomistic molecular dynamics simulations of RNA and protein-RNA complexes?
Molecular dynamics (MD) simulations are an important and well-established tool for investigating RNA structural dynamics, but their accuracy relies heavily on the quality of the employed force field (ff). In this work, we present a comprehensive evaluation of widely used pair-additive and polarizable RNA ffs using the challenging UUCG tetraloop (TL) benchmark system. Extensive standard MD simulations, initiated from the NMR structure of the 14-mer UUCG TL, revealed that most ffs did not maintain the native state, instead favoring alternative loop conformations. Notably, three very recent variants of pair-additive ffs, OL3CP-gHBfix21, DES-Amber, and OL3R2.7, successfully preserved the native structure over a 10 × 20 μs time scale. To further assess these ffs, we performed enhanced sampling folding simulations of the shorter 8-mer UUCG TL, starting from the single-stranded conformation. Estimated folding free energies (ΔG°fold) varied significantly among these three ffs, with values of 0.0 ± 0.6, 2.4 ± 0.8, and 7.4 ± 0.2 kcal/mol for OL3CP-gHBfix21, DES-Amber, and OL3R2.7, respectively. The ΔG°fold value predicted by the OL3CP-gHBfix21 ff was closest to experimental estimates, ranging from -1.6 to -0.7 kcal/mol. In contrast, the higher ΔG°fold values obtained using DES-Amber and OL3R2.7 were unexpected, suggesting that key interactions are inaccurately described in the folded, unfolded, or misfolded ensembles. These discrepancies led us to further test DES-Amber and OL3R2.7 ffs on additional RNA and DNA systems, where further performance issues were observed. Our results emphasize the complexity of accurately modeling RNA dynamics and suggest that creating an RNA ff capable of reliably performing across a wide range of RNA systems remains extremely challenging. In conclusion, our study provides valuable insights into the capabilities of current RNA ffs and highlights key areas for future ff development.
- MeSH
- Nucleic Acid Conformation MeSH
- RNA * chemistry MeSH
- Molecular Dynamics Simulation * MeSH
- Thermodynamics MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- RNA * MeSH
Recognition of single-stranded RNA (ssRNA) by RNA recognition motif (RRM) domains is an important class of protein-RNA interactions. Many such complexes were characterized using nuclear magnetic resonance (NMR) and/or X-ray crystallography techniques, revealing ensemble-averaged pictures of the bound states. However, it is becoming widely accepted that better understanding of protein-RNA interactions would be obtained from ensemble descriptions. Indeed, earlier molecular dynamics simulations of bound states indicated visible dynamics at the RNA-RRM interfaces. Here, we report the first atomistic simulation study of spontaneous binding of short RNA sequences to RRM domains of HuR and SRSF1 proteins. Using a millisecond-scale aggregate ensemble of unbiased simulations, we were able to observe a few dozen binding events. HuR RRM3 utilizes a pre-binding state to navigate the RNA sequence to its partially disordered bound state and then to dynamically scan its different binding registers. SRSF1 RRM2 binding is more straightforward but still multiple-pathway. The present study necessitated development of a goal-specific force field modification, scaling down the intramolecular van der Waals interactions of the RNA which also improves description of the RNA-RRM bound state. Our study opens up a new avenue for large-scale atomistic investigations of binding landscapes of protein-RNA complexes, and future perspectives of such research are discussed.
- MeSH
- ELAV-Like Protein 1 metabolism MeSH
- RNA Recognition Motif genetics MeSH
- RNA-Binding Proteins * metabolism MeSH
- RNA * chemistry MeSH
- RNA Recognition Motif Proteins metabolism MeSH
- Protein Binding MeSH
- Binding Sites MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- ELAV-Like Protein 1 MeSH
- RNA-Binding Proteins * MeSH
- RNA * MeSH
- RNA Recognition Motif Proteins MeSH
The conserved protein Hfq is a key factor in the RNA-mediated control of gene expression in most known bacteria. The transient intermediates Hfq forms with RNA support intricate and robust regulatory networks. In Pseudomonas, Hfq recognizes repeats of adenine-purine-any nucleotide (ARN) in target mRNAs via its distal binding side, and together with the catabolite repression control (Crc) protein, assembles into a translation-repression complex. Earlier experiments yielded static, ensemble-averaged structures of the complex, but details of its interface dynamics and assembly pathway remained elusive. Using explicit solvent atomistic molecular dynamics simulations, we modeled the extensive dynamics of the Hfq-RNA interface and found implications for the assembly of the complex. We predict that syn/anti flips of the adenine nucleotides in each ARN repeat contribute to a dynamic recognition mechanism between the Hfq distal side and mRNA targets. We identify a previously unknown binding pocket that can accept any nucleotide and propose that it may serve as a 'status quo' staging point, providing nonspecific binding affinity, until Crc engages the Hfq-RNA binary complex. The dynamical components of the Hfq-RNA recognition can speed up screening of the pool of the surrounding RNAs, participate in rapid accommodation of the RNA on the protein surface, and facilitate competition among different RNAs. The register of Crc in the ternary assembly could be defined by the recognition of a guanine-specific base-phosphate interaction between the first and last ARN repeats of the bound RNA. This dynamic substrate recognition provides structural rationale for the stepwise assembly of multicomponent ribonucleoprotein complexes nucleated by Hfq-RNA binding.
- Keywords
- ARN repeats, Crc protein, Hfq protein, RNA metabolism, RNA-binding protein, dynamic recognition, molecular dynamics, protein–nucleic acid interaction,
- MeSH
- RNA, Bacterial chemistry genetics metabolism MeSH
- Nucleic Acid Conformation MeSH
- Protein Conformation MeSH
- Nucleotide Motifs * MeSH
- Host Factor 1 Protein chemistry genetics metabolism MeSH
- Pseudomonas aeruginosa genetics metabolism MeSH
- Gene Expression Regulation, Bacterial * MeSH
- Protein Binding MeSH
- Binding Sites MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- RNA, Bacterial MeSH
- Host Factor 1 Protein MeSH
Molecular dynamics (MD) simulations became a leading tool for investigation of structural dynamics of nucleic acids. Despite recent efforts to improve the empirical potentials (force fields, ffs), RNA ffs have persisting deficiencies, which hamper their utilization in quantitatively accurate simulations. Previous studies have shown that at least two salient problems contribute to difficulties in the description of free-energy landscapes of small RNA motifs: (i) excessive stabilization of the unfolded single-stranded RNA ensemble by intramolecular base-phosphate and sugar-phosphate interactions and (ii) destabilization of the native folded state by underestimation of stability of base pairing. Here, we introduce a general ff term (gHBfix) that can selectively fine-tune nonbonding interaction terms in RNA ffs, in particular, the H bonds. The gHBfix potential affects the pairwise interactions between all possible pairs of the specific atom types, while all other interactions remain intact; i.e., it is not a structure-based model. In order to probe the ability of the gHBfix potential to refine the ff nonbonded terms, we performed an extensive set of folding simulations of RNA tetranucleotides and tetraloops. On the basis of these data, we propose particular gHBfix parameters to modify the AMBER RNA ff. The suggested parametrization significantly improves the agreement between experimental data and the simulation conformational ensembles, although our current ff version still remains far from being flawless. While attempts to tune the RNA ffs by conventional reparametrizations of dihedral potentials or nonbonded terms can lead to major undesired side effects, as we demonstrate for some recently published ffs, gHBfix has a clear promising potential to improve the ff performance while avoiding introduction of major new imbalances.
- MeSH
- RNA chemistry MeSH
- Molecular Dynamics Simulation * MeSH
- Hydrogen Bonding MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- RNA MeSH
The RNA recognition motif (RRM) is the most common RNA binding domain across eukaryotic proteins. It is therefore of great value to engineer its specificity to target RNAs of arbitrary sequence. This was recently achieved for the RRM in Rbfox protein, where four mutations R118D, E147R, N151S, and E152T were designed to target the precursor to the oncogenic miRNA 21. Here, we used a variety of molecular dynamics-based approaches to predict specific interactions at the binding interface. Overall, we have run approximately 50 microseconds of enhanced sampling and plain molecular dynamics simulations on the engineered complex as well as on the wild-type Rbfox·pre-miRNA 20b from which the mutated systems were designed. Comparison with the available NMR data on the wild type molecules (protein, RNA, and their complex) served to establish the accuracy of the calculations. Free energy calculations suggest that further improvements in affinity and selectivity are achieved by the S151T replacement.
- MeSH
- Nucleic Acid Conformation MeSH
- Humans MeSH
- MicroRNAs chemistry genetics metabolism MeSH
- Models, Molecular MeSH
- RNA Recognition Motif * genetics MeSH
- Nuclear Magnetic Resonance, Biomolecular MeSH
- Protein Engineering MeSH
- RNA-Binding Proteins chemistry genetics metabolism MeSH
- RNA chemistry metabolism MeSH
- Amino Acid Sequence MeSH
- Molecular Dynamics Simulation MeSH
- RNA Stability MeSH
- Protein Binding MeSH
- Binding Sites genetics MeSH
- Computational Biology MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- MicroRNAs MeSH
- MIRN20b microRNA, human MeSH Browser
- MIRN21 microRNA, human MeSH Browser
- RNA-Binding Proteins MeSH
- RNA MeSH
We have carried out an extended set of standard and enhanced-sampling MD simulations (for a cumulative simulation time of 620 μs) with the aim to study folding landscapes of the rGGGUUAGGG and rGGGAGGG parallel G-hairpins (PH) with propeller loop. We identify folding and unfolding pathways of the PH, which is bridged with the unfolded state via an ensemble of cross-like structures (CS) possessing mutually tilted or perpendicular G-strands interacting via guanine-guanine H-bonding. The oligonucleotides reach the PH conformation from the unfolded state via a conformational diffusion through the folding landscape, i.e. as a series of rearrangements of the H-bond interactions starting from compacted anti-parallel hairpin-like structures. Although isolated PHs do not appear to be thermodynamically stable we suggest that CS and PH-types of structures are sufficiently populated during RNA guanine quadruplex (GQ) folding within the context of complete GQ-forming sequences. These structures may participate in compact coil-like ensembles that involve all four G-strands and already some bound ions. Such ensembles can then rearrange into the fully folded parallel GQs via conformational diffusion. We propose that the basic atomistic folding mechanism of propeller loops suggested in this work may be common for their formation in RNA and DNA GQs.
- MeSH
- G-Quadruplexes * MeSH
- Guanine chemistry metabolism MeSH
- Kinetics MeSH
- RNA chemistry metabolism MeSH
- RNA Folding * MeSH
- Base Sequence MeSH
- Molecular Dynamics Simulation MeSH
- Thermodynamics MeSH
- Hydrogen Bonding MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Guanine MeSH
- RNA MeSH
With both catalytic and genetic functions, ribonucleic acid (RNA) is perhaps the most pluripotent chemical species in molecular biology, and its functions are intimately linked to its structure and dynamics. Computer simulations, and in particular atomistic molecular dynamics (MD), allow structural dynamics of biomolecular systems to be investigated with unprecedented temporal and spatial resolution. We here provide a comprehensive overview of the fast-developing field of MD simulations of RNA molecules. We begin with an in-depth, evaluatory coverage of the most fundamental methodological challenges that set the basis for the future development of the field, in particular, the current developments and inherent physical limitations of the atomistic force fields and the recent advances in a broad spectrum of enhanced sampling methods. We also survey the closely related field of coarse-grained modeling of RNA systems. After dealing with the methodological aspects, we provide an exhaustive overview of the available RNA simulation literature, ranging from studies of the smallest RNA oligonucleotides to investigations of the entire ribosome. Our review encompasses tetranucleotides, tetraloops, a number of small RNA motifs, A-helix RNA, kissing-loop complexes, the TAR RNA element, the decoding center and other important regions of the ribosome, as well as assorted others systems. Extended sections are devoted to RNA-ion interactions, ribozymes, riboswitches, and protein/RNA complexes. Our overview is written for as broad of an audience as possible, aiming to provide a much-needed interdisciplinary bridge between computation and experiment, together with a perspective on the future of the field.
- MeSH
- DNA chemistry MeSH
- Catalysis MeSH
- Nucleic Acid Conformation * MeSH
- Computer Simulation MeSH
- RNA chemistry MeSH
- Molecular Dynamics Simulation * MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Review MeSH
- Research Support, N.I.H., Extramural MeSH
- Names of Substances
- DNA MeSH
- RNA MeSH
The Fox-1 RNA recognition motif (RRM) domain is an important member of the RRM protein family. We report a 1.8 Å X-ray structure of the free Fox-1 containing six distinct monomers. We use this and the nuclear magnetic resonance (NMR) structure of the Fox-1 protein/RNA complex for molecular dynamics (MD) analyses of the structured hydration. The individual monomers of the X-ray structure show diverse hydration patterns, however, MD excellently reproduces the most occupied hydration sites. Simulations of the protein/RNA complex show hydration consistent with the isolated protein complemented by hydration sites specific to the protein/RNA interface. MD predicts intricate hydration sites with water-binding times extending up to hundreds of nanoseconds. We characterize two of them using NMR spectroscopy, RNA binding with switchSENSE and free-energy calculations of mutant proteins. Both hydration sites are experimentally confirmed and their abolishment reduces the binding free-energy. A quantitative agreement between theory and experiment is achieved for the S155A substitution but not for the S122A mutant. The S155 hydration site is evolutionarily conserved within the RRM domains. In conclusion, MD is an effective tool for predicting and interpreting the hydration patterns of protein/RNA complexes. Hydration is not easily detectable in NMR experiments but can affect stability of protein/RNA complexes.
- MeSH
- Crystallography, X-Ray MeSH
- Humans MeSH
- RNA Recognition Motif genetics MeSH
- Mutagenesis, Site-Directed MeSH
- Nuclear Magnetic Resonance, Biomolecular MeSH
- Recombinant Proteins chemistry genetics metabolism MeSH
- RNA metabolism MeSH
- RNA Splicing Factors chemistry genetics metabolism MeSH
- Molecular Dynamics Simulation MeSH
- Amino Acid Substitution MeSH
- Binding Sites MeSH
- Water chemistry MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- RBFOX1 protein, human MeSH Browser
- Recombinant Proteins MeSH
- RNA MeSH
- RNA Splicing Factors MeSH
- Water MeSH