Most cited article - PubMed ID 32367806
Choice of Force Field for Proteins Containing Structured and Intrinsically Disordered Regions
By merging advanced dimensionality reduction (DR) and clustering algorithm (CA) techniques, our study advances the sampling procedure for predicting NMR chemical shifts (CS) in intrinsically disordered proteins (IDPs), making a significant leap forward in the field of protein analysis/modeling. We enhance NMR CS sampling by generating clustered ensembles that accurately reflect the different properties and phenomena encapsulated by the IDP trajectories. This investigation critically assessed different rapid CS predictors, both neural network (e.g., Sparta+ and ShiftX2) and database-driven (ProCS-15), and highlighted the need for more advanced quantum calculations and the subsequent need for more tractable-sized conformational ensembles. Although neural network CS predictors outperformed ProCS-15 for all atoms, all tools showed poor agreement with HN CSs, and the neural network CS predictors were unable to capture the influence of phosphorylated residues, highly relevant for IDPs. This study also addressed the limitations of using direct clustering with collective variables, such as the widespread implementation of the GROMOS algorithm. Clustered ensembles (CEs) produced by this algorithm showed poor performance with chemical shifts compared to sequential ensembles (SEs) of similar size. Instead, we implement a multiscale DR and CA approach and explore the challenges and limitations of applying these algorithms to obtain more robust and tractable CEs. The novel feature of this investigation is the use of solvent-accessible surface area (SASA) as one of the fingerprints for DR alongside previously investigated α carbon distance/angles or ϕ/ψ dihedral angles. The ensembles produced with SASA tSNE DR produced CEs better aligned with the experimental CS of between 0.17 and 0.36 r2 (0.18-0.26 ppm) depending on the system and replicate. Furthermore, this technique produced CEs with better agreement than traditional SEs in 85.7% of all ensemble sizes. This study investigates the quality of ensembles produced based on different input features, comparing latent spaces produced by linear vs nonlinear DR techniques and a novel integrated silhouette score scanning protocol for tSNE DR.
The pre-tetramerization loop (PTL) of the human tumor suppressor protein p53 is an intrinsically disordered region (IDR) necessary for the tetramerization process, and its flexibility contributes to the essential conformational changes needed. Although the IDR can be accurately simulated in the traditional manner of molecular dynamics (MD) with the end-to-end distance (EEdist) unhindered, we sought to explore the effects of restraining the EEdist to the values predicted by electron microscopy (EM) and other distances. Simulating the PTL trajectory with a restrained EEdist , we found an increased agreement of nuclear magnetic resonance (NMR) chemical shifts with experiments. Additionally, we observed a plethora of secondary structures and contacts that only appear when the trajectory is restrained. Our findings expand the understanding of the tetramerization of p53 and provide insight into how mutations could make the protein impotent. In particular, our findings demonstrate the importance of restraining the EEdist in studying IDRs and how their conformations change under different conditions. Our results provide a better understanding of the PTL and the conformational dynamics of IDRs in general, which are useful for further studies regarding mutations and their effects on the activity of p53.
- MeSH
- Protein Conformation MeSH
- Humans MeSH
- Magnetic Resonance Spectroscopy MeSH
- Tumor Suppressor Protein p53 chemistry MeSH
- Protein Structure, Secondary MeSH
- Molecular Dynamics Simulation * MeSH
- Intrinsically Disordered Proteins * chemistry MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Names of Substances
- Tumor Suppressor Protein p53 MeSH
- Intrinsically Disordered Proteins * MeSH
Intrinsically disordered proteins (IDPs) or intrinsically disordered regions (IDRs) is a class of biologically important proteins exhibiting specific biophysical characteristics. They lack a hydrophobic core, and their conformational behavior is strongly influenced by electrostatic interactions. IDPs and IDRs are highly dynamic, and a characterization of the motions of IDPs and IDRs is essential for their physically correct description. NMR together with molecular dynamics simulations are the methods best suited to such a task because they provide information about dynamics of proteins with atomistic resolution. Here, we present a study of motions of a disordered C-terminal domain of the delta subunit of RNA polymerase from Bacillus subtilis. Positively and negatively charged residues in the studied domain form transient electrostatic contacts critical for the biological function. Our study is focused on investigation of ps-ns dynamics of backbone of the delta subunit based on analysis of amide 15N NMR relaxation data and molecular dynamics simulations. In order to extend an informational content of NMR data to lower frequencies, which are more sensitive to slower motions, we combined standard (high-field) NMR relaxation experiments with high-resolution relaxometry. Altogether, we collected data reporting the relaxation at 12 different magnetic fields, resulting in an unprecedented data set. Our results document that the analysis of such data provides a consistent description of dynamics and confirms the validity of so far used protocols of the analysis of dynamics of IDPs also for a partially folded protein. In addition, the potential to access detailed description of motions at the timescale of tens of ns with the help of relaxometry data is discussed. Interestingly, in our case, it appears to be mostly relevant for a region involved in the formation of temporary contacts within the disordered region, which was previously proven to be biologically important.
- MeSH
- Amides MeSH
- DNA-Directed RNA Polymerases chemistry MeSH
- Protein Conformation MeSH
- Magnetic Resonance Spectroscopy MeSH
- Molecular Dynamics Simulation MeSH
- Intrinsically Disordered Proteins * chemistry MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Amides MeSH
- DNA-Directed RNA Polymerases MeSH
- Intrinsically Disordered Proteins * MeSH
Intrinsically disordered proteins are ubiquitous throughout all known proteomes, playing essential roles in all aspects of cellular and extracellular biochemistry. To understand their function, it is necessary to determine their structural and dynamic behavior and to describe the physical chemistry of their interaction trajectories. Nuclear magnetic resonance is perfectly adapted to this task, providing ensemble averaged structural and dynamic parameters that report on each assigned resonance in the molecule, unveiling otherwise inaccessible insight into the reaction kinetics and thermodynamics that are essential for function. In this review, we describe recent applications of NMR-based approaches to understanding the conformational energy landscape, the nature and time scales of local and long-range dynamics and how they depend on the environment, even in the cell. Finally, we illustrate the ability of NMR to uncover the mechanistic basis of functional disordered molecular assemblies that are important for human health.
- MeSH
- Protein Conformation MeSH
- Humans MeSH
- Magnetic Resonance Spectroscopy MeSH
- Nuclear Magnetic Resonance, Biomolecular MeSH
- Thermodynamics MeSH
- Intrinsically Disordered Proteins * chemistry MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Review MeSH
- Names of Substances
- Intrinsically Disordered Proteins * MeSH