Most cited article - PubMed ID 32240155
Ten quick tips for homology modeling of high-resolution protein 3D structures
Side-chain rotamer prediction is one of the most critical late stages in protein 3D structure building. Highly advanced and specialized algorithms (e.g., FASPR, RASP, SCWRL4, and SCWRL4v) optimize this process by use of rotamer libraries, combinatorial searches, and scoring functions. We seek to identify the sources of key rotamer errors as a basis for correcting and improving the accuracy of protein modeling going forward. In order to evaluate the aforementioned programs, we process 2496 high-quality single-chained all-atom filtered 30% homology protein 3D structures and use discretized rotamer analysis to compare original with calculated structures. Among 513,024 filtered residue records, increased amino acid residue-dependent rotamer errors─associated in particular with polar and charged amino acid residues (ARG, LYS, and GLN)─clearly correlate with increased amino acid residue solvent accessibility and an increased residue tendency toward the adoption of non-canonical off rotamers which modeling programs struggle to predict accurately. Understanding the impact of solvent accessibility now appears key to improved side-chain prediction accuracies.
- MeSH
- Algorithms MeSH
- Amino Acids * chemistry MeSH
- Protein Conformation MeSH
- Proteins * chemistry MeSH
- Solvents MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Amino Acids * MeSH
- Proteins * MeSH
- Solvents MeSH
Most of the available crystal structures of epidermal growth factor receptor (EGFR) kinase domain, bound to drug inhibitors, originated from ligand-based drug design studies. Here, we used variations in 110 crystal structures to assemble eight distinct families highlighting the C-helix orientation in the N-lobe of the EGFR kinase domain. The families shared similar mutational profiles and similarity in the ligand R-groups (chemical composition, geometry, and charge) facing the C-helix, mutation sites, and DFG domain. For structure-based drug design, we recommend a systematic decision-making process for choice of template, guided by appropriate pairwise fitting and clustering before the molecular docking step. Alternatively, the binding site shape/volume can be used to filter and select the compound libraries.
- MeSH
- ErbB Receptors antagonists & inhibitors chemistry genetics MeSH
- Protein Kinase Inhibitors pharmacology MeSH
- Humans MeSH
- Ligands MeSH
- Mutation MeSH
- Drug Design methods MeSH
- Decision Making MeSH
- Molecular Docking Simulation MeSH
- Binding Sites MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Review MeSH
- Names of Substances
- EGFR protein, human MeSH Browser
- ErbB Receptors MeSH
- Protein Kinase Inhibitors MeSH
- Ligands MeSH
Homology modeling is a method for building protein 3D structures using protein primary sequence and utilizing prior knowledge gained from structural similarities with other proteins. The homology modeling process is done in sequential steps where sequence/structure alignment is optimized, then a backbone is built and later, side-chains are added. Once the low-homology loops are modeled, the whole 3D structure is optimized and validated. In the past three decades, a few collective and collaborative initiatives allowed for continuous progress in both homology and ab initio modeling. Critical Assessment of protein Structure Prediction (CASP) is a worldwide community experiment that has historically recorded the progress in this field. Folding@Home and Rosetta@Home are examples of crowd-sourcing initiatives where the community is sharing computational resources, whereas RosettaCommons is an example of an initiative where a community is sharing a codebase for the development of computational algorithms. Foldit is another initiative where participants compete with each other in a protein folding video game to predict 3D structure. In the past few years, contact maps deep machine learning was introduced to the 3D structure prediction process, adding more information and increasing the accuracy of models significantly. In this review, we will take the reader in a journey of exploration from the beginnings to the most recent turnabouts, which have revolutionized the field of homology modeling. Moreover, we discuss the new trends emerging in this rapidly growing field.
- Keywords
- Artificial intelligence, Collective intelligence, Homology modeling, Machine learning, Protein 3D structure, Structural bioinformatics,
- Publication type
- Journal Article MeSH
- Review MeSH