The proteomic code: Novel amino acid residue pairing models "encode" protein folding and protein-protein interactions
Jazyk angličtina Země Spojené státy americké Médium print-electronic
Typ dokumentu časopisecké články
PubMed
40112562
DOI
10.1016/j.compbiomed.2025.110033
PII: S0010-4825(25)00384-1
Knihovny.cz E-zdroje
- Klíčová slova
- Contact map, Protein 3D structure, Protein folding, Protein-protein interactions, Proteomic code, Sense-antisense,
- MeSH
- aminokyseliny * chemie genetika MeSH
- databáze proteinů MeSH
- lidé MeSH
- molekulární modely * MeSH
- proteiny * chemie genetika metabolismus MeSH
- proteomika * MeSH
- sbalování proteinů * MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- Názvy látek
- aminokyseliny * MeSH
- proteiny * MeSH
Recent advances in protein 3D structure prediction using deep learning have focused on the importance of amino acid residue-residue connections (i.e., pairwise atomic contacts) for accuracy at the expense of mechanistic interpretability. Therefore, we decided to perform a series of analyses based on an alternative framework of residue-residue connections making primary use of the TOP2018 dataset. This framework of residue-residue connections is derived from amino acid residue pairing models both historic and new, all based on genetic principles complemented by relevant biophysical principles. Of these pairing models, three new models (named the GU, Transmuted and Shift pairing models) exhibit the highest observed-over-expected ratios and highest correlations in statistical analyses with various intra- and inter-chain datasets, in comparison to the remaining models. In addition, these new pairing models are universally frequent across different connection ranges, secondary structure connections, and protein sizes. Accordingly, following further statistical and other analyses described herein, we have come to a major conclusion that all three pairing models together could represent the basis of a universal proteomic code (second genetic code) sufficient, in and of itself, to "encode" for both protein folding mechanisms and protein-protein interactions.
Citace poskytuje Crossref.org