-
Je něco špatně v tomto záznamu ?
Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites
J. Jelínek, P. Škoda, D. Hoksza,
Jazyk angličtina Země Velká Británie
Typ dokumentu časopisecké články
NLK
BioMedCentral
od 2000-12-01
BioMedCentral Open Access
od 2000
Directory of Open Access Journals
od 2000
Free Medical Journals
od 2000
PubMed Central
od 2000
Europe PubMed Central
od 2000
ProQuest Central
od 2009-01-01
Open Access Digital Library
od 2000-01-01
Open Access Digital Library
od 2000-01-01
Open Access Digital Library
od 2000-07-01
Medline Complete (EBSCOhost)
od 2000-01-01
Health & Medicine (ProQuest)
od 2009-01-01
ROAD: Directory of Open Access Scholarly Resources
od 2000
Springer Nature OA/Free Journals
od 2000-12-01
- MeSH
- aminokyseliny * chemie metabolismus MeSH
- databáze proteinů MeSH
- mapování interakce mezi proteiny metody MeSH
- proteiny * chemie metabolismus MeSH
- software * MeSH
- statistické modely MeSH
- výpočetní biologie MeSH
- znalostní báze * MeSH
- Publikační typ
- časopisecké články MeSH
BACKGROUND: Protein-protein interactions (PPI) play a key role in an investigation of various biochemical processes, and their identification is thus of great importance. Although computational prediction of which amino acids take part in a PPI has been an active field of research for some time, the quality of in-silico methods is still far from perfect. RESULTS: We have developed a novel prediction method called INSPiRE which benefits from a knowledge base built from data available in Protein Data Bank. All proteins involved in PPIs were converted into labeled graphs with nodes corresponding to amino acids and edges to pairs of neighboring amino acids. A structural neighborhood of each node was then encoded into a bit string and stored in the knowledge base. When predicting PPIs, INSPiRE labels amino acids of unknown proteins as interface or non-interface based on how often their structural neighborhood appears as interface or non-interface in the knowledge base. We evaluated INSPiRE's behavior with respect to different types and sizes of the structural neighborhood. Furthermore, we examined the suitability of several different features for labeling the nodes. Our evaluations showed that INSPiRE clearly outperforms existing methods with respect to Matthews correlation coefficient. CONCLUSION: In this paper we introduce a new knowledge-based method for identification of protein-protein interaction sites called INSPiRE. Its knowledge base utilizes structural patterns of known interaction sites in the Protein Data Bank which are then used for PPI prediction. Extensive experiments on several well-established datasets show that INSPiRE significantly surpasses existing PPI approaches.
Citace poskytuje Crossref.org
- 000
- 00000naa a2200000 a 4500
- 001
- bmc18010240
- 003
- CZ-PrNML
- 005
- 20180418095427.0
- 007
- ta
- 008
- 180404s2017 xxk f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1186/s12859-017-1921-4 $2 doi
- 035 __
- $a (PubMed)29244012
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a xxk
- 100 1_
- $a Jelínek, Jan $u Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Ke Karlovu 3, Prague 2, Czech Republic. jelinek@ksi.mff.cuni.cz.
- 245 10
- $a Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites / $c J. Jelínek, P. Škoda, D. Hoksza,
- 520 9_
- $a BACKGROUND: Protein-protein interactions (PPI) play a key role in an investigation of various biochemical processes, and their identification is thus of great importance. Although computational prediction of which amino acids take part in a PPI has been an active field of research for some time, the quality of in-silico methods is still far from perfect. RESULTS: We have developed a novel prediction method called INSPiRE which benefits from a knowledge base built from data available in Protein Data Bank. All proteins involved in PPIs were converted into labeled graphs with nodes corresponding to amino acids and edges to pairs of neighboring amino acids. A structural neighborhood of each node was then encoded into a bit string and stored in the knowledge base. When predicting PPIs, INSPiRE labels amino acids of unknown proteins as interface or non-interface based on how often their structural neighborhood appears as interface or non-interface in the knowledge base. We evaluated INSPiRE's behavior with respect to different types and sizes of the structural neighborhood. Furthermore, we examined the suitability of several different features for labeling the nodes. Our evaluations showed that INSPiRE clearly outperforms existing methods with respect to Matthews correlation coefficient. CONCLUSION: In this paper we introduce a new knowledge-based method for identification of protein-protein interaction sites called INSPiRE. Its knowledge base utilizes structural patterns of known interaction sites in the Protein Data Bank which are then used for PPI prediction. Extensive experiments on several well-established datasets show that INSPiRE significantly surpasses existing PPI approaches.
- 650 12
- $a aminokyseliny $x chemie $x metabolismus $7 D000596
- 650 _2
- $a výpočetní biologie $7 D019295
- 650 _2
- $a databáze proteinů $7 D030562
- 650 12
- $a znalostní báze $7 D051188
- 650 _2
- $a statistické modely $7 D015233
- 650 _2
- $a mapování interakce mezi proteiny $x metody $7 D025941
- 650 12
- $a proteiny $x chemie $x metabolismus $7 D011506
- 650 12
- $a software $7 D012984
- 655 _2
- $a časopisecké články $7 D016428
- 700 1_
- $a Škoda, Petr $u Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Ke Karlovu 3, Prague 2, Czech Republic.
- 700 1_
- $a Hoksza, David $u Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Ke Karlovu 3, Prague 2, Czech Republic.
- 773 0_
- $w MED00008167 $t BMC bioinformatics $x 1471-2105 $g Roč. 18, Suppl 15 (2017), s. 492
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/29244012 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y a $z 0
- 990 __
- $a 20180404 $b ABA008
- 991 __
- $a 20180418095527 $b ABA008
- 999 __
- $a ok $b bmc $g 1287725 $s 1007052
- BAS __
- $a 3
- BAS __
- $a PreBMC
- BMC __
- $a 2017 $b 18 $c Suppl 15 $d 492 $e 20171206 $i 1471-2105 $m BMC bioinformatics $n BMC Bioinformatics $x MED00008167
- LZP __
- $a Pubmed-20180404