• Je něco špatně v tomto záznamu ?

Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites

J. Jelínek, P. Škoda, D. Hoksza,

. 2017 ; 18 (Suppl 15) : 492. [pub] 20171206

Jazyk angličtina Země Velká Británie

Typ dokumentu časopisecké články

Perzistentní odkaz   https://www.medvik.cz/link/bmc18010240

BACKGROUND: Protein-protein interactions (PPI) play a key role in an investigation of various biochemical processes, and their identification is thus of great importance. Although computational prediction of which amino acids take part in a PPI has been an active field of research for some time, the quality of in-silico methods is still far from perfect. RESULTS: We have developed a novel prediction method called INSPiRE which benefits from a knowledge base built from data available in Protein Data Bank. All proteins involved in PPIs were converted into labeled graphs with nodes corresponding to amino acids and edges to pairs of neighboring amino acids. A structural neighborhood of each node was then encoded into a bit string and stored in the knowledge base. When predicting PPIs, INSPiRE labels amino acids of unknown proteins as interface or non-interface based on how often their structural neighborhood appears as interface or non-interface in the knowledge base. We evaluated INSPiRE's behavior with respect to different types and sizes of the structural neighborhood. Furthermore, we examined the suitability of several different features for labeling the nodes. Our evaluations showed that INSPiRE clearly outperforms existing methods with respect to Matthews correlation coefficient. CONCLUSION: In this paper we introduce a new knowledge-based method for identification of protein-protein interaction sites called INSPiRE. Its knowledge base utilizes structural patterns of known interaction sites in the Protein Data Bank which are then used for PPI prediction. Extensive experiments on several well-established datasets show that INSPiRE significantly surpasses existing PPI approaches.

Citace poskytuje Crossref.org

000      
00000naa a2200000 a 4500
001      
bmc18010240
003      
CZ-PrNML
005      
20180418095427.0
007      
ta
008      
180404s2017 xxk f 000 0|eng||
009      
AR
024    7_
$a 10.1186/s12859-017-1921-4 $2 doi
035    __
$a (PubMed)29244012
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a xxk
100    1_
$a Jelínek, Jan $u Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Ke Karlovu 3, Prague 2, Czech Republic. jelinek@ksi.mff.cuni.cz.
245    10
$a Utilizing knowledge base of amino acids structural neighborhoods to predict protein-protein interaction sites / $c J. Jelínek, P. Škoda, D. Hoksza,
520    9_
$a BACKGROUND: Protein-protein interactions (PPI) play a key role in an investigation of various biochemical processes, and their identification is thus of great importance. Although computational prediction of which amino acids take part in a PPI has been an active field of research for some time, the quality of in-silico methods is still far from perfect. RESULTS: We have developed a novel prediction method called INSPiRE which benefits from a knowledge base built from data available in Protein Data Bank. All proteins involved in PPIs were converted into labeled graphs with nodes corresponding to amino acids and edges to pairs of neighboring amino acids. A structural neighborhood of each node was then encoded into a bit string and stored in the knowledge base. When predicting PPIs, INSPiRE labels amino acids of unknown proteins as interface or non-interface based on how often their structural neighborhood appears as interface or non-interface in the knowledge base. We evaluated INSPiRE's behavior with respect to different types and sizes of the structural neighborhood. Furthermore, we examined the suitability of several different features for labeling the nodes. Our evaluations showed that INSPiRE clearly outperforms existing methods with respect to Matthews correlation coefficient. CONCLUSION: In this paper we introduce a new knowledge-based method for identification of protein-protein interaction sites called INSPiRE. Its knowledge base utilizes structural patterns of known interaction sites in the Protein Data Bank which are then used for PPI prediction. Extensive experiments on several well-established datasets show that INSPiRE significantly surpasses existing PPI approaches.
650    12
$a aminokyseliny $x chemie $x metabolismus $7 D000596
650    _2
$a výpočetní biologie $7 D019295
650    _2
$a databáze proteinů $7 D030562
650    12
$a znalostní báze $7 D051188
650    _2
$a statistické modely $7 D015233
650    _2
$a mapování interakce mezi proteiny $x metody $7 D025941
650    12
$a proteiny $x chemie $x metabolismus $7 D011506
650    12
$a software $7 D012984
655    _2
$a časopisecké články $7 D016428
700    1_
$a Škoda, Petr $u Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Ke Karlovu 3, Prague 2, Czech Republic.
700    1_
$a Hoksza, David $u Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Ke Karlovu 3, Prague 2, Czech Republic.
773    0_
$w MED00008167 $t BMC bioinformatics $x 1471-2105 $g Roč. 18, Suppl 15 (2017), s. 492
856    41
$u https://pubmed.ncbi.nlm.nih.gov/29244012 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y a $z 0
990    __
$a 20180404 $b ABA008
991    __
$a 20180418095527 $b ABA008
999    __
$a ok $b bmc $g 1287725 $s 1007052
BAS    __
$a 3
BAS    __
$a PreBMC
BMC    __
$a 2017 $b 18 $c Suppl 15 $d 492 $e 20171206 $i 1471-2105 $m BMC bioinformatics $n BMC Bioinformatics $x MED00008167
LZP    __
$a Pubmed-20180404

Najít záznam

Citační ukazatele

Nahrávání dat ...

Možnosti archivace

Nahrávání dat ...