-
Something wrong with this record ?
Automated shape-based clustering of 3D immunoglobulin protein structures in chronic lymphocytic leukemia
E. Polychronidou, I. Kalamaras, A. Agathangelidis, LA. Sutton, XJ. Yan, V. Bikos, A. Vardi, K. Mochament, N. Chiorazzi, C. Belessi, R. Rosenquist, P. Ghia, K. Stamatopoulos, P. Vlamos, A. Chailyan, N. Overby, P. Marcatili, A. Hatzidimitriou, D. Tzovaras,
Language English Country England, Great Britain
Document type Journal Article
NLK
BioMedCentral
from 2000-12-01
BioMedCentral Open Access
from 2000
Directory of Open Access Journals
from 2000
Free Medical Journals
from 2000
PubMed Central
from 2000
Europe PubMed Central
from 2000
ProQuest Central
from 2009-01-01
Open Access Digital Library
from 2000-01-01
Open Access Digital Library
from 2000-07-01
Open Access Digital Library
from 2000-01-01
Medline Complete (EBSCOhost)
from 2000-01-01
Health & Medicine (ProQuest)
from 2009-01-01
ROAD: Directory of Open Access Scholarly Resources
from 2000
Springer Nature OA/Free Journals
from 2000-12-01
- MeSH
- Molecular Sequence Annotation MeSH
- Automation MeSH
- Leukemia, Lymphocytic, Chronic, B-Cell metabolism MeSH
- Databases, Protein MeSH
- Immunoglobulins chemistry MeSH
- Humans MeSH
- Amino Acid Sequence MeSH
- Imaging, Three-Dimensional * MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
BACKGROUND: Although the etiology of chronic lymphocytic leukemia (CLL), the most common type of adult leukemia, is still unclear, strong evidence implicates antigen involvement in disease ontogeny and evolution. Primary and 3D structure analysis has been utilised in order to discover indications of antigenic pressure. The latter has been mostly based on the 3D models of the clonotypic B cell receptor immunoglobulin (BcR IG) amino acid sequences. Therefore, their accuracy is directly dependent on the quality of the model construction algorithms and the specific methods used to compare the ensuing models. Thus far, reliable and robust methods that can group the IG 3D models based on their structural characteristics are missing. RESULTS: Here we propose a novel method for clustering a set of proteins based on their 3D structure focusing on 3D structures of BcR IG from a large series of patients with CLL. The method combines techniques from the areas of bioinformatics, 3D object recognition and machine learning. The clustering procedure is based on the extraction of 3D descriptors, encoding various properties of the local and global geometrical structure of the proteins. The descriptors are extracted from aligned pairs of proteins. A combination of individual 3D descriptors is also used as an additional method. The comparison of the automatically generated clusters to manual annotation by experts shows an increased accuracy when using the 3D descriptors compared to plain bioinformatics-based comparison. The accuracy is increased even more when using the combination of 3D descriptors. CONCLUSIONS: The experimental results verify that the use of 3D descriptors commonly used for 3D object recognition can be effectively applied to distinguishing structural differences of proteins. The proposed approach can be applied to provide hints for the existence of structural groups in a large set of unannotated BcR IG protein files in both CLL and, by logical extension, other contexts where it is relevant to characterize BcR IG structural similarity. The method does not present any limitations in application and can be extended to other types of proteins.
Carlsberg Research Laboratory Copenhagen Denmark
Center for Biological Sequence Analysis Technical University of Denmark Copenhagen Denmark
Department of Informatics Ionian University Corfu Greece
Hematology Department and HCT Unit G Papanikolaou Hospital Thessaloniki Greece
Masaryk University Central European Institute of Technology Brno Czech Republic
References provided by Crossref.org
- 000
- 00000naa a2200000 a 4500
- 001
- bmc19000261
- 003
- CZ-PrNML
- 005
- 20190114100253.0
- 007
- ta
- 008
- 190107s2018 enk f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1186/s12859-018-2381-1 $2 doi
- 035 __
- $a (PubMed)30453883
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a enk
- 100 1_
- $a Polychronidou, Eleftheria $u Information Technologies Institute, Centre for Research and Technology Hellas, 6th km Harilaou-Thermi Road, Thessaloniki, Greece. epolyc@iti.gr.
- 245 10
- $a Automated shape-based clustering of 3D immunoglobulin protein structures in chronic lymphocytic leukemia / $c E. Polychronidou, I. Kalamaras, A. Agathangelidis, LA. Sutton, XJ. Yan, V. Bikos, A. Vardi, K. Mochament, N. Chiorazzi, C. Belessi, R. Rosenquist, P. Ghia, K. Stamatopoulos, P. Vlamos, A. Chailyan, N. Overby, P. Marcatili, A. Hatzidimitriou, D. Tzovaras,
- 520 9_
- $a BACKGROUND: Although the etiology of chronic lymphocytic leukemia (CLL), the most common type of adult leukemia, is still unclear, strong evidence implicates antigen involvement in disease ontogeny and evolution. Primary and 3D structure analysis has been utilised in order to discover indications of antigenic pressure. The latter has been mostly based on the 3D models of the clonotypic B cell receptor immunoglobulin (BcR IG) amino acid sequences. Therefore, their accuracy is directly dependent on the quality of the model construction algorithms and the specific methods used to compare the ensuing models. Thus far, reliable and robust methods that can group the IG 3D models based on their structural characteristics are missing. RESULTS: Here we propose a novel method for clustering a set of proteins based on their 3D structure focusing on 3D structures of BcR IG from a large series of patients with CLL. The method combines techniques from the areas of bioinformatics, 3D object recognition and machine learning. The clustering procedure is based on the extraction of 3D descriptors, encoding various properties of the local and global geometrical structure of the proteins. The descriptors are extracted from aligned pairs of proteins. A combination of individual 3D descriptors is also used as an additional method. The comparison of the automatically generated clusters to manual annotation by experts shows an increased accuracy when using the 3D descriptors compared to plain bioinformatics-based comparison. The accuracy is increased even more when using the combination of 3D descriptors. CONCLUSIONS: The experimental results verify that the use of 3D descriptors commonly used for 3D object recognition can be effectively applied to distinguishing structural differences of proteins. The proposed approach can be applied to provide hints for the existence of structural groups in a large set of unannotated BcR IG protein files in both CLL and, by logical extension, other contexts where it is relevant to characterize BcR IG structural similarity. The method does not present any limitations in application and can be extended to other types of proteins.
- 650 _2
- $a sekvence aminokyselin $7 D000595
- 650 _2
- $a automatizace $7 D001331
- 650 _2
- $a databáze proteinů $7 D030562
- 650 _2
- $a lidé $7 D006801
- 650 12
- $a zobrazování trojrozměrné $7 D021621
- 650 _2
- $a imunoglobuliny $x chemie $7 D007136
- 650 _2
- $a chronická lymfatická leukemie $x metabolismus $7 D015451
- 650 _2
- $a anotace sekvence $7 D058977
- 655 _2
- $a časopisecké články $7 D016428
- 700 1_
- $a Kalamaras, Ilias $u Information Technologies Institute, Centre for Research and Technology Hellas, 6th km Harilaou-Thermi Road, Thessaloniki, Greece.
- 700 1_
- $a Agathangelidis, Andreas $u Institute of Applied Biosciences, Centre for Research and Technology Hellas, 6th km Harilaou-Thermi Road, Thessaloniki, Greece.
- 700 1_
- $a Sutton, Lesley-Ann $u Department of Immunology, Technical University of Denmark,Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden.
- 700 1_
- $a Yan, Xiao-Jie $u Karches Center for Chronic Lymphocytic Leukemia Research, The Feinstein Institute for Medical Research, Manhasset, NY, USA.
- 700 1_
- $a Bikos, Vasilis $7 xx0231091 $u Masaryk University, Central European Institute of Technology, Brno, Czech Republic.
- 700 1_
- $a Vardi, Anna $u Hematology Department and HCT Unit, G. Papanikolaou Hospital, Thessaloniki, Greece.
- 700 1_
- $a Mochament, Konstantinos $u Information Technologies Institute, Centre for Research and Technology Hellas, 6th km Harilaou-Thermi Road, Thessaloniki, Greece.
- 700 1_
- $a Chiorazzi, Nicholas $u Karches Center for Chronic Lymphocytic Leukemia Research, The Feinstein Institute for Medical Research, Manhasset, NY, USA.
- 700 1_
- $a Belessi, Chrysoula $u Nikea General Hospital, Hematology Department, Piraeus, Greece.
- 700 1_
- $a Rosenquist, Richard $u Department of Immunology, Technical University of Denmark,Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden.
- 700 1_
- $a Ghia, Paolo $u IRCCS San Raffaele Scientific Institute and Università, VitaSalute, San Raffaele, Division of Experimental Oncology, Milan, Italy.
- 700 1_
- $a Stamatopoulos, Kostas $u Institute of Applied Biosciences, Centre for Research and Technology Hellas, 6th km Harilaou-Thermi Road, Thessaloniki, Greece.
- 700 1_
- $a Vlamos, Panayiotis $u Department of Informatics,Ionian University, Corfu, Greece.
- 700 1_
- $a Chailyan, Anna $u Carlsberg Research Laboratory, Copenhagen, Denmark.
- 700 1_
- $a Overby, Nanna $u Center for Biological Sequence Analysis, Technical University of Denmark, Copenhagen, Denmark.
- 700 1_
- $a Marcatili, Paolo $u Center for Biological Sequence Analysis, Technical University of Denmark, Copenhagen, Denmark.
- 700 1_
- $a Hatzidimitriou, Anastasia $u Institute of Applied Biosciences, Centre for Research and Technology Hellas, 6th km Harilaou-Thermi Road, Thessaloniki, Greece.
- 700 1_
- $a Tzovaras, Dimitrios $u Information Technologies Institute, Centre for Research and Technology Hellas, 6th km Harilaou-Thermi Road, Thessaloniki, Greece.
- 773 0_
- $w MED00008167 $t BMC bioinformatics $x 1471-2105 $g Roč. 19, Suppl 14 (2018), s. 414
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/30453883 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y a $z 0
- 990 __
- $a 20190107 $b ABA008
- 991 __
- $a 20190114100502 $b ABA008
- 999 __
- $a ok $b bmc $g 1364381 $s 1038384
- BAS __
- $a 3
- BAS __
- $a PreBMC
- BMC __
- $a 2018 $b 19 $c Suppl 14 $d 414 $e 20181120 $i 1471-2105 $m BMC bioinformatics $n BMC Bioinformatics $x MED00008167
- LZP __
- $a Pubmed-20190107