-
Something wrong with this record ?
Automatic workflow for the classification of local DNA conformations
P. Čech, J. Kukal, J. Černý, B. Schneider, D. Svozil,
Language English Country England, Great Britain
Document type Journal Article, Research Support, Non-U.S. Gov't
NLK
BioMedCentral
from 2000-01-12
BioMedCentral Open Access
from 2000
Directory of Open Access Journals
from 2000
Free Medical Journals
from 2000
PubMed Central
from 2000
Europe PubMed Central
from 2000
ProQuest Central
from 2009-01-01
Open Access Digital Library
from 2000-07-01
Open Access Digital Library
from 2000-01-01
Open Access Digital Library
from 2000-01-01
Medline Complete (EBSCOhost)
from 2000-01-01
Health & Medicine (ProQuest)
from 2009-01-01
ROAD: Directory of Open Access Scholarly Resources
from 2000
Springer Nature OA/Free Journals
from 2000-12-01
- MeSH
- Algorithms MeSH
- DNA chemistry MeSH
- G-Quadruplexes MeSH
- Classification methods MeSH
- Nucleic Acid Conformation MeSH
- Crystallography, X-Ray MeSH
- Nuclear Magnetic Resonance, Biomolecular MeSH
- Workflow MeSH
- Cluster Analysis MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
BACKGROUND: A growing number of crystal and NMR structures reveals a considerable structural polymorphism of DNA architecture going well beyond the usual image of a double helical molecule. DNA is highly variable with dinucleotide steps exhibiting a substantial flexibility in a sequence-dependent manner. An analysis of the conformational space of the DNA backbone and the enhancement of our understanding of the conformational dependencies in DNA are therefore important for full comprehension of DNA structural polymorphism. RESULTS: A detailed classification of local DNA conformations based on the technique of Fourier averaging was published in our previous work. However, this procedure requires a considerable amount of manual work. To overcome this limitation we developed an automatic classification method consisting of the combination of supervised and unsupervised approaches. A proposed workflow is composed of k-NN method followed by a non-hierarchical single-pass clustering algorithm. We applied this workflow to analyze 816 X-ray and 664 NMR DNA structures released till February 2013. We identified and annotated six new conformers, and we assigned four of these conformers to two structurally important DNA families: guanine quadruplexes and Holliday (four-way) junctions. We also compared populations of the assigned conformers in the dataset of X-ray and NMR structures. CONCLUSIONS: In the present work we developed a machine learning workflow for the automatic classification of dinucleotide conformations. Dinucleotides with unassigned conformations can be either classified into one of already known 24 classes or they can be flagged as unclassifiable. The proposed machine learning workflow permits identification of new classes among so far unclassifiable data, and we identified and annotated six new conformations in the X-ray structures released since our previous analysis. The results illustrate the utility of machine learning approaches in the classification of local DNA conformations.
References provided by Crossref.org
- 000
- 00000naa a2200000 a 4500
- 001
- bmc14050958
- 003
- CZ-PrNML
- 005
- 20140408113949.0
- 007
- ta
- 008
- 140401s2013 enk f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1186/1471-2105-14-205 $2 doi
- 035 __
- $a (PubMed)23800225
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a enk
- 100 1_
- $a Čech, Petr
- 245 10
- $a Automatic workflow for the classification of local DNA conformations / $c P. Čech, J. Kukal, J. Černý, B. Schneider, D. Svozil,
- 520 9_
- $a BACKGROUND: A growing number of crystal and NMR structures reveals a considerable structural polymorphism of DNA architecture going well beyond the usual image of a double helical molecule. DNA is highly variable with dinucleotide steps exhibiting a substantial flexibility in a sequence-dependent manner. An analysis of the conformational space of the DNA backbone and the enhancement of our understanding of the conformational dependencies in DNA are therefore important for full comprehension of DNA structural polymorphism. RESULTS: A detailed classification of local DNA conformations based on the technique of Fourier averaging was published in our previous work. However, this procedure requires a considerable amount of manual work. To overcome this limitation we developed an automatic classification method consisting of the combination of supervised and unsupervised approaches. A proposed workflow is composed of k-NN method followed by a non-hierarchical single-pass clustering algorithm. We applied this workflow to analyze 816 X-ray and 664 NMR DNA structures released till February 2013. We identified and annotated six new conformers, and we assigned four of these conformers to two structurally important DNA families: guanine quadruplexes and Holliday (four-way) junctions. We also compared populations of the assigned conformers in the dataset of X-ray and NMR structures. CONCLUSIONS: In the present work we developed a machine learning workflow for the automatic classification of dinucleotide conformations. Dinucleotides with unassigned conformations can be either classified into one of already known 24 classes or they can be flagged as unclassifiable. The proposed machine learning workflow permits identification of new classes among so far unclassifiable data, and we identified and annotated six new conformations in the X-ray structures released since our previous analysis. The results illustrate the utility of machine learning approaches in the classification of local DNA conformations.
- 650 _2
- $a algoritmy $7 D000465
- 650 _2
- $a klasifikace $x metody $7 D002965
- 650 _2
- $a shluková analýza $7 D016000
- 650 _2
- $a krystalografie rentgenová $7 D018360
- 650 _2
- $a DNA $x chemie $7 D004247
- 650 _2
- $a G-kvadruplexy $7 D054856
- 650 _2
- $a nukleární magnetická rezonance biomolekulární $7 D019906
- 650 _2
- $a konformace nukleové kyseliny $7 D009690
- 650 _2
- $a průběh práce $7 D057188
- 655 _2
- $a časopisecké články $7 D016428
- 655 _2
- $a práce podpořená grantem $7 D013485
- 700 1_
- $a Kukal, Jaromír $u -
- 700 1_
- $a Černý, Jiří $u -
- 700 1_
- $a Schneider, Bohdan $u -
- 700 1_
- $a Svozil, Daniel $u -
- 773 0_
- $w MED00008167 $t BMC bioinformatics $x 1471-2105 $g Roč. 14, č. - (2013), s. 205
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/23800225 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y a $z 0
- 990 __
- $a 20140401 $b ABA008
- 991 __
- $a 20140408114037 $b ABA008
- 999 __
- $a ok $b bmc $g 1018094 $s 849538
- BAS __
- $a 3
- BAS __
- $a PreBMC
- BMC __
- $a 2013 $b 14 $c - $d 205 $i 1471-2105 $m BMC bioinformatics $n BMC Bioinformatics $x MED00008167
- LZP __
- $a Pubmed-20140401