A dynamic programming algorithm for identification of triplex-forming sequences

M. Lexa, T. Martínek, I. Burgetová, D. Kopeček, M. Brázdová

Lexa, Matej, 1966-
Department of Information Technology, Faculty of Informatics, Masaryk University, 60200 Brno, Czech Republic
Martínek, Tomáš
Department of Computer Systems, Faculty of Information Technology, Brno Technical University, 61266 Brno, Czech Republic
Burgetová, Ivana
Department of Computer Systems, Faculty of Information Technology, Brno Technical University, 61266 Brno, Czech Republic
Kopeček, Daniel
Department of Information Technology, Faculty of Informatics, Masaryk University, 60200 Brno, Czech Republic
Brázdová, Marie
Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics, Academy of Sciences of the Czech Republic v.v.i., CZ-61265 Brno, Czech Republic

Bioinformatics. 2011 ; 27 (18) : 2510-2517. [pub] 20110726

ISSN 1367-4811

Language English Country Great Britain

Document type Journal Article, Research Support, Non-U.S. Gov't

Persistent link https://www.medvik.cz/link/bmc12022128

Online Full text

NLK Free Medical Journals from 1996 to 1 year ago
PubMed Central from 2007
Open Access Digital Library from 1996-01-01
Medline Complete (EBSCOhost) from 1998-01-01
Oxford Journals Open Access Collection from 1985-01-01 to 2022-09-30
Oxford Journals Open Access Collection from 1985-01-01
ROAD: Directory of Open Access Scholarly Resources from 1998

PubMed 21791534
DOI 10.1093/bioinformatics/btr439

MOTIVATION: Current methods for identification of potential triplex-forming sequences in genomes and similar sequence sets rely primarily on detecting homopurine and homopyrimidine tracts. Procedures capable of detecting sequences supporting imperfect, but structurally feasible intramolecular triplex structures are needed for better sequence analysis.

RESULTS: We modified an algorithm for detection of approximate palindromes, so as to account for the special nature of triplex DNA structures. From available literature, we conclude that approximate triplexes tolerate two classes of errors. One, analogous to mismatches in duplex DNA, involves nucleotides in triplets that do not readily form Hoogsteen bonds. The other class involves geometrically incompatible neighboring triplets hindering proper alignment of strands for optimal hydrogen bonding and stacking. We tested the statistical properties of the algorithm, as well as its correctness when confronted with known triplex sequences. The proposed algorithm satisfactorily detects sequences with intramolecular triplex-forming potential. Its complexity is directly comparable to palindrome searching.

AVAILABILITY: Our implementation of the algorithm is available at http://www.fi.muni.cz/lexa/triplex as source code and a web-based search tool. The source code compiles into a library providing searching capability to other programs, as well as into a stand-alone command-line application based on this library.

CONTACT: lexa@fi.muni.cz

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
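
As an illustration of the baseline the abstract attributes to current methods (detecting homopurine and homopyrimidine tracts), the following is a minimal Python sketch. It is not the authors' algorithm and does not handle the imperfect triplexes the paper targets. The function name `find_homo_tracts` and the `min_len` threshold are assumptions introduced here for illustration only.

```python
import re


def find_homo_tracts(seq, min_len=10):
    """Return (start, end, kind) for each uninterrupted purine-only or
    pyrimidine-only run of at least min_len bases.

    Hypothetical helper: the name and the default threshold are
    illustrative assumptions, not taken from the paper.
    """
    seq = seq.upper()
    # Purines are A and G; pyrimidines are C and T.
    pattern = re.compile(r"[AG]{%d,}|[CT]{%d,}" % (min_len, min_len))
    tracts = []
    for match in pattern.finditer(seq):
        kind = "purine" if match.group()[0] in "AG" else "pyrimidine"
        # end is exclusive, following Python slice conventions
        tracts.append((match.start(), match.end(), kind))
    return tracts


if __name__ == "__main__":
    example = "ACGT" + "GAAGGAGAAG" + "C" + "TTCTCCTTCT" + "ACGT"
    for start, end, kind in find_homo_tracts(example):
        print(start, end, kind, example[start:end])
```

Because this sketch accepts only perfect tracts, a single interrupting base splits or eliminates a hit, which is the limitation that motivates an approximate, error-tolerant search.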

000: 00000naa a2200000 a 4500

001: bmc12022128

003: CZ-PrNML

005: 20151124100855.0

007: ta

008: 120806s2011 xxk f 000 0#eng||

009: AR

024 7_: $a 10.1093/bioinformatics/btr439 $2 doi

035 __: $a (PubMed)21791534

040 __: $a ABA008 $b cze $d ABA008 $e AACR2

041 0_: $a eng

044 __: $a xxk

100 1_: $a Lexa, Matej, $d 1966- $7 xx0063347 $u Department of Information Technology, Faculty of Informatics, Masaryk University, 60200 Brno, Czech Republic

245 12: $a A dynamic programming algorithm for identification of triplex-forming sequences / $c M. Lexa, T. Martínek, I. Burgetová, D. Kopeček, M. Brázdová

520 9_: $a MOTIVATION: Current methods for identification of potential triplex-forming sequences in genomes and similar sequence sets rely primarily on detecting homopurine and homopyrimidine tracts. Procedures capable of detecting sequences supporting imperfect, but structurally feasible intramolecular triplex structures are needed for better sequence analysis. RESULTS: We modified an algorithm for detection of approximate palindromes, so as to account for the special nature of triplex DNA structures. From available literature, we conclude that approximate triplexes tolerate two classes of errors. One, analogical to mismatches in duplex DNA, involves nucleotides in triplets that do not readily form Hoogsteen bonds. The other class involves geometrically incompatible neighboring triplets hindering proper alignment of strands for optimal hydrogen bonding and stacking. We tested the statistical properties of the algorithm, as well as its correctness when confronted with known triplex sequences. The proposed algorithm satisfactorily detects sequences with intramolecular triplex-forming potential. Its complexity is directly comparable to palindrome searching. AVAILABILITY: Our implementation of the algorithm is available at http://www.fi.muni.cz/lexa/triplex as source code and a web-based search tool. The source code compiles into a library providing searching capability to other programs, as well as into a stand-alone command-line application based on this library. CONTACT: lexa@fi.muni.cz SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

650 _2: $a Algorithms $7 D000465

650 _2: $a Base Pair Mismatch $7 D020137

650 _2: $a Base Sequence $7 D001483

650 _2: $a DNA $x chemistry $x metabolism $7 D004247

650 _2: $a Escherichia coli K12 $x genetics $7 D048168

650 _2: $a Genome $7 D016678

650 _2: $a Humans $7 D006801

650 _2: $a Inverted Repeat Sequences $7 D055029

650 _2: $a Likelihood Functions $7 D016013

650 _2: $a Nucleic Acid Conformation $7 D009690

650 _2: $a Sequence Analysis, DNA $x methods $7 D017422

655 _2: $a Journal Article $7 D016428

655 _2: $a Research Support, Non-U.S. Gov't $7 D013485

700 1_: $a Martínek, Tomáš $u Department of Computer Systems, Faculty of Information Technology, Brno Technical University, 61266 Brno, Czech Republic

700 1_: $a Burgetová, Ivana $u Department of Computer Systems, Faculty of Information Technology, Brno Technical University, 61266 Brno, Czech Republic

700 1_: $a Kopeček, Daniel $u Department of Information Technology, Faculty of Informatics, Masaryk University, 60200 Brno, Czech Republic

700 1_: $a Brázdová, Marie $7 xx0100610 $u Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics, Academy of Sciences of the Czech Republic v.v.i., CZ-61265 Brno, Czech Republic

773 0_: $w MED00008115 $t Bioinformatics (Oxford, England) $x 1367-4811 $g Vol. 27, No. 18 (2011), pp. 2510-2517

856 41: $u https://pubmed.ncbi.nlm.nih.gov/21791534 $y Pubmed

910 __: $a ABA008 $b sig $c sign $y m $z 0

990 __: $a 20120806 $b ABA008

991 __: $a 20151124100923 $b ABA008

999 __: $a ok $b bmc $g 944041 $s 779425

BAS __: $a 3

BAS __: $a PreBMC

BMC __: $a 2011 $b 27 $c 18 $d 2510-2517 $e 20110726 $i 1367-4811 $m Bioinformatics $n Bioinformatics $x MED00008115

LZP __: $b NLK111 $a Pubmed-20120806/12/01
