-
Something wrong with this record ?
SOPanG: online text searching over a pan-genome
A. Cislak, S. Grabowski, J. Holub,
Language English Country Great Britain
Document type Journal Article, Research Support, Non-U.S. Gov't
NLK
Free Medical Journals
from 1996 to 1 year ago
PubMed Central
from 2007
Open Access Digital Library
from 1996-01-01
Medline Complete (EBSCOhost)
from 1998-01-01
Oxford Journals Open Access Collection
from 1985-01-01 to 2022-09-30
Oxford Journals Open Access Collection
from 1985-01-01
ROAD: Directory of Open Access Scholarly Resources
from 1998
- MeSH
- Algorithms MeSH
- Genome * genetics MeSH
- Genomics * methods MeSH
- Internet MeSH
- Software * standards MeSH
- Information Storage and Retrieval MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Motivation: The many thousands of high-quality genomes available now-a-days imply a shift from single genome to pan-genomic analyses. A basic algorithmic building brick for such a scenario is online search over a collection of similar texts, a problem with surprisingly few solutions presented so far. Results: We present SOPanG, a simple tool for exact pattern matching over an elastic-degenerate string, a recently proposed simplified model for the pan-genome. Thanks to bit-parallelism, it achieves pattern matching speeds above 400 MB/s, more than an order of magnitude higher than of other software. Availability and implementation: SOPanG is available for free from: https://github.com/MrAlexSee/sopang. Supplementary information: Supplementary data are available at Bioinformatics online.
Faculty of Information Technology Czech Technical University Prague Czechia
Institute of Applied Computer Science Lodz University of Technology Lódz Poland
References provided by Crossref.org
- 000
- 00000naa a2200000 a 4500
- 001
- bmc19045399
- 003
- CZ-PrNML
- 005
- 20200113082143.0
- 007
- ta
- 008
- 200109s2018 xxk f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1093/bioinformatics/bty506 $2 doi
- 035 __
- $a (PubMed)29939210
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a xxk
- 100 1_
- $a Cislak, Aleksander $u Institute of Applied Computer Science, Lodz University of Technology, Lódz, Poland.
- 245 10
- $a SOPanG: online text searching over a pan-genome / $c A. Cislak, S. Grabowski, J. Holub,
- 520 9_
- $a Motivation: The many thousands of high-quality genomes available now-a-days imply a shift from single genome to pan-genomic analyses. A basic algorithmic building brick for such a scenario is online search over a collection of similar texts, a problem with surprisingly few solutions presented so far. Results: We present SOPanG, a simple tool for exact pattern matching over an elastic-degenerate string, a recently proposed simplified model for the pan-genome. Thanks to bit-parallelism, it achieves pattern matching speeds above 400 MB/s, more than an order of magnitude higher than of other software. Availability and implementation: SOPanG is available for free from: https://github.com/MrAlexSee/sopang. Supplementary information: Supplementary data are available at Bioinformatics online.
- 650 _2
- $a algoritmy $7 D000465
- 650 12
- $a genom $x genetika $7 D016678
- 650 12
- $a genomika $x metody $7 D023281
- 650 _2
- $a ukládání a vyhledávání informací $7 D016247
- 650 _2
- $a internet $7 D020407
- 650 12
- $a software $x normy $7 D012984
- 655 _2
- $a časopisecké články $7 D016428
- 655 _2
- $a práce podpořená grantem $7 D013485
- 700 1_
- $a Grabowski, Szymon $u Institute of Applied Computer Science, Lodz University of Technology, Lódz, Poland.
- 700 1_
- $a Holub, Jan $u Faculty of Information Technology, Czech Technical University in Prague, Czechia.
- 773 0_
- $w MED00008115 $t Bioinformatics (Oxford, England) $x 1367-4811 $g Roč. 34, č. 24 (2018), s. 4290-4292
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/29939210 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y a $z 0
- 990 __
- $a 20200109 $b ABA008
- 991 __
- $a 20200113082515 $b ABA008
- 999 __
- $a ok $b bmc $g 1483668 $s 1084072
- BAS __
- $a 3
- BAS __
- $a PreBMC
- BMC __
- $a 2018 $b 34 $c 24 $d 4290-4292 $e 20181215 $i 1367-4811 $m Bioinformatics $n Bioinformatics $x MED00008115
- LZP __
- $a Pubmed-20200109