-
Something wrong with this record ?
Global analysis of repetitive DNA from unassembled sequence reads using RepeatExplorer2
P. Novák, P. Neumann, J. Macas,
Language English Country Great Britain
Document type Journal Article, Research Support, Non-U.S. Gov't
Grant support
LM2015047
Ministerstvo Školství, Mládeže a Tělovýchovy (Ministry of Education, Youth and Sports) - International
CZ.02.1.01/0.0/0.0/16_013/0001777
Ministerstvo Školství, Mládeže a Tělovýchovy (Ministry of Education, Youth and Sports) - International
- MeSH
- DNA Probes chemistry genetics MeSH
- DNA chemistry genetics MeSH
- Genomics methods MeSH
- Humans MeSH
- Repetitive Sequences, Nucleic Acid MeSH
- Sequence Analysis, DNA methods MeSH
- Cluster Analysis MeSH
- Software MeSH
- DNA Transposable Elements MeSH
- High-Throughput Nucleotide Sequencing methods MeSH
- Animals MeSH
- Check Tag
- Humans MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
RepeatExplorer2 is a novel version of a computational pipeline that uses graph-based clustering of next-generation sequencing reads for characterization of repetitive DNA in eukaryotes. The clustering algorithm facilitates repeat identification in any genome by using relatively small quantities of short sequence reads, and additional tools within the pipeline perform automatic annotation and quantification of the identified repeats. The pipeline is integrated into the Galaxy platform, which provides a user-friendly web interface for script execution and documentation of the results. Compared to the original version of the pipeline, RepeatExplorer2 provides automated annotation of transposable elements, identification of tandem repeats and enhanced visualization of analysis results. Here, we present an overview of the RepeatExplorer2 workflow and provide procedures for its application to (i) de novo repeat identification in a single species, (ii) comparative repeat analysis in a set of species, (iii) development of satellite DNA probes for cytogenetic experiments and (iv) identification of centromeric repeats based on ChIP-seq data. Each procedure takes approximately 2 d to complete. RepeatExplorer2 is available at https://repeatexplorer-elixir.cerit-sc.cz .
References provided by Crossref.org
- 000
- 00000naa a2200000 a 4500
- 001
- bmc20027701
- 003
- CZ-PrNML
- 005
- 20210114152238.0
- 007
- ta
- 008
- 210105s2020 xxk f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1038/s41596-020-0400-y $2 doi
- 035 __
- $a (PubMed)33097925
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a xxk
- 100 1_
- $a Novák, Petr $u Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic.
- 245 10
- $a Global analysis of repetitive DNA from unassembled sequence reads using RepeatExplorer2 / $c P. Novák, P. Neumann, J. Macas,
- 520 9_
- $a RepeatExplorer2 is a novel version of a computational pipeline that uses graph-based clustering of next-generation sequencing reads for characterization of repetitive DNA in eukaryotes. The clustering algorithm facilitates repeat identification in any genome by using relatively small quantities of short sequence reads, and additional tools within the pipeline perform automatic annotation and quantification of the identified repeats. The pipeline is integrated into the Galaxy platform, which provides a user-friendly web interface for script execution and documentation of the results. Compared to the original version of the pipeline, RepeatExplorer2 provides automated annotation of transposable elements, identification of tandem repeats and enhanced visualization of analysis results. Here, we present an overview of the RepeatExplorer2 workflow and provide procedures for its application to (i) de novo repeat identification in a single species, (ii) comparative repeat analysis in a set of species, (iii) development of satellite DNA probes for cytogenetic experiments and (iv) identification of centromeric repeats based on ChIP-seq data. Each procedure takes approximately 2 d to complete. RepeatExplorer2 is available at https://repeatexplorer-elixir.cerit-sc.cz .
- 650 _2
- $a zvířata $7 D000818
- 650 _2
- $a shluková analýza $7 D016000
- 650 _2
- $a DNA $x chemie $x genetika $7 D004247
- 650 _2
- $a DNA sondy $x chemie $x genetika $7 D015342
- 650 _2
- $a transpozibilní elementy DNA $7 D004251
- 650 _2
- $a genomika $x metody $7 D023281
- 650 _2
- $a vysoce účinné nukleotidové sekvenování $x metody $7 D059014
- 650 _2
- $a lidé $7 D006801
- 650 _2
- $a repetitivní sekvence nukleových kyselin $7 D012091
- 650 _2
- $a sekvenční analýza DNA $x metody $7 D017422
- 650 _2
- $a software $7 D012984
- 655 _2
- $a časopisecké články $7 D016428
- 655 _2
- $a práce podpořená grantem $7 D013485
- 700 1_
- $a Neumann, Pavel $u Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic.
- 700 1_
- $a Macas, Jiří $u Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic. macas@umbr.cas.cz.
- 773 0_
- $w MED00159607 $t Nature protocols $x 1750-2799 $g Roč. 15, č. 11 (2020), s. 3745-3776
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/33097925 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y a $z 0
- 990 __
- $a 20210105 $b ABA008
- 991 __
- $a 20210114152236 $b ABA008
- 999 __
- $a ok $b bmc $g 1608036 $s 1118881
- BAS __
- $a 3
- BAS __
- $a PreBMC
- BMC __
- $a 2020 $b 15 $c 11 $d 3745-3776 $e 20201023 $i 1750-2799 $m Nature protocols $n Nat Protoc $x MED00159607
- GRA __
- $a LM2015047 $p Ministerstvo Školství, Mládeže a Tělovýchovy (Ministry of Education, Youth and Sports) $2 International
- GRA __
- $a CZ.02.1.01/0.0/0.0/16_013/0001777 $p Ministerstvo Školství, Mládeže a Tělovýchovy (Ministry of Education, Youth and Sports) $2 International
- LZP __
- $a Pubmed-20210105