RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads
Jazyk angličtina Země Velká Británie, Anglie Médium print-electronic
Typ dokumentu časopisecké články, práce podpořená grantem
PubMed
23376349
DOI
10.1093/bioinformatics/btt054
PII: btt054
Knihovny.cz E-zdroje
- MeSH
- algoritmy MeSH
- DNA chemie MeSH
- Eukaryota genetika MeSH
- fylogeneze MeSH
- genom MeSH
- internet MeSH
- repetitivní sekvence nukleových kyselin * MeSH
- sekvenční analýza DNA * MeSH
- shluková analýza MeSH
- software * MeSH
- vysoce účinné nukleotidové sekvenování * MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- DNA MeSH
MOTIVATION: Repetitive DNA makes up large portions of plant and animal nuclear genomes, yet it remains the least-characterized genome component in most species studied so far. Although the recent availability of high-throughput sequencing data provides necessary resources for in-depth investigation of genomic repeats, its utility is hampered by the lack of specialized bioinformatics tools and appropriate computational resources that would enable large-scale repeat analysis to be run by biologically oriented researchers. RESULTS: Here we present RepeatExplorer, a collection of software tools for characterization of repetitive elements, which is accessible via web interface. A key component of the server is the computational pipeline using a graph-based sequence clustering algorithm to facilitate de novo repeat identification without the need for reference databases of known elements. Because the algorithm uses short sequences randomly sampled from the genome as input, it is ideal for analyzing next-generation sequence reads. Additional tools are provided to aid in classification of identified repeats, investigate phylogenetic relationships of retroelements and perform comparative analysis of repeat composition between multiple species. The server allows to analyze several million sequence reads, which typically results in identification of most high and medium copy repeats in higher plant genomes.
Citace poskytuje Crossref.org
A chromosome-scale reference genome of grasspea (Lathyrus sativus)
Sexy ways: approaches to studying plant sex chromosomes
Cytogenetic Analysis of Satellitome of Madagascar Leaf-Tailed Geckos
Evolution of ancient satellite DNAs in extant alligators and caimans (Crocodylia, Reptilia)
Analysis of 5S rDNA Genomic Organization Through the RepeatExplorer2 Pipeline: A Simplified Protocol
The Role of Repetitive Sequences in Repatterning of Major Ribosomal DNA Clusters in Lepidoptera
The ecology of palm genomes: repeat-associated genome size expansion is constrained by aridity
Genome diploidization associates with cladogenesis, trait disparity, and plastid gene evolution