RepeatExplorer2 is a novel version of a computational pipeline that uses graph-based clustering of next-generation sequencing reads for characterization of repetitive DNA in eukaryotes. The clustering algorithm facilitates repeat identification in any genome by using relatively small quantities of short sequence reads, and additional tools within the pipeline perform automatic annotation and quantification of the identified repeats. The pipeline is integrated into the Galaxy platform, which provides a user-friendly web interface for script execution and documentation of the results. Compared to the original version of the pipeline, RepeatExplorer2 provides automated annotation of transposable elements, identification of tandem repeats and enhanced visualization of analysis results. Here, we present an overview of the RepeatExplorer2 workflow and provide procedures for its application to (i) de novo repeat identification in a single species, (ii) comparative repeat analysis in a set of species, (iii) development of satellite DNA probes for cytogenetic experiments and (iv) identification of centromeric repeats based on ChIP-seq data. Each procedure takes approximately 2 d to complete. RepeatExplorer2 is available at https://repeatexplorer-elixir.cerit-sc.cz .
- MeSH
- DNA Probes chemistry genetics MeSH
- DNA chemistry genetics MeSH
- Genomics methods MeSH
- Humans MeSH
- Repetitive Sequences, Nucleic Acid MeSH
- Sequence Analysis, DNA methods MeSH
- Cluster Analysis MeSH
- Software MeSH
- DNA Transposable Elements MeSH
- High-Throughput Nucleotide Sequencing methods MeSH
- Animals MeSH
- Check Tag
- Humans MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Interleukin 24 (IL-24) is a cytokine with the potential to be an effective treatment for autoimmune diseases and cancer. However, its instability and difficulties in its production have hampered detailed biological and biophysical studies. We approached the challenges of IL-24 production by using the PROSS algorithm to design more stable variants of IL-24. We used homology models built from the sequences and known structures of IL-20 and IL-19 and predicted and produced several extensively mutated IL-24 variants that were highly stable and produced in large yields; one of them was crystallized (IL-24B, PDB ID 6GG1; 3D Interactive at http://proteopedia.org/w/Journal: FEBS_Journal:1). The mutated variants, however, lost most of their binding capacity to the extracellular parts of cognate receptors. While the affinity to the receptor 2 (IL-20R2) was preserved, the variants lost affinity to IL-20R1 and IL-22R1 (shared receptors 1). Back engineering of the variants revealed that reintroduction of a single IL-24 wild-type residue (T198) to the patch interacting with receptors 1 restored 80% of the binding affinity and signaling capacity, accompanied by an acceptable drop in the protein stability by 9 °C. Multiple sequence alignment explains the stabilizing effect of the mutated residues in the IL-24 variants by their presence in the related and more stable cytokines IL-20 and IL-19. Our homology-based approach can enhance existing methods for protein engineering and represents a viable alternative to study and produce difficult proteins for which only in silico structural information is available, estimated as >40% of all important drug targets.