variant annotation
Dotaz
Zobrazit nápovědu
Accurate annotation of genomic variants in human diseases is essential to allow personalized medicine. Assessment of somatic and germline TP53 alterations has now reached the clinic and is required in several circumstances such as the identification of the most effective cancer therapy for patients with chronic lymphocytic leukemia (CLL). Here, we present Seshat, a Web service for annotating TP53 information derived from sequencing data. A flexible framework allows the use of standard file formats such as Mutation Annotation Format (MAF) or Variant Call Format (VCF), as well as common TXT files. Seshat performs accurate variant annotations using the Human Genome Variation Society (HGVS) nomenclature and the stable TP53 genomic reference provided by the Locus Reference Genomic (LRG). In addition, using the 2017 release of the UMD_TP53 database, Seshat provides multiple statistical information for each TP53 variant including database frequency, functional activity, or pathogenicity. The information is delivered in standardized output tables that minimize errors and facilitate comparison of mutational data across studies. Seshat is a beneficial tool to interpret the ever-growing TP53 sequencing data generated by multiple sequencing platforms and it is freely available via the TP53 Website, http://p53.fr or directly at http://vps338341.ovh.net/.
- MeSH
- anotace sekvence MeSH
- databáze genetické * MeSH
- genetická variace genetika MeSH
- genomika trendy MeSH
- internet MeSH
- lidé MeSH
- mutace MeSH
- nádorový supresorový protein p53 genetika MeSH
- software * MeSH
- výpočetní biologie trendy MeSH
- vysoce účinné nukleotidové sekvenování MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Východiska: MikroRNA (miRNA) jsou jednou ze skupin krátkých nekódujících RNA, které na posttranskripční úrovni regulují genovou expresi. Jsou zapojeny do klíčových buněčných procesů a podílejí se na vzniku a progresi mnohých onemocnění. Hladiny miRNA mohou odrážet fyziologický stav organizmů, a proto jsou v rámci biomarkerových studií analyzovány jejich expresní profily. Díky svým vlastnostem se jeví jako slibné diagnostické, prognostické a prediktivní biomarkery nádorových onemocnění. Nedávné studie poukazují na existenci sekvenčních variant miRNA, tzv. izomiRs, které se od anotovaných miRNA liší pozměněnou sekvencí v důsledku posttranskripčních modifikací. Tyto izomiRs mohou vykazovat vyšší zastoupení než některé kanonické miRNA. Bližší studium izomiRs odhaluje jejich nenáhodnou distribuci i odlišné biologické vlastnosti a poukazuje tak na možný biologický význam modifikací sekvence. Přítomnost izomiRs může také zásadně ovlivňovat výsledky biomarkerových studií. V současné době je pozornost zaměřena na jejich možný klinický význam. Cíl: Cílem této práce je poskytnout přehled současných poznatků o sekvenčních variantách miRNA. V práci jsou popsány mechanizmy vzniku izomiRs a vliv heterogenity v sekvenci na stabilitu, funkci a analýzu miRNA. Prostor je věnován také roli izomiRs v biomarkerových studiích.
Background: MicroRNA (miRNA) are a class of short non-coding RNAs that regulate gene expression at the posttranscriptional level. They are involved in key cellular processes and development as well as progression of many diseases. Their levels reflect the physiological state of organisms; therefore, the expression profiles of these molecules are analyzed in biomarker studies. Due to their properties, miRNA appear to be promising diagnostic, prognostic and predictive biomarkers of cancer. Recent studies indicate the existence of sequence variants in miRNA, so-called isomiRs, which differ from the annotated miRNAs by altered sequences due to posttranscriptional modifications. These isomiRs may have a higher abundance than canonical miRNA. The characterization of isomiRs reveals their regulated distribution and different biological properties and thus suggest the possible biological significance of the modifications. The presence of isomiRs can also significantly affect the results of biomarker studies. Currently, the research is focused on their possible clinical significance. Purpose: The aim of this review is to provide an overview of current knowledge about sequence variants in miRNA. The review summarizes the mechanisms of isomiRs biogenesis and describes the effects of sequence heterogeneity on miRNA stability, function and analysis. Subsequently, the role of isomiRs in biomarker studies is discussed.
Human genome sequencing efforts have greatly expanded, and a plethora of missense variants identified both in patients and in the general population is now publicly accessible. Interpretation of the molecular-level effect of missense variants, however, remains challenging and requires a particular investigation of amino acid substitutions in the context of protein structure and function. Answers to questions like 'Is a variant perturbing a site involved in key macromolecular interactions and/or cellular signaling?', or 'Is a variant changing an amino acid located at the protein core or part of a cluster of known pathogenic mutations in 3D?' are crucial. Motivated by these needs, we developed MISCAST (missense variant to protein structure analysis web suite; http://miscast.broadinstitute.org/). MISCAST is an interactive and user-friendly web server to visualize and analyze missense variants in protein sequence and structure space. Additionally, a comprehensive set of protein structural and functional features have been aggregated in MISCAST from multiple databases, and displayed on structures alongside the variants to provide users with the biological context of the variant location in an integrated platform. We further made the annotated data and protein structures readily downloadable from MISCAST to foster advanced offline analysis of missense variants by a wide biological community.
- MeSH
- internet MeSH
- konformace proteinů * MeSH
- lidé MeSH
- missense mutace * MeSH
- proteiny chemie genetika MeSH
- software * MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Genetic variation occurring within conserved functional protein domains warrants special attention when examining DNA variation in the context of disease causation. Here we introduce a resource, freely available at www.prot2hg.com, that addresses the question of whether a particular variant falls onto an annotated protein domain and directly translates chromosomal coordinates onto protein residues. The tool can perform a multiple-site query in a simple way, and the whole dataset is available for download as well as incorporated into our own accessible pipeline. To create this resource, National Center for Biotechnology Information protein data were retrieved using the Entrez Programming Utilities. After processing all human protein domains, residue positions were reverse translated and mapped to the reference genome hg19 and stored in a MySQL database. In total, 760 487 protein domains from 42 371 protein models were mapped to hg19 coordinates and made publicly available for search or download (www.prot2hg.com). In addition, this annotation was implemented into the genomics research platform GENESIS in order to query nearly 8000 exomes and genomes of families with rare Mendelian disorders (tgp-foundation.org). When applied to patient genetic data, we found that rare (<1%) variants in the Genome Aggregation Database were significantly more annotated onto a protein domain in comparison to common (>1%) variants. Similarly, variants described as pathogenic or likely pathogenic in ClinVar were more likely to be annotated onto a domain. In addition, we tested a dataset consisting of 60 causal variants in a cohort of patients with epileptic encephalopathy and found that 71% of them (43 variants) were propagated onto protein domains. In summary, we developed a resource that annotates variants in the coding part of the genome onto conserved protein domains in order to increase variant prioritization efficiency.Database URL: www.prot2hg.com.
- MeSH
- anotace sekvence metody MeSH
- data mining metody MeSH
- databáze genetické * MeSH
- datové kurátorství metody MeSH
- genetická variace * MeSH
- genom lidský genetika MeSH
- genomika metody MeSH
- internet MeSH
- lidé MeSH
- proteinové domény genetika MeSH
- proteiny chemie genetika metabolismus MeSH
- výpočetní biologie metody MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
The yeast Magnusiomyces capitatus is an opportunistic human pathogen causing rare yet severe infections, especially in patients with hematological malignancies. Here, we report the 20.2 megabase genome sequence of an environmental strain of this species as well as the genome sequences of eight additional isolates from human and animal sources providing an insight into intraspecies variation. The distribution of single-nucleotide variants is indicative of genetic recombination events, supporting evidence for sexual reproduction in this heterothallic yeast. Using RNAseq-aided annotation, we identified genes for 6518 proteins including several expanded families such as kexin proteases and Hsp70 molecular chaperones. Several of these families are potentially associated with the ability of M. capitatus to infect and colonize humans. For the purpose of comparative analysis, we also determined the genome sequence of a closely related yeast, Magnusiomyces ingens. The genome sequences of M. capitatus and M. ingens exhibit many distinct features and represent a basis for further comparative and functional studies.
- MeSH
- anotace sekvence MeSH
- antifungální látky farmakologie MeSH
- faktory virulence MeSH
- fenotyp MeSH
- fylogeneze MeSH
- genom fungální * MeSH
- genomika * metody MeSH
- lidé MeSH
- mikrobiální testy citlivosti MeSH
- multigenová rodina MeSH
- mykózy mikrobiologie MeSH
- oportunní infekce mikrobiologie MeSH
- rekombinace genetická MeSH
- Saccharomycetales klasifikace genetika růst a vývoj patogenita MeSH
- výpočetní biologie metody MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
N6-methyladenosine (m6A) and N6,2'-O-dimethyladenosine (m6Am) are two abundant modifications found in mRNAs and ncRNAs that can regulate multiple aspects of RNA biology. They function mainly by regulating interactions with specific RNA-binding proteins. Both modifications are linked to development, disease and stress response. To date, three methyltransferases and two demethylases have been identified that modify adenosines in mammalian mRNAs. Here, we present a comprehensive analysis of the interactomes of these enzymes. PCIF1 protein network comprises mostly factors involved in nascent RNA synthesis by RNA polymerase II, whereas ALKBH5 is closely linked with most aspects of pre-mRNA processing and mRNA export to the cytoplasm. METTL16 resides in subcellular compartments co-inhabited by several other RNA modifiers and processing factors. FTO interactome positions this demethylase at a crossroad between RNA transcription, RNA processing and DNA replication and repair. Altogether, these enzymes share limited spatial interactomes, pointing to specific molecular mechanisms of their regulation.
- MeSH
- adaptorové proteiny signální transdukční genetika metabolismus MeSH
- adenosin analogy a deriváty metabolismus MeSH
- alfa-ketoglutarát-dependentní dioxygenasa, AlkB homolog 5 genetika metabolismus MeSH
- anotace sekvence MeSH
- gen pro FTO genetika metabolismus MeSH
- genetická transkripce MeSH
- genová ontologie MeSH
- HEK293 buňky MeSH
- jaderné proteiny genetika metabolismus MeSH
- lidé MeSH
- mapování interakce mezi proteiny MeSH
- messenger RNA genetika metabolismus MeSH
- methyltransferasy genetika metabolismus MeSH
- N-demethylasy genetika metabolismus MeSH
- nekódující RNA genetika metabolismus MeSH
- oprava DNA MeSH
- protein - isoformy genetika metabolismus MeSH
- replikace DNA MeSH
- vazba proteinů MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
With the advent of precision and genomic medicine, a critical issue is whether a disease gene variant is pathogenic or benign. Such is the case for the three autosomal dominant acute hepatic porphyrias (AHPs), including acute intermittent porphyria, hereditary coproporphyria, and variegate porphyria, each resulting from the half-normal enzymatic activities of hydroxymethylbilane synthase, coproporphyrinogen oxidase, and protoporphyrinogen oxidase, respectively. To date, there is no public database that documents the likely pathogenicity of variants causing the porphyrias, and more specifically, the AHPs with biochemically and clinically verified information. Therefore, an international collaborative with the European Porphyria Network and the National Institutes of Health/National Center for Advancing Translational Sciences/National Institute of Diabetes and Digestive and Kidney Diseases (NIH/NCATS/NIDDK)-sponsored Porphyrias Consortium of porphyria diagnostic experts is establishing an online database that will collate biochemical and clinical evidence verifying the pathogenicity of the published and newly identified variants in the AHP-causing genes. The overall goal of the International Porphyria Molecular Diagnostic Collaborative is to determine the pathogenic and benign variants for all eight porphyrias. Here we describe the overall objectives and the initial efforts to validate pathogenic and benign variants in the respective heme biosynthetic genes causing the AHPs.
- MeSH
- akutní intermitentní porfyrie genetika patofyziologie MeSH
- databáze faktografické MeSH
- datové kurátorství metody MeSH
- jaterní porfyrie genetika patofyziologie MeSH
- lidé MeSH
- molekulární patologie MeSH
- porfobilinogensynthasa nedostatek genetika MeSH
- porfyrie genetika patofyziologie MeSH
- virulence genetika MeSH
- Check Tag
- lidé MeSH
- mužské pohlaví MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
- Geografické názvy
- Spojené státy americké MeSH
Introduction: Case-control analyses have shown BARD1 variants to be associated with up to >2-fold increase in risk of breast cancer, and potentially greater risk of triple negative breast cancer. BARD1 is included in several gene sequencing panels currently marketed for the prediction of risk of cancer, however there are no gene-specific guidelines for the classification of BARD1 variants. We present the most comprehensive assessment of BARD1 messenger RNA splicing, and demonstrate the application of these data for the classification of truncating and splice site variants according to American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP) guidelines. Methods: Nanopore sequencing, short-read RNA-seq (whole transcriptome and targeted), and capillary electrophoresis analysis were performed by four laboratories to investigate alternative BARD1 splicing in blood, breast, and fimbriae/ovary related specimens from non-cancer affected tissues. Splicing data were also collated from published studies of nine different tissues. The impact of the findings for PVS1 annotation was assessed for truncating and splice site variants. Results: We identified 62 naturally occurring alternative spliced BARD1 splicing events, including 19 novel events found by next generation sequencing and/or reverse transcription PCR analysis performed for this study. Quantitative analysis showed that naturally occurring splicing events causing loss of clinically relevant domains or nonsense mediated decay can constitute up to 11.9% of overlapping natural junctions, suggesting that aberrant splicing can be tolerated up to this level. Nanopore sequencing of whole BARD1 transcripts characterized 16 alternative isoforms from healthy controls, revealing that the most complex transcripts combined only two alternative splicing events. Bioinformatic analysis of ClinVar submitted variants at or near BARD1 splice sites suggest that all consensus splice site variants in BARD1 should be considered likely pathogenic, with the possible exception of variants at the donor site of exon 5. Conclusions: No BARD1 candidate rescue transcripts were identified in this study, indicating that all premature translation-termination codons variants can be annotated as PVS1. Furthermore, our analysis suggests that all donor and acceptor (IVS+/-1,2) variants can be considered PVS1 or PVS1_strong, with the exception of variants targeting the exon 5 donor site, that we recommend considering as PVS1_moderate.
- Publikační typ
- časopisecké články MeSH
Two key biological features distinguish Trypanosoma evansi from the T. brucei group: independence from the tsetse fly as obligatory vector, and independence from the need for functional mitochondrial DNA (kinetoplast or kDNA). In an effort to better understand the molecular causes and consequences of these differences, we sequenced the genome of an akinetoplastic T. evansi strain from China and compared it to the T. b. brucei reference strain. The annotated T. evansi genome shows extensive similarity to the reference, with 94.9% of the predicted T. b. brucei coding sequences (CDS) having an ortholog in T. evansi, and 94.6% of the non-repetitive orthologs having a nucleotide identity of 95% or greater. Interestingly, several procyclin-associated genes (PAGs) were disrupted or not found in this T. evansi strain, suggesting a selective loss of function in the absence of the insect life-cycle stage. Surprisingly, orthologous sequences were found in T. evansi for all 978 nuclear CDS predicted to represent the mitochondrial proteome in T. brucei, although a small number of these may have lost functionality. Consistent with previous results, the F1FO-ATP synthase γ subunit was found to have an A281 deletion, which is involved in generation of a mitochondrial membrane potential in the absence of kDNA. Candidates for CDS that are absent from the reference genome were identified in supplementary de novo assemblies of T. evansi reads. Phylogenetic analyses show that the sequenced strain belongs to a dominant group of clonal T. evansi strains with worldwide distribution that also includes isolates classified as T. equiperdum. At least three other types of T. evansi or T. equiperdum have emerged independently. Overall, the elucidation of the T. evansi genome sequence reveals extensive similarity of T. brucei and supports the contention that T. evansi should be classified as a subspecies of T. brucei.
- MeSH
- analýza hlavních komponent MeSH
- fylogeneze * MeSH
- genom protozoální * MeSH
- jednonukleotidový polymorfismus MeSH
- mikrosatelitní repetice MeSH
- protozoální proteiny genetika metabolismus MeSH
- regulace genové exprese MeSH
- Trypanosoma klasifikace genetika MeSH
- trypanosomové variantní povrchové glykoproteiny genetika metabolismus MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Research Support, N.I.H., Extramural MeSH
The mechanisms by which myelodysplastic syndrome (MDS) cells resist the effects of hypomethylating agents (HMA) are currently the subject of intensive research. A better understanding of mechanisms by which the MDS cell becomes to tolerate HMA and progresses to acute myeloid leukemia (AML) requires the development of new cellular models. From MDS/AML cell lines we developed a model of 5-azacytidine (AZA) resistance whose stability was validated by a transplantation approach into immunocompromised mice. When investigating mRNA expression and DNA variants of the AZA resistant phenotype we observed deregulation of several cancer-related pathways including the phosphatidylinosito-3 kinase signaling. We have further shown that these pathways can be modulated by specific inhibitors that, while blocking the proliferation of AZA resistant cells, are unable to increase their sensitivity to AZA. Our data reveal a set of molecular mechanisms that can be targeted to expand therapeutic options during progression on AZA therapy.
- MeSH
- anotace sekvence MeSH
- azacytidin farmakologie MeSH
- biologické modely * MeSH
- chemorezistence * účinky léků genetika MeSH
- DNA nádorová genetika MeSH
- fosfatidylinositol-3-kinasy metabolismus MeSH
- myši SCID MeSH
- myši MeSH
- protoonkogenní proteiny c-akt metabolismus MeSH
- reprodukovatelnost výsledků MeSH
- signální transdukce účinky léků MeSH
- transkriptom genetika MeSH
- zvířata MeSH
- Check Tag
- myši MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH