JavaScript is NOT enabled !

Please enable JavaScript.

* Show help

Reset

MeSH: Datasets as Topic

48 hits in BMC Filters

Online article

The Allele Catalog Tool: a web-based interactive tool for allele discovery and analysis

Chan, Yen On
Author Chan, Yen On MU Institute for Data Science and Informatics, University of Missouri-Columbia, Columbia, MO, USA Christopher S. Bond Life Sciences Center, University of Missouri-Columbia, Columbia, MO, USA
Dietz, Nicholas
Author Dietz, Nicholas Division of Plant Science and Technology, University of Missouri-Columbia, Columbia, MO, USA
Zeng, Shuai
Author Zeng, Shuai Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, USA
Wang, Juexin
Author Wang, Juexin Christopher S. Bond Life Sciences Center, University of Missouri-Columbia, Columbia, MO, USA Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, USA
Flint-Garcia, Sherry
Author Flint-Garcia, Sherry United States Department of Agriculture-Agricultural Research Service, Plant Genetics Research Unit, Columbia, MO, USA
Salazar-Vidal, M Nancy
Author Salazar-Vidal, M Nancy Division of Plant Science and Technology, University of Missouri-Columbia, Columbia, MO, USA Department of Evolution and Ecology, University of California-Davis, Davis, CA, USA
Škrabišová, Mária
Author Škrabišová, Mária Department of Biochemistry, Faculty of Science, Palacky University in Olomouc, Olomouc, Czech Republic
Bilyeu, Kristin
Author Bilyeu, Kristin United States Department of Agriculture-Agricultural Research Service, Plant Genetics Research Unit, Columbia, MO, USA. kristin.bilyeu@usda.gov
Joshi, Trupti
Author Joshi, Trupti ORCID MU Institute for Data Science and Informatics, University of Missouri-Columbia, Columbia, MO, USA. Joshitr@missouri.edu Christopher S. Bond Life Sciences Center, University of Missouri-Columbia, Columbia, MO, USA. Joshitr@missouri.edu Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, USA. Joshitr@missouri.edu Department of Health Management and Informatics, University of Missouri-Columbia, Columbia, MO, USA. Joshitr@missouri.edu

BMC genomics. 2023 ; 24 (1) : 107. [pub] 20230310

BMC Genomics
ISSN 1471-2164
Medvik
Source

BACKGROUND: The advancement of sequencing technologies today has made a plethora of whole-genome re-sequenced (WGRS) data publicly available. However, research utilizing the WGRS data without further configuration is nearly impossible. To solve this problem, our research group has developed an interactive Allele Catalog Tool to enable researchers to explore the coding region allelic variation present in over 1,000 re-sequenced accessions each for soybean, Arabidopsis, and maize. RESULTS: The Allele Catalog Tool was designed originally with soybean genomic data and resources. The Allele Catalog datasets were generated using our variant calling pipeline (SnakyVC) and the Allele Catalog pipeline (AlleleCatalog). The variant calling pipeline is developed to parallelly process raw sequencing reads to generate the Variant Call Format (VCF) files, and the Allele Catalog pipeline takes VCF files to perform imputations, functional effect predictions, and assemble alleles for each gene to generate curated Allele Catalog datasets. Both pipelines were utilized to generate the data panels (VCF files and Allele Catalog files) in which the accessions of the WGRS datasets were collected from various sources, currently representing over 1,000 diverse accessions for soybean, Arabidopsis, and maize individually. The main features of the Allele Catalog Tool include data query, visualization of results, categorical filtering, and download functions. Queries are performed from user input, and results are a tabular format of summary results by categorical description and genotype results of the alleles for each gene. The categorical information is specific to each species; additionally, available detailed meta-information is provided in modal popups. The genotypic information contains the variant positions, reference or alternate genotypes, the functional effect classes, and the amino-acid changes of each accession. Besides that, the results can also be downloaded for other research purposes. CONCLUSIONS: The Allele Catalog Tool is a web-based tool that currently supports three species: soybean, Arabidopsis, and maize. The Soybean Allele Catalog Tool is hosted on the SoyKB website ( https://soykb.org/SoybeanAlleleCatalogTool/ ), while the Allele Catalog Tool for Arabidopsis and maize is hosted on the KBCommons website ( https://kbcommons.org/system/tools/AlleleCatalogTool/Zmays and https://kbcommons.org/system/tools/AlleleCatalogTool/Athaliana ). Researchers can use this tool to connect variant alleles of genes with meta-information of species.

Article

Reductionist Pathways for Parasitism in Euglenozoans? Expanded Datasets Provide New Insights

Trends in parasitology. 2021 ; 37 (2) : 100-116. [pub] 20201027

Trends Parasitol
ISSN 1471-5007
Medvik
Source

The unicellular trypanosomatids belong to the phylum Euglenozoa and all known species are obligate parasites. Distinct lineages infect plants, invertebrates, and vertebrates, including humans. Genome data for marine diplonemids, together with freshwater euglenids and free-living kinetoplastids, the closest known nonparasitic relatives to trypanosomatids, recently became available. Robust phylogenetic reconstructions across Euglenozoa are now possible and place the results of parasite-focused studies into an evolutionary context. Here we discuss recent advances in identifying the factors shaping the evolution of Euglenozoa, focusing on ancestral features generally considered parasite-specific. Remarkably, most of these predate the transition(s) to parasitism, suggesting that the presence of certain preconditions makes a significant lifestyle change more likely.

Online article

Independent dynamics of low, intermediate, and high frequency spectral intracranial EEG activities during human memory formation

Neuroimage. 2021 ; 245 (-) : 118637. [pub] 20211010

ISSN 1095-9572
Medvik
Source

A wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various frequency ranges are coordinated across the space of the human cortex and time of memory processing is inconclusive. They can either be coordinated together across the frequency spectrum at the same cortical site and time or induced independently in particular bands. We used a large dataset of human intracranial electroencephalography (iEEG) to parse the spatiotemporal dynamics of spectral activities induced during formation of verbal memories. Encoding of words for subsequent free recall activated low frequency theta, intermediate frequency alpha and beta, and high frequency gamma power in a mosaic pattern of discrete cortical sites. A majority of the cortical sites recorded activity in only one of these frequencies, except for the visual cortex where spectral power was induced across multiple bands. Each frequency band showed characteristic dynamics of the induced power specific to cortical area and hemisphere. The power of the low, intermediate, and high frequency activities propagated in independent sequences across the visual, temporal and prefrontal cortical areas throughout subsequent phases of memory encoding. Our results provide a holistic, simplified model of the spectral activities engaged in the formation of human memory, suggesting an anatomically and temporally distributed mosaic of coordinated brain rhythms.

MeSH
Datasets as Topic MeSH
Adult MeSH
Electroencephalography methods MeSH
Epilepsy diagnostic imaging surgery MeSH
Humans MeSH
Magnetic Resonance Imaging MeSH
Memory physiology MeSH
Tomography, X-Ray Computed MeSH
Check Tag
Adult MeSH
Humans MeSH
Male MeSH
Female MeSH
Publication type
Journal Article MeSH
Multicenter Study MeSH
Research Support, Non-U.S. Gov't MeSH
Research Support, U.S. Gov't, Non-P.H.S. MeSH

Online article

Establishing community reference samples, data and call sets for benchmarking cancer mutation detection using whole-genome sequencing

Nature biotechnology. 2021 ; 39 (9) : 1151-1160. [pub] 20210909

Nat Biotechnol
ISSN 1546-1696
Medvik
Source

The lack of samples for generating standardized DNA datasets for setting up a sequencing pipeline or benchmarking the performance of different algorithms limits the implementation and uptake of cancer genomics. Here, we describe reference call sets obtained from paired tumor-normal genomic DNA (gDNA) samples derived from a breast cancer cell line-which is highly heterogeneous, with an aneuploid genome, and enriched in somatic alterations-and a matched lymphoblastoid cell line. We partially validated both somatic mutations and germline variants in these call sets via whole-exome sequencing (WES) with different sequencing platforms and targeted sequencing with >2,000-fold coverage, spanning 82% of genomic regions with high confidence. Although the gDNA reference samples are not representative of primary cancer cells from a clinical sample, when setting up a sequencing pipeline, they not only minimize potential biases from technologies, assays and informatics but also provide a unique resource for benchmarking 'tumor-only' or 'matched tumor-normal' analyses.

MeSH
Benchmarking * MeSH
Datasets as Topic MeSH
Humans MeSH
Mutation MeSH
DNA Mutational Analysis standards MeSH
Cell Line, Tumor MeSH
Breast Neoplasms genetics MeSH
Reference Standards MeSH
Reproducibility of Results MeSH
Whole Genome Sequencing standards MeSH
High-Throughput Nucleotide Sequencing standards MeSH
Germ Cells MeSH
Check Tag
Humans MeSH
Publication type
Journal Article MeSH
Research Support, Non-U.S. Gov't MeSH
Research Support, N.I.H., Extramural MeSH

Online article

Tools for time-course simulation in systems biology: a brief overview

Briefings in bioinformatics. 2021 ; 22 (5) : . [pub] 20210902

Brief Bioinform
ISSN 1477-4054
Medvik
Source

Dynamic modeling of biological systems is essential for understanding all properties of a given organism as it allows us to look not only at the static picture of an organism but also at its behavior under various conditions. With the increasing amount of experimental data, the number of tools that enable dynamic analysis also grows. However, various tools are based on different approaches, use different types of data and offer different functions for analyses; so it can be difficult to choose the most suitable tool for a selected type of model. Here, we bring a brief overview containing descriptions of 50 tools for the reconstruction of biological models, their time-course simulation and dynamic analysis. We examined each tool using test data and divided them based on the qualitative and quantitative nature of the mathematical apparatus they use.

Online article

Common variants in Alzheimer's disease and risk stratification by polygenic risk scores

Nature communications. 2021 ; 12 (1) : 3417. [pub] 20210607

Nat Commun
ISSN 2041-1723
Medvik
Source

Genetic discoveries of Alzheimer's disease are the drivers of our understanding, and together with polygenetic risk stratification can contribute towards planning of feasible and efficient preventive and curative clinical trials. We first perform a large genetic association study by merging all available case-control datasets and by-proxy study results (discovery n = 409,435 and validation size n = 58,190). Here, we add six variants associated with Alzheimer's disease risk (near APP, CHRNE, PRKD3/NDUFAF7, PLCG2 and two exonic variants in the SHARPIN gene). Assessment of the polygenic risk score and stratifying by APOE reveal a 4 to 5.5 years difference in median age at onset of Alzheimer's disease patients in APOE ɛ4 carriers. Because of this study, the underlying mechanisms of APP can be studied to refine the amyloid cascade and the polygenic risk score provides a tool to select individuals at high risk of Alzheimer's disease.

MeSH
Alzheimer Disease epidemiology genetics pathology MeSH
Amyloid beta-Protein Precursor genetics metabolism MeSH
Apolipoproteins E genetics MeSH
Genome-Wide Association Study MeSH
Datasets as Topic MeSH
Genetic Predisposition to Disease MeSH
Heterozygote MeSH
Risk Assessment methods MeSH
Polymorphism, Single Nucleotide MeSH
Cohort Studies MeSH
Middle Aged MeSH
Humans MeSH
Multifactorial Inheritance * MeSH
Follow-Up Studies MeSH
Risk Factors MeSH
Aged, 80 and over MeSH
Aged MeSH
Case-Control Studies MeSH
Age of Onset MeSH
Check Tag
Middle Aged MeSH
Humans MeSH
Male MeSH
Aged, 80 and over MeSH
Aged MeSH
Female MeSH
Publication type
Journal Article MeSH
Research Support, Non-U.S. Gov't MeSH
Research Support, N.I.H., Extramural MeSH
Validation Study MeSH

Online article

Development and Validation of a Simplified Score to Predict Early Relapse in Newly Diagnosed Multiple Myeloma in a Pooled Dataset of 2,190 Patients

Clinical cancer research. 2021 ; 27 (13) : 3695-3703. [pub] 20210429

Clin Cancer Res
ISSN 1557-3265
Medvik
Source

PURPOSE: Despite the improvement of therapeutic regimens, several patients with multiple myeloma (MM) still experience early relapse (ER). This subset of patients currently represents an unmet medical need. EXPERIMENTAL DESIGN: We pooled data from seven European multicenter phase II/III clinical trials enrolling 2,190 patients with newly diagnosed MM from 2003 to 2017. Baseline patient evaluation included 14 clinically relevant features. Patients with complete data (n = 1,218) were split into training (n = 844) and validation sets (n = 374). In the training set, a univariate analysis and a multivariate logistic regression model on ER within 18 months (ER18) were made. The most accurate model was selected on the validation set. We also developed a dynamic version of the score by including response to treatment. RESULTS: The Simplified Early Relapse in Multiple Myeloma (S-ERMM) score was modeled on six features weighted by a score: 5 points for high lactate dehydrogenase or t(4;14); 3 for del17p, abnormal albumin, or bone marrow plasma cells >60%; and 2 for λ free light chain. The S-ERMM identified three patient groups with different risks of ER18: Intermediate (Int) versus Low (OR = 2.39, P < 0.001) and High versus Low (OR = 5.59, P < 0.001). S-ERMM High/Int patients had significantly shorter overall survival (High vs. Low: HR = 3.24, P < 0.001; Int vs. Low: HR = 1.86, P < 0.001) and progression-free survival-2 (High vs. Low: HR = 2.89, P < 0.001; Int vs. Low: HR = 1.76, P < 0.001) than S-ERMM Low. The Dynamic S-ERMM (DS-ERMM) modulated the prognostic power of the S-ERMM. CONCLUSIONS: On the basis of simple, widely available baseline features, the S-ERMM and DS-ERMM properly identified patients with different risks of ER and survival outcomes.

Online article

FireProtDB: database of manually curated protein stability data

Nucleic acids research. 2021 ; 49 (D1) : D319-D324. [pub] 20210108

Nucleic Acids Res
ISSN 1362-4962
Medvik
Source

The majority of naturally occurring proteins have evolved to function under mild conditions inside the living organisms. One of the critical obstacles for the use of proteins in biotechnological applications is their insufficient stability at elevated temperatures or in the presence of salts. Since experimental screening for stabilizing mutations is typically laborious and expensive, in silico predictors are often used for narrowing down the mutational landscape. The recent advances in machine learning and artificial intelligence further facilitate the development of such computational tools. However, the accuracy of these predictors strongly depends on the quality and amount of data used for training and testing, which have often been reported as the current bottleneck of the approach. To address this problem, we present a novel database of experimental thermostability data for single-point mutants FireProtDB. The database combines the published datasets, data extracted manually from the recent literature, and the data collected in our laboratory. Its user interface is designed to facilitate both types of the expected use: (i) the interactive explorations of individual entries on the level of a protein or mutation and (ii) the construction of highly customized and machine learning-friendly datasets using advanced searching and filtering. The database is freely available at https://loschmidt.chemi.muni.cz/fireprotdb.

MeSH
Molecular Sequence Annotation MeSH
Point Mutation * MeSH
Databases, Protein * MeSH
Datasets as Topic MeSH
Internet MeSH
Models, Molecular MeSH
Proteins chemistry genetics MeSH
Software MeSH
Protein Stability MeSH
Machine Learning statistics & numerical data MeSH
Computational Biology methods MeSH
Publication type
Journal Article MeSH
Research Support, Non-U.S. Gov't MeSH

Online article

Cumulative Burden of Colorectal Cancer-Associated Genetic Variants Is More Strongly Associated With Early-Onset vs Late-Onset Cancer

Gastroenterology. 2020 ; 158 (5) : 1274-1286.e12. [pub] 20191219

ISSN 1528-0012
Medvik
Source

BACKGROUND & AIMS: Early-onset colorectal cancer (CRC, in persons younger than 50 years old) is increasing in incidence; yet, in the absence of a family history of CRC, this population lacks harmonized recommendations for prevention. We aimed to determine whether a polygenic risk score (PRS) developed from 95 CRC-associated common genetic risk variants was associated with risk for early-onset CRC. METHODS: We studied risk for CRC associated with a weighted PRS in 12,197 participants younger than 50 years old vs 95,865 participants 50 years or older. PRS was calculated based on single nucleotide polymorphisms associated with CRC in a large-scale genome-wide association study as of January 2019. Participants were pooled from 3 large consortia that provided clinical and genotyping data: the Colon Cancer Family Registry, the Colorectal Transdisciplinary Study, and the Genetics and Epidemiology of Colorectal Cancer Consortium and were all of genetically defined European descent. Findings were replicated in an independent cohort of 72,573 participants. RESULTS: Overall associations with CRC per standard deviation of PRS were significant for early-onset cancer, and were stronger compared with late-onset cancer (P for interaction = .01); when we compared the highest PRS quartile with the lowest, risk increased 3.7-fold for early-onset CRC (95% CI 3.28-4.24) vs 2.9-fold for late-onset CRC (95% CI 2.80-3.04). This association was strongest for participants without a first-degree family history of CRC (P for interaction = 5.61 × 10-5). When we compared the highest with the lowest quartiles in this group, risk increased 4.3-fold for early-onset CRC (95% CI 3.61-5.01) vs 2.9-fold for late-onset CRC (95% CI 2.70-3.00). Sensitivity analyses were consistent with these findings. CONCLUSIONS: In an analysis of associations with CRC per standard deviation of PRS, we found the cumulative burden of CRC-associated common genetic variants to associate with early-onset cancer, and to be more strongly associated with early-onset than late-onset cancer, particularly in the absence of CRC family history. Analyses of PRS, along with environmental and lifestyle risk factors, might identify younger individuals who would benefit from preventive measures.

MeSH
Medical History Taking MeSH
Genome-Wide Association Study MeSH
Datasets as Topic MeSH
Genetic Predisposition to Disease * MeSH
Genotyping Techniques MeSH
Polymorphism, Single Nucleotide MeSH
Cohort Studies MeSH
Colorectal Neoplasms genetics MeSH
Middle Aged MeSH
Humans MeSH
DNA Mutational Analysis MeSH
Mutation Rate MeSH
Risk Factors MeSH
Whole Genome Sequencing MeSH
Case-Control Studies MeSH
Age of Onset MeSH
Life Style MeSH
Check Tag
Middle Aged MeSH
Humans MeSH
Male MeSH
Female MeSH
Publication type
Journal Article MeSH
Multicenter Study MeSH
Observational Study MeSH
Research Support, Non-U.S. Gov't MeSH
Research Support, N.I.H., Extramural MeSH
Research Support, N.I.H., Intramural MeSH
Research Support, U.S. Gov't, P.H.S. MeSH

Online article

Typicality of functional connectivity robustly captures motion artifacts in rs-fMRI across datasets, atlases, and preprocessing pipelines

Human brain mapping. 2020 ; 41 (18) : 5325-5340. [pub] 20200902

Hum Brain Mapp
ISSN 1097-0193
Medvik
Source

Functional connectivity analysis of resting-state fMRI data has recently become one of the most common approaches to characterizing individual brain function. It has been widely suggested that the functional connectivity matrix is a useful approximate representation of the brain's connectivity, potentially providing behaviorally or clinically relevant markers. However, functional connectivity estimates are known to be detrimentally affected by various artifacts, including those due to in-scanner head motion. Moreover, as individual functional connections generally covary only very weakly with head motion estimates, motion influence is difficult to quantify robustly, and prone to be neglected in practice. Although the use of individual estimates of head motion, or group-level correlation of motion and functional connectivity has been suggested, a sufficiently sensitive measure of individual functional connectivity quality has not yet been established. We propose a new intuitive summary index, Typicality of Functional Connectivity, to capture deviations from standard brain functional connectivity patterns. In a resting-state fMRI dataset of 245 healthy subjects, this measure was significantly correlated with individual head motion metrics. The results were further robustly reproduced across atlas granularity, preprocessing options, and other datasets, including 1,081 subjects from the Human Connectome Project. In principle, Typicality of Functional Connectivity should be sensitive also to other types of artifacts, processing errors, and possibly also brain pathology, allowing extensive use in data quality screening and quantification in functional connectivity studies as well as methodological investigations.

MeSH
Artifacts MeSH
Atlases as Topic * MeSH
Datasets as Topic * MeSH
Adult MeSH
Head Movements MeSH
Connectome * methods standards MeSH
Humans MeSH
Magnetic Resonance Imaging * methods standards MeSH
Young Adult MeSH
Brain diagnostic imaging physiology MeSH
Image Processing, Computer-Assisted * methods standards MeSH
Check Tag
Adult MeSH
Humans MeSH
Young Adult MeSH
Male MeSH
Female MeSH
Publication type
Journal Article MeSH
Research Support, Non-U.S. Gov't MeSH

Collections

Published

Filters

* Show help

* Show help

Refine by MeSH