unsupervised learning
Dotaz
Zobrazit nápovědu
Research in Artificial Intelligence (AI) has focused mostly on two extremes: either on small improvements in narrow AI domains, or on universal theoretical frameworks which are often uncomputable, or lack practical implementations. In this paper we attempt to follow a big picture view while also providing a particular theory and its implementation to present a novel, purposely simple, and interpretable hierarchical architecture. This architecture incorporates the unsupervised learning of a model of the environment, learning the influence of one's own actions, model-based reinforcement learning, hierarchical planning, and symbolic/sub-symbolic integration in general. The learned model is stored in the form of hierarchical representations which are increasingly more abstract, but can retain details when needed. We demonstrate the universality of the architecture by testing it on a series of diverse environments ranging from audio/visual compression to discrete and continuous action spaces, to learning disentangled representations.
- MeSH
- algoritmy MeSH
- lidé MeSH
- neuronové sítě MeSH
- posilování (psychologie) MeSH
- strojové učení bez učitele MeSH
- učení fyziologie MeSH
- umělá inteligence * MeSH
- životní prostředí * MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
The academic curriculum has shown to promote sedentary behavior in college students. This study aimed to profile the physical fitness of physical education majors using unsupervised machine learning and to identify the differences between sexes, academic years, socioeconomic strata, and the generated profiles. A total of 542 healthy and physically active students (445 males, 97 females; 19.8 [2.2] years; 66.0 [10.3] kg; 169.5 [7.8] cm) participated in this cross-sectional study. Their indirect VO2max (Cooper and Shuttle-Run 20 m tests), lower-limb power (horizontal jump), sprint (30 m), agility (shuttle run), and flexibility (sit-and-reach) were assessed. The participants were profiled using clustering algorithms after setting the optimal number of clusters through an internal validation using R packages. Non-parametric tests were used to identify the differences (p < 0.05). The higher percentage of the population were freshmen (51.4%) and middle-income (64.0%) students. Seniors and juniors showed a better physical fitness than first-year students. No significant differences were found between their socioeconomic strata (p > 0.05). Two profiles were identified using hierarchical clustering (Cluster 1 = 318 vs. Cluster 2 = 224). The matching analysis revealed that physical fitness explained the variation in the data, with Cluster 2 as a sex-independent and more physically fit group. All variables differed significantly between the sexes (except the body mass index [p = 0.218]) and the generated profiles (except stature [p = 0.559] and flexibility [p = 0.115]). A multidimensional analysis showed that the body mass, cardiorespiratory fitness, and agility contributed the most to the data variation so that they can be used as profiling variables. This profiling method accurately identified the relevant variables to reinforce exercise recommendations in a low physical performance and overweight majors.
- Klíčová slova
- cardiorespiratory fitness, muscle power, physical endurance, range of motion, sprint speed, unsupervised machine learning,
- MeSH
- cvičení MeSH
- index tělesné hmotnosti MeSH
- lidé MeSH
- průřezové studie MeSH
- strojové učení bez učitele * MeSH
- tělesná výchova * MeSH
- tělesná výkonnost MeSH
- Check Tag
- lidé MeSH
- mužské pohlaví MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
PURPOSE: A supervised deep learning (DL) approach for frequency and phase correction (FPC) of MRS data recently showed encouraging results, but obtaining transients with labels for supervised learning is challenging. This work investigates the feasibility and efficiency of unsupervised deep learning-based FPC. METHODS: Two novel deep learning-based FPC methods (deep learning-based Cr referencing and deep learning-based spectral registration), which use a priori physics domain knowledge, are presented. The proposed networks were trained, validated, and evaluated using simulated, phantom, and publicly accessible in vivo MEGA-edited MRS data. The performance of our proposed FPC methods was compared with other generally used FPC methods, in terms of precision and time efficiency. A new measure was proposed in this study to evaluate the FPC method performance. The ability of each of our methods to carry out FPC at varying SNR levels was evaluated. A Monte Carlo study was carried out to investigate the performance of our proposed methods. RESULTS: The validation using low-SNR manipulated simulated data demonstrated that the proposed methods could perform FPC comparably with other methods. The evaluation showed that the deep learning-based spectral registration over a limited frequency range method achieved the highest performance in phantom data. The applicability of the proposed method for FPC of GABA-edited in vivo MRS data was demonstrated. Our proposed networks have the potential to reduce computation time significantly. CONCLUSIONS: The proposed physics-informed deep neural networks trained in an unsupervised manner with complex data can offer efficient FPC of large MRS data in a shorter time.
- Klíčová slova
- MR spectroscopy, deep learning, edited MRS, frequency correction, phase correction,
- MeSH
- deep learning * MeSH
- fantomy radiodiagnostické MeSH
- metoda Monte Carlo MeSH
- neuronové sítě MeSH
- počítačové zpracování obrazu metody MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Bioelectrical impedance analysis (BIA) was established to quantify diverse cellular characteristics. This technique has been widely used in various species, such as fish, poultry, and humans for compositional analysis. This technology was limited to offline quality assurance/detection of woody breast (WB); however, inline technology that can be retrofitted on the conveyor belt would be more helpful to processors. Freshly deboned (n = 80) chicken breast fillets were collected from a local processor and analyzed by hand-palpation for different WB severity levels. Data collected from both BIA setups were subjected to supervised and unsupervised learning algorithms. The modified BIA showed better detection ability for regular fillets than the probe BIA setup. In the plate BIA setup, fillets were 80.00% for normal, 66.67% for moderate (data for mild and moderate merged), and 85.00% for severe WB. However, hand-held BIA showed 77.78, 85.71, and 88.89% for normal, moderate, and severe WB, respectively. Plate BIA setup is more effective in detecting WB myopathies and could be installed without slowing the processing line. Breast fillet detection on the processing line can be significantly improved using a modified automated plate BIA.
- Klíčová slova
- bioelectrical impedance, hand palpation, in-line processing, supervised learning, unsupervised learning, woody breast,
- Publikační typ
- časopisecké články MeSH
Identification of active electrodes that record task-relevant neurophysiological activity is needed for clinical and industrial applications as well as for investigating brain functions. We developed an unsupervised, fully automated approach to classify active electrodes showing event-related intracranial EEG (iEEG) responses from 115 patients performing a free recall verbal memory task. Our approach employed new interpretable metrics that quantify spectral characteristics of the normalized iEEG signal based on power-in-band and synchrony measures. Unsupervised clustering of the metrics identified distinct sets of active electrodes across different subjects. In the total population of 11,869 electrodes, our method achieved 97% sensitivity and 92.9% specificity with the most efficient metric. We validated our results with anatomical localization revealing significantly greater distribution of active electrodes in brain regions that support verbal memory processing. We propose our machine-learning framework for objective and efficient classification and interpretation of electrophysiological signals of brain activities supporting memory and cognition.
- MeSH
- algoritmy MeSH
- biomedicínské inženýrství metody trendy MeSH
- datové soubory jako téma MeSH
- elektroencefalografie metody MeSH
- elektrofyziologické jevy MeSH
- elektrokortikografie * metody MeSH
- epilepsie diagnóza patofyziologie psychologie MeSH
- evokované potenciály fyziologie MeSH
- implantované elektrody * MeSH
- kognice fyziologie MeSH
- krátkodobá paměť fyziologie MeSH
- lidé MeSH
- mapování mozku metody MeSH
- mozek diagnostické zobrazování fyziologie MeSH
- plnění a analýza úkolů * MeSH
- retrospektivní studie MeSH
- senzitivita a specificita MeSH
- strojové učení bez učitele * MeSH
- verbální chování fyziologie MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH
- validační studie MeSH
Acute heart failure (AHF) is a life-threatening, heterogeneous disease requiring urgent diagnosis and treatment. The clinical severity and medical procedures differ according to a complex interplay between the deterioration cause, underlying cardiac substrate, and comorbidities. This study aimed to analyze the natural phenotypic heterogeneity of the AHF population and evaluate the possibilities offered by clustering (unsupervised machine-learning technique) in a medical data assessment. We evaluated data from 381 AHF patients. Sixty-three clinical and biochemical features were assessed at the admission of the patients and were included in the analysis after the preprocessing. The K-medoids algorithm was implemented to create the clusters, and optimization, based on the Davies-Bouldin index, was used. The clustering was performed while blinded to the outcome. The outcome associations were evaluated using the Kaplan-Meier curves and Cox proportional-hazards regressions. The algorithm distinguished six clusters that differed significantly in 58 variables concerning i.e., etiology, clinical status, comorbidities, laboratory parameters and lifestyle factors. The clusters differed in terms of the one-year mortality (p = 0.002). Using the clustering techniques, we extracted six phenotypes from AHF patients with distinct clinical characteristics and outcomes. Our results can be valuable for future trial constructions and customized treatment.
- Klíčová slova
- acute heart failure, clustering, machine learning,
- Publikační typ
- časopisecké články MeSH
This study explores the communication patterns of Slovak banks with stakeholders through mandatory disclosures mandated by Basel III's Pillar 3 framework and annual reports in 2007-2022. Our primary objective is to identify key topics communicated by banks and analysing the sentiment of this communication during turbulent periods (i.e., alternating periods of stability and crisis) in 2007-2022. Textual data was collected from Pillar 3 disclosures, annual reports, and additional regulatory reports. A hybrid model was developed to extract the most important keywords from each collected document chapter. This hybrid model (model combining multiple approaches) combines elements of statistical approaches to keyword extraction, (keyword frequency dictionary), linguistic approaches (pair-of-speech tagging in order to select noun-phrases), and machine-learning based approaches (BERT) to extract meaningful keywords. Subsequently, a sentiment analysis was performed on the extracted keywords using a Loughran-McDonald lexicon (list of words labelled with sentiment) specially designed for financial texts. Based on the adjusted univariate results, we can reject the global null hypothesis of independence of the sentiment category of keywords from time for negative sentiment at p = 0.0000 for positive sentiment at p = 0.0005, and for neutral sentiment at p = 0.0000 significant level. The multilevel comparison revealed that negative sentiment was most frequent during the global financial crisis and the COVID-19 pandemic, likely impacting stakeholder confidence and trust. Conversely, positive sentiment dominated during periods of financial stability, potentially enhancing stakeholder satisfaction and investment decisions. This research points out that the sentiment of the selected commercial bank documents changes depending on the years. A commercial bank can use this knowledge and include sentiment information as predictors when modelling financial distress. For bank management of selected commercial bank the examined documents are an important communication tool, the wording of which can have a significant impact on stakeholder behaviour towards the bank, their styling is very important.
- MeSH
- COVID-19 epidemiologie MeSH
- komunikace * MeSH
- lidé MeSH
- strojové učení bez učitele * MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
Current studies of gene × air pollution interaction typically seek to identify unknown heritability of common complex illnesses arising from variability in the host's susceptibility to environmental pollutants of interest. Accordingly, a single component generalized linear models are often used to model the risk posed by an environmental exposure variable of interest in relation to a priori determined DNA variants. However, reducing the phenotypic heterogeneity may further optimize such approach, primarily represented by the modeled DNA variants. Here, we reduce phenotypic heterogeneity of asthma severity, and also identify single nucleotide polymorphisms (SNP) associated with phenotype subgroups. Specifically, we first apply an unsupervised learning algorithm method and a non-parametric regression to find a biclustering structure of children according to their allergy and asthma severity. We then identify a set of SNPs most closely correlated with each sub-group. We subsequently fit a logistic regression model for each group against the healthy controls using benzo[a]pyrene (B[a]P) as a representative airborne carcinogen. Application of such approach in a case-control data set shows that SNP clustering may help to partly explain heterogeneity in children's asthma susceptibility in relation to ambient B[a]P concentration with greater efficiency.
- Klíčová slova
- air pollution, asthma, gene-environment interaction, polycyclic aromatic hydrocarbon, single nucleotide polymorphism,
- MeSH
- algoritmy MeSH
- benzopyren toxicita MeSH
- bronchiální astma chemicky indukované genetika MeSH
- dítě MeSH
- genetická predispozice k nemoci * MeSH
- interakce genů a prostředí MeSH
- jednonukleotidový polymorfismus MeSH
- látky znečišťující vzduch toxicita MeSH
- lidé MeSH
- multifaktoriální dědičnost * MeSH
- statistika jako téma MeSH
- strojové učení bez učitele MeSH
- studie případů a kontrol MeSH
- vystavení vlivu životního prostředí škodlivé účinky MeSH
- znečištění ovzduší škodlivé účinky MeSH
- Check Tag
- dítě MeSH
- lidé MeSH
- mužské pohlaví MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- benzopyren MeSH
- látky znečišťující vzduch MeSH
BACKGROUND: Increasingly large and complex biomedical data sets challenge conventional hypothesis-driven analytical approaches, however, data-driven unsupervised learning can detect inherent patterns in such data sets. METHODS: While unsupervised analysis in the medical literature commonly only utilizes a single clustering algorithm for a given data set, we developed a large-scale model with 605 different combinations of target dimensionalities as well as transformation and clustering algorithms and subsequent meta-clustering of individual results. With this model, we investigated a large cohort of 1383 patients from 59 centers in Germany with newly diagnosed acute myeloid leukemia for whom 212 clinical, laboratory, cytogenetic and molecular genetic parameters were available. RESULTS: Unsupervised learning identifies four distinct patient clusters, and statistical analysis shows significant differences in rate of complete remissions, event-free, relapse-free and overall survival between the four clusters. In comparison to the standard-of-care hypothesis-driven European Leukemia Net (ELN2017) risk stratification model, we find all three ELN2017 risk categories being represented in all four clusters in varying proportions indicating unappreciated complexity of AML biology in current established risk stratification models. Further, by using assigned clusters as labels we subsequently train a supervised model to validate cluster assignments on a large external multicenter cohort of 664 intensively treated AML patients. CONCLUSIONS: Dynamic data-driven models are likely more suitable for risk stratification in the context of increasingly complex medical data than rigid hypothesis-driven models to allow for a more personalized treatment allocation and gain novel insights into disease biology.
There are various ways in which clinicians can predict the risk of disease progression in patients with leukemia, helping them to treat the patients accordingly. However, these approaches are usually designed by human experts and might not fully capture the complexity of a patient’s disease. Here, with a large cohort of patients with acute myeloid leukemia, we design an unsupervised machine learning model – a type of computer model that learns from patterns in data without human input—to separate these patients into subgroups according to risk. We identify four distinct groups which differ with regards to patient genetics, laboratory values, and clinical characteristics. These groups have differences in response to treatment and patient survival, and we validate our findings in another dataset. Our approach might help clinicians to better predict outcomes in patients with leukemia and make decisions on treatment.
- Publikační typ
- časopisecké články MeSH
BACKGROUND: A statistical pipeline was developed and used for determining candidate genes and candidate gene coexpression networks involved in 2 alcohol (i.e., ethanol [EtOH]) metabolism phenotypes, namely alcohol clearance and acetate area under the curve in a recombinant inbred (RI) (HXB/BXH) rat panel. The approach was also used to provide an indication of how EtOH metabolism can impact the normal function of the identified networks. METHODS: RNA was extracted from alcohol-naïve liver tissue of 30 strains of HXB/BXH RI rats. The reconstructed transcripts were quantitated, and data were used to construct gene coexpression modules and networks. A separate group of rats, comprising the same 30 strains, were injected with EtOH (2 g/kg) for measurement of blood EtOH and acetate levels. These data were used for quantitative trait loci (QTL) analysis of the rate of EtOH disappearance and circulating acetate levels. The analysis pipeline required calculation of the module eigengene values, the correction of these values with EtOH metabolism rates and acetate levels across the rat strains, and the determination of the eigengene QTLs. For a module to be considered a candidate for determining phenotype, the module eigengene values had to have significant correlation with the strain phenotypic values and the module eigengene QTLs had to overlap the phenotypic QTLs. RESULTS: Of the 658 transcript coexpression modules generated from liver RNA sequencing data, a single module satisfied all criteria for being a candidate for determining the alcohol clearance trait. This module contained 2 alcohol dehydrogenase genes, including the gene whose product was previously shown to be responsible for the majority of alcohol elimination in the rat. This module was also the only module identified as a candidate for influencing circulating acetate levels. This module was also linked to the process of generation and utilization of retinoic acid as related to the autonomous immune response. CONCLUSIONS: We propose that our analytical pipeline can successfully identify genetic regions and transcripts which predispose a particular phenotype and our analysis provides functional context for coexpression module components.
- Klíčová slova
- Alcohol Metabolism, HXB/BXH Recombinant Inbred Rat Panel, Liver, Quantitative Trait Locus Mapping, RNA Sequencing, Weighted Gene Coexpression Network Analysis,
- MeSH
- ethanol aplikace a dávkování metabolismus MeSH
- játra účinky léků metabolismus MeSH
- krysa rodu Rattus MeSH
- metabolická clearance účinky léků fyziologie MeSH
- multifaktoriální dědičnost účinky léků fyziologie MeSH
- pití alkoholu genetika metabolismus MeSH
- potkani inbrední BN MeSH
- potkani inbrední SHR MeSH
- potkani transgenní MeSH
- strojové učení bez učitele * MeSH
- systémová biologie metody MeSH
- zvířata MeSH
- Check Tag
- krysa rodu Rattus MeSH
- mužské pohlaví MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Research Support, N.I.H., Extramural MeSH
- Názvy látek
- ethanol MeSH