supervised machine learning
Supervised machine learning (ML) is used extensively in biology and deserves closer scrutiny. The Data Optimization Model Evaluation (DOME) recommendations aim to enhance the validation and reproducibility of ML research by establishing standards for key aspects such as data handling and processing, optimization, evaluation, and model interpretability. The recommendations help to ensure that key details are reported transparently by providing a structured set of questions. Here, we introduce the DOME registry (URL: registry.dome-ml.org), a database that allows scientists to manage and access comprehensive DOME-related information on published ML studies. The registry uses external resources like ORCID, APICURON, and the Data Stewardship Wizard to streamline the annotation process and ensure comprehensive documentation. By assigning unique identifiers and DOME scores to publications, the registry fosters a standardized evaluation of ML methods. Future plans include continuing to grow the registry through community curation, improving the DOME score definition and encouraging publishers to adopt DOME standards, and promoting transparency and reproducibility of ML in the life sciences.
- MeSH
- Databases, Factual MeSH
- Humans MeSH
- Registries * MeSH
- Reproducibility of Results MeSH
- Supervised Machine Learning * MeSH
- Check Tag
- Humans MeSH
- Publication Type
- Journal Article MeSH
The scarcity of high-quality annotations in many application scenarios has recently led to an increasing interest in devising learning techniques that combine unlabeled data with labeled data in a network. In this work, we focus on the label propagation problem in multilayer networks. Our approach is inspired by the heat diffusion model, which has proven useful in machine learning problems such as classification and dimensionality reduction. We propose a novel boundary-based heat diffusion algorithm that guarantees a closed-form solution with an efficient implementation. We experimentally validated our method on synthetic networks and five real-world multilayer network datasets representing scientific coauthorship, spreading drug adoption among physicians, two bibliographic networks, and a movie network. The results demonstrate the benefits of the proposed algorithm: our boundary-based heat diffusion outperforms the state-of-the-art methods.
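To make the idea of heat-diffusion label propagation concrete, here is a minimal sketch. It is not the authors' boundary-based algorithm and it handles only a single-layer graph: it assumes the classic harmonic formulation, in which labeled nodes are held at fixed "temperatures" (the boundary) and every unlabeled node settles at the average of its neighbors. The function names `solve` and `harmonic_labels` and the toy path graph are illustrative assumptions.

```python
from fractions import Fraction


def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination using exact fractions.

    Assumes A is non-singular, which holds when every connected component
    of the graph contains at least one labeled (boundary) node.
    """
    n = len(A)
    M = [[Fraction(v) for v in row] + [Fraction(b[i])] for i, row in enumerate(A)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        M[col] = [v / M[col][col] for v in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                factor = M[r][col]
                M[r] = [vr - factor * vc for vr, vc in zip(M[r], M[col])]
    return [M[i][n] for i in range(n)]


def harmonic_labels(edges, boundary):
    """Steady-state heat on unlabeled nodes given fixed boundary values.

    edges: list of undirected (u, v) pairs.
    boundary: dict mapping labeled node -> fixed value (e.g. 1 or 0).
    """
    nodes = sorted({n for e in edges for n in e})
    free = [n for n in nodes if n not in boundary]
    idx = {n: i for i, n in enumerate(free)}
    neighbors = {n: [] for n in nodes}
    for u, v in edges:
        neighbors[u].append(v)
        neighbors[v].append(u)
    # Each free node satisfies: degree * x_n - sum(free neighbors) = sum(boundary neighbors)
    A = [[0] * len(free) for _ in free]
    b = [0] * len(free)
    for n in free:
        i = idx[n]
        A[i][i] = len(neighbors[n])
        for m in neighbors[n]:
            if m in boundary:
                b[i] += boundary[m]
            else:
                A[i][idx[m]] -= 1
    x = solve(A, b)
    return {n: x[idx[n]] for n in free}


# Toy path graph a - b - c - d - e with the two end nodes labeled.
edges = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "e")]
scores = harmonic_labels(edges, {"a": 1, "e": 0})
```

On this toy path the scores fall off linearly from the node labeled 1 to the node labeled 0, so thresholding at one half would assign each unlabeled node the label of the nearer end.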
- MeSH
- Algorithms MeSH
- Supervised Machine Learning * MeSH
- Machine Learning MeSH
- Hot Temperature * MeSH
- Publication Type
- Journal Article MeSH
Loss of olfactory function is a typical acute coronavirus disease 2019 (COVID-19) symptom, at least in early variants of SARS-CoV-2. The time that has elapsed since the emergence of COVID-19 now allows for assessing the long-term prognosis of its olfactory impact. Participants (n = 722), of whom n = 464 reported having had COVID-19 (modal time since infection: 174 days), were approached in a museum as a relatively unbiased environment. Olfactory function was diagnosed by assessing odor threshold and odor identification performance. Subjects also rated their actual olfactory function on an 11-point numerical scale [0, ..., 10]. Neither the frequency of olfactory diagnostic categories nor olfactory test scores showed any COVID-19-related effects. Olfactory diagnostic categories (anosmia, hyposmia, or normosmia) were similarly distributed among former patients and controls (0.86%, 18.97%, and 80.17% for former patients and 1.17%, 17.51%, and 81.32% for controls). Former COVID-19 patients, however, showed differences in their subjective perception of their own olfactory function. This effect was substantial enough that supervised machine learning algorithms detected past COVID-19 infections in new subjects, based on reduced self-awareness of olfactory performance and parosmia, while the diagnosed olfactory function did not contribute any relevant information in this context. Based on diagnosed olfactory function, the results suggest a positive long-term prognosis for COVID-19-related olfactory loss. Traces of former infection are found in self-perceptions of olfaction, highlighting the importance of investigating the long-term effects of COVID-19 using reliable and validated diagnostic measures in olfactory testing.
- MeSH
- Anosmia diagnosis etiology MeSH
- Smell MeSH
- COVID-19 * MeSH
- Humans MeSH
- Olfaction Disorders * diagnosis MeSH
- Supervised Machine Learning MeSH
- RNA, Viral MeSH
- SARS-CoV-2 MeSH
- Check Tag
- Humans MeSH
- Publication Type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
In this paper, we propose an integrated biologically inspired visual collision avoidance approach that is deployed on a real hexapod walking robot. The proposed approach is based on the Lobula giant movement detector (LGMD), a neural network for looming stimuli detection that can be found in the visual pathways of insects, such as locusts. Although superior performance of the LGMD in the detection of intercepting objects has been shown in many collision avoidance scenarios, its direct integration with motion control is an unexplored topic. In our work, we propose to utilize the LGMD neural network for visual interception detection together with a central pattern generator (CPG) for locomotion control of a hexapod walking robot; the two are combined in a controller based on the long short-term memory (LSTM) recurrent neural network. Moreover, we propose self-supervised learning of the integrated controller to autonomously find a suitable setting of the system using a realistic robotic simulator. Thus, individual neural networks are trained in a simulation to enhance the performance of the controller, which is then experimentally verified with a real hexapod walking robot in both collision and interception avoidance scenarios and in navigation in a cluttered environment.
- MeSH
- Behavior, Animal physiology MeSH
- Walking physiology MeSH
- Grasshoppers physiology MeSH
- Neural Networks, Computer MeSH
- Supervised Machine Learning MeSH
- Robotics instrumentation MeSH
- Avoidance Learning physiology MeSH
- Animals MeSH
- Check Tag
- Animals MeSH
- Publication Type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Decision making on the treatment of vestibular schwannoma (VS) is mainly based on the symptoms, tumor size, patient's preference, and experience of the medical team. Here we provide objective tools to support the decision process by answering two questions: can a single checkup predict the need for active treatment, and which attributes of VS development are important in decision making on active treatment? Using a machine-learning analysis of the medical records of 93 patients, the objectives were addressed using two classification tasks: a time-independent case-based reasoning (CBR), where each medical record was treated as independent, and a personalized dynamic analysis (PDA), during which we analyzed the individual development of each patient's state in time. Using the CBR method we found that Koos classification of tumor size, speech reception threshold, and pure tone audiometry collectively predict the need for active treatment with approximately 90% accuracy; in the PDA task, the increase of Koos classification and VS size alone were sufficient. Our results indicate that VS treatment may be reliably predicted using only a small set of basic parameters, even without knowledge of individual development, which may help to simplify VS treatment strategies, reduce the number of examinations, and increase cost-effectiveness.
- MeSH
- Adult MeSH
- Clinical Decision-Making * MeSH
- Middle Aged MeSH
- Humans MeSH
- Disease Management * MeSH
- Reproducibility of Results MeSH
- Supervised Machine Learning MeSH
- ROC Curve MeSH
- Decision Trees MeSH
- Aged MeSH
- Hearing MeSH
- Hearing Tests MeSH
- Machine Learning * MeSH
- Symptom Assessment MeSH
- Neuroma, Acoustic diagnosis therapy MeSH
- Check Tag
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Aged MeSH
- Female MeSH
- Publication Type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Pathophysiological recordings of patients obtained with various testing methods are frequently used in the medical field for determining symptoms as well as for predicting the probability of selected diseases. There are numerous symptoms among the Parkinson's disease (PD) population; however, changes in speech and articulation are potentially the most significant biomarker. This article focuses on PD diagnosis classification based on speech signals using pattern recognition methods (AdaBoost, bagged trees, quadratic SVM, and k-NN). The dataset investigated in the article consists of voice measurements from 30 PD and 30 healthy control (HC) individuals, with each individual represented by 2 recordings. Training signals for PD and HC underwent extraction of relatively well-discriminating features relating to energy and spectral speech properties. Model implementations included 5-fold cross-validation. The accuracy of the models was calculated from the confusion matrix. The average overall accuracy was 82.3% and the average AUC was 0.88 (min. AUC = 0.86) on the available data.
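As an illustration of how accuracy is read off a confusion matrix and averaged across cross-validation folds, here is a minimal sketch. The function names and all label sequences and counts are hypothetical; they are not the study's data and do not reproduce its reported figures.

```python
def confusion_matrix(y_true, y_pred, positive="PD"):
    """Count true/false positives and negatives for a two-class problem."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    return tp, fp, tn, fn


def accuracy(tp, fp, tn, fn):
    """Share of all cases that were classified correctly."""
    return (tp + tn) / (tp + fp + tn + fn)


def mean_accuracy(fold_matrices):
    """Average the per-fold accuracies, as done in k-fold cross-validation."""
    return sum(accuracy(*m) for m in fold_matrices) / len(fold_matrices)


# Hypothetical predictions for one small test fold (illustrative only).
y_true = ["PD", "PD", "PD", "HC", "HC", "HC"]
y_pred = ["PD", "PD", "HC", "HC", "HC", "PD"]
matrix = confusion_matrix(y_true, y_pred)
fold_accuracy = accuracy(*matrix)

# Hypothetical (tp, fp, tn, fn) counts for two folds.
overall = mean_accuracy([(5, 1, 5, 1), (4, 2, 6, 0)])
```

When each individual contributes more than one recording, as in the dataset described above, it is usually safer to keep all recordings of one person in the same fold; the abstract does not say whether this was done.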
In response to our study, the commentary by Infanti et al. (2024) raised critical points regarding (i) the conceptualization and utility of the user-avatar bond in addressing gaming disorder (GD) risk, and (ii) the optimization of supervised machine learning techniques applied to assess GD risk. To advance the scientific dialogue and progress in these areas, the present paper aims to: (i) enhance the clarity and understanding of the concepts of the avatar, the user-avatar bond (UAB), and the digital phenotype concerning GD within the broader field of behavioral addictions, and (ii) comparatively assess how the UAB may predict GD risk, by both removing data augmentation before the data split and by implementing alternative data imbalance treatment approaches in programming.
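The methodological point about augmenting data before the split can be sketched as follows. This is a minimal illustration, assuming plain random oversampling as a stand-in for whatever imbalance treatment is used; it is not the authors' pipeline, and the function name, record layout, and labels are hypothetical. The key property is that the test set is set aside first, so no duplicated or synthetic copy of a test record can leak into training.

```python
import random


def split_then_oversample(records, label_of, test_fraction=0.25, seed=0):
    """Hold out a test set first, then balance classes in the training set only."""
    rng = random.Random(seed)
    shuffled = records[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    test, train = shuffled[:n_test], shuffled[n_test:]

    # Group the training records by class label.
    by_label = {}
    for record in train:
        by_label.setdefault(label_of(record), []).append(record)

    # Randomly duplicate minority-class records until every class
    # matches the size of the largest class.
    target = max(len(group) for group in by_label.values())
    balanced = []
    for group in by_label.values():
        balanced.extend(group)
        balanced.extend(rng.choice(group) for _ in range(target - len(group)))
    return balanced, test


# Hypothetical records: (participant id, label), with a 4-to-16 imbalance.
records = [(i, "at_risk" if i < 4 else "not_at_risk") for i in range(20)]
train, test = split_then_oversample(records, label_of=lambda r: r[1])
```

Oversampling the full dataset before splitting would allow copies of the same participant to appear on both sides of the split, which inflates test performance.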
- MeSH
- Avatar MeSH
- Humans MeSH
- Internet Addiction Disorder * MeSH
- Supervised Machine Learning MeSH
- Machine Learning * MeSH
- User-Computer Interface MeSH
- Video Games MeSH
- Check Tag
- Humans MeSH
- Publication Type
- Journal Article MeSH
Narcolepsy is a rare life-long disease that exists in two forms, narcolepsy type-1 (NT1) or type-2 (NT2), but only NT1 is accepted as a clearly defined entity. Both types of narcolepsy belong to the group of central hypersomnias (CH), a spectrum of poorly defined diseases with excessive daytime sleepiness as a core feature. Due to the considerable overlap of symptoms and the rarity of the diseases, it is difficult to identify distinct phenotypes of CH. Machine learning (ML) can help to identify phenotypes as it learns to recognize clinical features invisible to humans. Here we apply ML to data from the large European Narcolepsy Network (EU-NN) database, which contains hundreds of mixed features of narcolepsy, making it difficult to analyze with classical statistics. Stochastic gradient boosting, a supervised learning model with built-in feature selection, achieves high performance on the testing set. While cataplexy features are recognized as the most influential predictors, the machine finds additional features, e.g. mean rapid-eye-movement sleep latency in the multiple sleep latency test, that contribute to classifying NT1 and NT2, as confirmed by classical statistical analysis. Our results suggest ML can identify features of CH on machine scale from complex databases, thus providing 'ideas' and promising candidates for future diagnostic classifications.
- MeSH
- Models, Biological * MeSH
- Databases, Factual statistics & numerical data MeSH
- Datasets as Topic MeSH
- Adult MeSH
- Data Interpretation, Statistical MeSH
- Humans MeSH
- Young Adult MeSH
- Narcolepsy classification diagnosis physiopathology MeSH
- Polysomnography statistics & numerical data MeSH
- Supervised Machine Learning * MeSH
- ROC Curve MeSH
- Sleep, REM physiology MeSH
- Sleep Latency physiology MeSH
- Stochastic Processes MeSH
- Rare Diseases classification diagnosis physiopathology MeSH
- Check Tag
- Adult MeSH
- Humans MeSH
- Young Adult MeSH
- Male MeSH
- Female MeSH
- Publication Type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
The goal of this research was to design a solution to detect non-reported incidents, especially severe incidents. To achieve this goal, we proposed a method to process electronic medical records and automatically extract clinical notes describing severe incidents. To evaluate the proposed method, we implemented a system and put it to use. The system successfully detected an incident that had not been reported to the safety management department.
Machine learning techniques are methods that, given a training set of examples, infer a model for the categories of the data, so that new (unknown) examples can be assigned to one or more categories by pattern matching within the model. Data from follow-up studies with repeated collection of the same type of data are very suitable for this analysis. Machine learning algorithms belonging to a variety of paradigms have been applied to knowledge discovery on medical data. All the algorithms used belong to the supervised learning paradigm. Several algorithms were tested, trying to cover most kinds of supervised learning. Two kinds of experiments were carried out: the first was intended to discover associations between attributes, the second to test prediction of future disorders. The experiments in this paper used data from a twenty-year primary preventive longitudinal study of the risk factors (RF) of atherosclerosis in middle-aged men, named STULONG (LONGitudinal STUdy). The results show that some methods predict some disorders better than others, so it is worthwhile to use all the algorithms together and judge the confidence of the result based on the known tendency of each method. The machine learning algorithms were also used to predict cause of death, with poor results in this case, possibly due to the small amount of information (entries) of this type in the dataset.
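One way to read the suggestion of combining all algorithms and weighing each by its known tendency is a reliability-weighted vote. The sketch below is an assumption about how that could look, not the procedure used in the study; the method names, labels, and reliability values are hypothetical.

```python
def weighted_vote(predictions, reliability):
    """Combine per-method predictions, weighting each by its known reliability.

    predictions: dict mapping method name -> predicted label.
    reliability: dict mapping method name -> weight, e.g. the accuracy
                 previously observed for that method on this kind of disorder.
    Returns the winning label and its share of the total weight.
    """
    totals = {}
    for method, label in predictions.items():
        totals[label] = totals.get(label, 0.0) + reliability[method]
    winner = max(totals, key=totals.get)
    confidence = totals[winner] / sum(totals.values())
    return winner, confidence


# Hypothetical outputs of three supervised learners for one patient.
predictions = {
    "decision_tree": "disorder",
    "naive_bayes": "no_disorder",
    "nearest_neighbor": "disorder",
}
# Hypothetical weights reflecting how well each method has done before.
reliability = {"decision_tree": 0.5, "naive_bayes": 0.25, "nearest_neighbor": 0.25}

label, confidence = weighted_vote(predictions, reliability)
```

A separate reliability table per predicted disorder would reflect the observation that some methods predict some disorders better than others.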
- Keywords
- knowledge discovery, supervised machine learning, biomedical data mining, atherosclerosis risk factors
- MeSH
- Algorithms MeSH
- Atherosclerosis diagnosis MeSH
- Databases, Factual MeSH
- Financing, Organized MeSH
- Middle Aged MeSH
- Humans MeSH
- Decision Support Techniques MeSH
- Prognosis MeSH
- Risk Factors MeSH
- Decision Support Systems, Clinical MeSH
- Information Storage and Retrieval MeSH
- Knowledge Bases MeSH
- Check Tag
- Middle Aged MeSH
- Humans MeSH
- Male MeSH