Random forest
Dotaz
Zobrazit nápovědu
With the incorporation of effective therapies for myelofibrosis (MF), accurately predicting outcomes after allogeneic hematopoietic cell transplantation (allo-HCT) is crucial for determining the optimal timing for this procedure. Using data from 5183 patients with MF who underwent first allo-HCT between 2005 and 2020 at European Society for Blood and Marrow Transplantation centers, we examined different machine learning (ML) models to predict overall survival after transplant. The cohort was divided into a training set (75%) and a test set (25%) for model validation. A random survival forests (RSF) model was developed based on 10 variables: patient age, comorbidity index, performance status, blood blasts, hemoglobin, leukocytes, platelets, donor type, conditioning intensity, and graft-versus-host disease prophylaxis. Its performance was compared with a 4-level Cox regression-based score and other ML-based models derived from the same data set, and with the Center for International Blood and Marrow Transplant Research score. The RSF outperformed all comparators, achieving better concordance indices across both primary and postessential thrombocythemia/polycythemia vera MF subgroups. The robustness and generalizability of the RSF model was confirmed by Akaike information criterion and time-dependent receiver operating characteristic area under the curve metrics in both sets. Although all models were prognostic for nonrelapse mortality, the RSF provided better curve separation, effectively identifying a high-risk group comprising 25% of patients. In conclusion, ML enhances risk stratification in patients with MF undergoing allo-HCT, paving the way for personalized medicine. A web application (https://gemfin.click/ebmt) based on the RSF model offers a practical tool to identify patients at high risk for poor transplantation outcomes, supporting informed treatment decisions and advancing individualized care.
- MeSH
- dospělí MeSH
- lidé středního věku MeSH
- lidé MeSH
- míra přežití MeSH
- primární myelofibróza * terapie mortalita MeSH
- prognóza MeSH
- senioři MeSH
- strojové učení * MeSH
- transplantace hematopoetických kmenových buněk * mortalita MeSH
- Check Tag
- dospělí MeSH
- lidé středního věku MeSH
- lidé MeSH
- mužské pohlaví MeSH
- senioři MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
BACKGROUND: Advancements in artificial intelligence (AI) and machine learning (ML) have revolutionized the medical field and transformed translational medicine. These technologies enable more accurate disease trajectory models while enhancing patient-centered care. However, challenges such as heterogeneous datasets, class imbalance, and scalability remain barriers to achieving optimal predictive performance. METHODS: This study proposes a novel AI-based framework that integrates Gradient Boosting Machines (GBM) and Deep Neural Networks (DNN) to address these challenges. The framework was evaluated using two distinct datasets: MIMIC-IV, a critical care database containing clinical data of critically ill patients, and the UK Biobank, which comprises genetic, clinical, and lifestyle data from 500,000 participants. Key performance metrics, including Accuracy, Precision, Recall, F1-Score, and AUROC, were used to assess the framework against traditional and advanced ML models. RESULTS: The proposed framework demonstrated superior performance compared to classical models such as Logistic Regression, Random Forest, Support Vector Machines (SVM), and Neural Networks. For example, on the UK Biobank dataset, the model achieved an AUROC of 0.96, significantly outperforming Neural Networks (0.92). The framework was also efficient, requiring only 32.4 s for training on MIMIC-IV, with low prediction latency, making it suitable for real-time applications. CONCLUSIONS: The proposed AI-based framework effectively addresses critical challenges in translational medicine, offering superior predictive accuracy and efficiency. Its robust performance across diverse datasets highlights its potential for integration into real-time clinical decision support systems, facilitating personalized medicine and improving patient outcomes. Future research will focus on enhancing scalability and interpretability for broader clinical applications.
BACKGROUND: Prognostic machine learning research in multiple sclerosis has been mainly focusing on black-box models predicting whether a patients' disability will progress in a fixed number of years. However, as this is a binary yes/no question, it cannot take individual disease severity into account. Therefore, in this work we propose to model the time to disease progression instead. Additionally, we use explainable machine learning techniques to make the model outputs more interpretable. METHODS: A preprocessed subset of 29,201 patients of the international data registry MSBase was used. Disability was assessed in terms of the Expanded Disability Status Scale (EDSS). We predict the time to significant and confirmed disability progression using random survival forests, a machine learning model for survival analysis. Performance is evaluated on a time-dependent area under the receiver operating characteristic and the precision-recall curves. Importantly, predictions are then explained using SHAP and Bellatrex, two explainability toolboxes, and lead to both global (population-wide) as well as local (patient visit-specific) insights. RESULTS: On the task of predicting progression in 2 years, the random survival forest achieves state-of-the-art performance, comparable to previous work employing a random forest. However, here the random survival forest has the added advantage of being able to predict progression over a longer time horizon, with AUROC >60% for the first 10 years after baseline. Explainability techniques further validated the model by extracting clinically valid insights from the predictions made by the model. For example, a clear decline in the per-visit probability of progression is observed in more recent years since 2012, likely reflecting globally increasing use of more effective MS therapies. CONCLUSION: The binary classification models found in the literature can be extended to a time-to-event setting without loss of performance, thus allowing a more comprehensive prediction of patient prognosis. Furthermore, explainability techniques proved to be key to reach a better understanding of the model and increase validation of its behaviour.
- MeSH
- algoritmy MeSH
- časové faktory MeSH
- dospělí MeSH
- lidé MeSH
- prognóza MeSH
- progrese nemoci * MeSH
- registrace MeSH
- ROC křivka MeSH
- roztroušená skleróza * patofyziologie MeSH
- strojové učení * MeSH
- Check Tag
- dospělí MeSH
- lidé MeSH
- mužské pohlaví MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
The Arteriovenous Access Stage (AVAS) classification simplifies information about suitability of vessels for vascular access (VA). It's been previously validated in a clinical study. Here, AVAS performance was tested against multiple ultrasound mapping measurements using machine learning. A prospective multicentre international study (NCT04796558) with patient recruitment from March 2021-July 2024. Demographics, risk factors, vessels parameters, types of predicted and created VA (pVA, cVA) were collected. We modelled pVA and cVA using the Random Forest algorithm. Model performance was estimated and compared using Bayesian generalized linear models. ROC AUC with 95% credible intervals was the performance metric. 1151 patients were included. ROC AUC for pVA prediction by AVAS was 0.79 (0.77;0.82) and by mapping was 0.85 (0.83;0.88). ROC AUC for cVA prediction by AVAS was 0.71 (0.69;0.74) and by mapping was 0.8 (0.78;0.83). Using AVAS with other parameters increased the ROC AUC to 0.87 for pVA (0.84;0.89) and 0.82 (0.79;0.84) for cVA. Using mapping with other parameters increased the ROC AUC to 0.88 for pVA (0.86;0.91) and 0.85 (0.83;0.88) for cVA. Multiple mapping measurements showed higher performance at VA prediction than AVAS. However, AVAS is simpler and quicker, so may be preferable for routine clinical practice.
- MeSH
- arteriovenózní zkrat MeSH
- Bayesova věta MeSH
- lidé středního věku MeSH
- lidé MeSH
- prospektivní studie MeSH
- ROC křivka MeSH
- senioři MeSH
- strojové učení * MeSH
- ultrasonografie * metody MeSH
- Check Tag
- lidé středního věku MeSH
- lidé MeSH
- mužské pohlaví MeSH
- senioři MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
- multicentrická studie MeSH
- práce podpořená grantem MeSH
- srovnávací studie MeSH
BACKGROUND: The advancement of nanotechnology underscores the imperative need for establishing in silico predictive models to assess safety, particularly in the context of chronic respiratory afflictions such as lung fibrosis, a pathogenic transformation that is irreversible. While the compilation of predictive descriptors is pivotal for in silico model development, key features specifically tailored for predicting lung fibrosis remain elusive. This study aimed to uncover the essential predictive descriptors governing nanoparticle-induced pulmonary fibrosis. METHODS: We conducted a comprehensive analysis of the trajectory of metal oxide nanoparticles (MeONPs) within pulmonary systems. Two biological media (simulated lung fluid and phagolysosomal simulated fluid) and two cell lines (macrophages and epithelial cells) were meticulously chosen to scrutinize MeONP behaviors. Their interactions with MeONPs, also referred to as nano-bio interactions, can lead to alterations in the properties of the MeONPs as well as specific cellular responses. Physicochemical properties of MeONPs were assessed in biological media. The impact of MeONPs on cell membranes, lysosomes, mitochondria, and cytoplasmic components was evaluated using fluorescent probes, colorimetric enzyme substrates, and ELISA. The fibrogenic potential of MeONPs in mouse lungs was assessed by examining collagen deposition and growth factor release. Random forest classification was employed for analyzing in chemico, in vitro and in vivo data to identify predictive descriptors. RESULTS: The nano-bio interactions induced diverse changes in the 4 characteristics of MeONPs and had variable effects on the 14 cellular functions, which were quantitatively evaluated in chemico and in vitro. Among these 18 quantitative features, seven features were found to play key roles in predicting the pro-fibrogenic potential of MeONPs. Notably, IL-1β was identified as the most important feature, contributing 27.8% to the model's prediction. Mitochondrial activity (specifically NADH levels) in macrophages followed closely with a contribution of 17.6%. The remaining five key features include TGF-β1 release and NADH levels in epithelial cells, dissolution in lysosomal simulated fluids, zeta potential, and the hydrodynamic size of MeONPs. CONCLUSIONS: The pro-fibrogenic potential of MeONPs can be predicted by combination of key features at nano-bio interfaces, simulating their behavior and interactions within the lung environment. Among the 18 quantitative features, a combination of seven in chemico and in vitro descriptors could be leveraged to predict lung fibrosis in animals. Our findings offer crucial insights for developing in silico predictive models for nano-induced pulmonary fibrosis.
- MeSH
- buňky A549 MeSH
- kovové nanočástice * toxicita chemie MeSH
- lidé MeSH
- myši inbrední C57BL MeSH
- myši MeSH
- plíce účinky léků patologie metabolismus MeSH
- plicní fibróza * chemicky indukované metabolismus patologie MeSH
- zvířata MeSH
- Check Tag
- lidé MeSH
- myši MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
BACKGROUND AND AIMS: Risk stratification of sudden cardiac death after myocardial infarction and prevention by defibrillator rely on left ventricular ejection fraction (LVEF). Improved risk stratification across the whole LVEF range is required for decision-making on defibrillator implantation. METHODS: The analysis pooled 20 data sets with 140 204 post-myocardial infarction patients containing information on demographics, medical history, clinical characteristics, biomarkers, electrocardiography, echocardiography, and cardiac magnetic resonance imaging. Separate analyses were performed in patients (i) carrying a primary prevention cardioverter-defibrillator with LVEF ≤ 35% [implantable cardioverter-defibrillator (ICD) patients], (ii) without cardioverter-defibrillator with LVEF ≤ 35% (non-ICD patients ≤ 35%), and (iii) without cardioverter-defibrillator with LVEF > 35% (non-ICD patients >35%). Primary outcome was sudden cardiac death or, in defibrillator carriers, appropriate defibrillator therapy. Using a competing risk framework and systematic internal-external cross-validation, a model using LVEF only, a multivariable flexible parametric survival model, and a multivariable random forest survival model were developed and externally validated. Predictive performance was assessed by random effect meta-analysis. RESULTS: There were 1326 primary outcomes in 7543 ICD patients, 1193 in 25 058 non-ICD patients ≤35%, and 1567 in 107 603 non-ICD patients >35% during mean follow-up of 30.0, 46.5, and 57.6 months, respectively. In these three subgroups, LVEF poorly predicted sudden cardiac death (c-statistics between 0.50 and 0.56). Considering additional parameters did not improve calibration and discrimination, and model generalizability was poor. CONCLUSIONS: More accurate risk stratification for sudden cardiac death and identification of low-risk individuals with severely reduced LVEF or of high-risk individuals with preserved LVEF was not feasible, neither using LVEF nor using other predictors.
- MeSH
- defibrilátory implantabilní * MeSH
- elektrokardiografie MeSH
- hodnocení rizik metody MeSH
- infarkt myokardu * mortalita komplikace MeSH
- lidé MeSH
- náhlá srdeční smrt * prevence a kontrola epidemiologie etiologie MeSH
- tepový objem * fyziologie MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- metaanalýza MeSH
BACKGROUND: Wildfire activity is an important source of tropospheric ozone (O3) pollution. However, no study to date has systematically examined the associations of wildfire-related O3 exposure with mortality globally. METHODS: We did a multicountry two-stage time series analysis. From the Multi-City Multi-Country (MCC) Collaborative Research Network, data on daily all-cause, cardiovascular, and respiratory deaths were obtained from 749 locations in 43 countries or areas, representing overlapping periods from Jan 1, 2000, to Dec 31, 2016. We estimated the daily concentration of wildfire-related O3 in study locations using a chemical transport model, and then calibrated and downscaled O3 estimates to a resolution of 0·25° × 0·25° (approximately 28 km2 at the equator). Using a random-effects meta-analysis, we examined the associations of short-term wildfire-related O3 exposure (lag period of 0-2 days) with daily mortality, first at the location level and then pooled at the country, regional, and global levels. Annual excess mortality fraction in each location attributable to wildfire-related O3 was calculated with pooled effect estimates and used to obtain excess mortality fractions at country, regional, and global levels. FINDINGS: Between 2000 and 2016, the highest maximum daily wildfire-related O3 concentrations (≥30 μg/m3) were observed in locations in South America, central America, and southeastern Asia, and the country of South Africa. Across all locations, an increase of 1 μg/m3 in the mean daily concentration of wildfire-related O3 during lag 0-2 days was associated with increases of 0·55% (95% CI 0·29 to 0·80) in daily all-cause mortality, 0·44% (-0·10 to 0·99) in daily cardiovascular mortality, and 0·82% (0·18 to 1·47) in daily respiratory mortality. The associations of daily mortality rates with wildfire-related O3 exposure showed substantial geographical heterogeneity at the country and regional levels. Across all locations, estimated annual excess mortality fractions of 0·58% (95% CI 0·31 to 0·85; 31 606 deaths [95% CI 17 038 to 46 027]) for all-cause mortality, 0·41% (-0·10 to 0·91; 5249 [-1244 to 11 620]) for cardiovascular mortality, and 0·86% (0·18 to 1·51; 4657 [999 to 8206]) for respiratory mortality were attributable to short-term exposure to wildfire-related O3. INTERPRETATION: In this study, we observed an increase in all-cause and respiratory mortality associated with short-term wildfire-related O3 exposure. Effective risk and smoke management strategies should be implemented to protect the public from the impacts of wildfires. FUNDING: Australian Research Council and the Australian National Health and Medical Research Council.
- MeSH
- celosvětové zdraví MeSH
- kardiovaskulární nemoci * mortalita MeSH
- látky znečišťující vzduch * škodlivé účinky analýza MeSH
- lidé MeSH
- nemoci dýchací soustavy * mortalita MeSH
- ozon * škodlivé účinky analýza MeSH
- požáry v divočině * MeSH
- vystavení vlivu životního prostředí škodlivé účinky MeSH
- znečištění ovzduší škodlivé účinky analýza MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
OBJECTIVES: The purpose of the current study was to analyse the risks of Lyme borreliosis (LB) among 1,070 forestry workers, the influence of responsible behaviour (use of repellents, skin self-inspection) on Borrelia screening result status, and the occurrence of immediate and mid-term symptoms after tick bites and LB positive serological screening test. METHODS: The questionnaire was conducted as well as blood tests for LB disease by one-stage serological screening procedure using ELISA for specific B. burgdorferi IgM and IgG antibodies (EuroImmun AG company, Germany). RESULTS: While 39.6% of foresters were LB positive among bitten foresters, as many as 27.0% were LB positive among those, who did not recall any tick attacks at all. Individuals with known history of tick bites had significantly higher odds (1.770×) of being LB positive (p < 0.05), while the use of repellents or skin self-inspection after visiting woods had no influence on LB results. The odds of skin discolouration after tick bites was significantly lower (0.682×) in case of LB positive test compared to LB negative test (p < 0.05), which can be explained by the fact that foresters could be unaware about erythema migrans appearance and timing, considering tick bite and developed later rash as completely separate events. Moreover, 69.1% of the bitten foresters with LB positive result developed no secondary symptoms (excluding those related to the skin), and the most frequent clinical symptoms were arthralgia (24.9%), followed by myalgia (7.6%), headache (5.7%), and damage to facial nerve (2.7%), which are non-specific and can be present in other illnesses. CONCLUSION: Therefore, the recommendations proposed would be the regular laboratory testing for LB of sensitive and at-risk population, who visits endemic woody areas, irrespective of all other factors involved.
- MeSH
- Borrelia burgdorferi imunologie izolace a purifikace MeSH
- dospělí MeSH
- ELISA MeSH
- klíště * MeSH
- kousnutí klíštětem * MeSH
- lesnictví MeSH
- lidé středního věku MeSH
- lidé MeSH
- lymeská nemoc * epidemiologie MeSH
- průzkumy a dotazníky MeSH
- senioři MeSH
- zvířata MeSH
- Check Tag
- dospělí MeSH
- lidé středního věku MeSH
- lidé MeSH
- mužské pohlaví MeSH
- senioři MeSH
- ženské pohlaví MeSH
- zvířata MeSH
- Publikační typ
- časopisecké články MeSH
- Geografické názvy
- Německo MeSH
OBJECTIVES: This study aimed to explore the relationship between plasma proteome and the clinical features of Major Depressive Disorder (MDD) during treatment of acute episode. METHODS: In this longitudinal observational study, 26 patients hospitalized for moderate to severe MDD were analyzed. The study utilized Liquid Chromatography with Tandem Mass Spectrometry (LC-MS/MS) alongside clinical metrics, including symptomatology derived from the Montgomery-Åsberg Depression Rating Scale (MADRS). Plasma protein analysis was conducted at the onset of acute depression and 6 weeks into treatment. Analytical methods comprised of Linear Models for Microarray Data (LIMMA), Weighted Correlation Network Analysis (WGCNA), Generalized Linear Models, Random Forests, and The Database for Annotation, Visualization and Integrated Discovery (DAVID). RESULTS: Five distinct plasma protein modules were identified, correlating with specific biological processes, and uniquely associated with symptom presentation, the disorder's trajectory, and treatment response. A module rich in proteins related to adaptive immunity was correlated with the manifestation of somatic syndrome, treatment response, and inversely associated with achieving remission. A module associated with cell adhesion was linked to affective symptoms and avolition, and played a role in the initial episodes and treatment response. Another module, characterized by proteins involved in blood coagulation and lipid transport, exhibited negative correlations with a variety of MDD symptoms and was predominantly associated with the manifestation of psychotic symptoms. CONCLUSION: This research points to a complex interplay between the plasma proteome and MDD's clinical presentation, suggesting that somatic, affective, and psychotic symptoms may represent distinct endophenotypic manifestations of MDD. These insights hold potential for advancing targeted therapeutic strategies and diagnostic tools. LIMITATIONS: The study's limited sample size and its naturalistic design, encompassing diverse treatment modalities, present methodological constraints. Furthermore, the analysis focused on peripheral blood proteins, with potential implications for interpretability.
- Publikační typ
- časopisecké články MeSH
MOTIVATION: The association between weather conditions and stroke incidence has been a subject of interest for several years, yet the findings from various studies remain inconsistent. Additionally, predictive modelling in this context has been infrequent. This study explores the relationship of extremely high ischaemic stroke incidence and meteorological factors within the Slovak population. Furthermore, it aims to construct forecasting models of extremely high number of strokes. METHODS: Over a five-year period, a total of 52,036 cases of ischemic stroke were documented. Days exhibiting a notable surge in ischemic stroke occurrences (surpassing the 90th percentile of historical records) were identified as extreme cases. These cases were then scrutinized alongside daily meteorological parameters spanning from 2015 to 2019. To create forecasts for the occurrence of these extreme cases one day in advance, three distinct methods were employed: Logistic regression, Random Forest for Time Series, and Croston's method. RESULTS: For each of the analyzed stroke centers, the cross-correlations between instances of extremely high stroke numbers and meteorological factors yielded negligible results. Predictive performance achieved by forecasts generated through multivariate logistic regression and Random Forest for time series analysis, which incorporated meteorological data, was on par with that of Croston's method. Notably, Croston's method relies solely on the stroke time series data. All three forecasting methods exhibited limited predictive accuracy. CONCLUSIONS: The task of predicting days characterized by an exceptionally high number of strokes proved to be challenging across all three explored methods. The inclusion of meteorological parameters did not yield substantive improvements in forecasting accuracy.
- MeSH
- incidence MeSH
- ischemická cévní mozková příhoda * epidemiologie MeSH
- lidé MeSH
- logistické modely MeSH
- meteorologické pojmy MeSH
- počasí * MeSH
- předpověď * metody MeSH
- senioři MeSH
- Check Tag
- lidé MeSH
- mužské pohlaví MeSH
- senioři MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
- Geografické názvy
- Slovenská republika MeSH