Data-driven analysis
Dotaz
Zobrazit nápovědu
In the Czech Hydrometeorological Institute (CHMI) there exists a unique set of meteorological measurements consisting of the values of vertical atmospheric levels of beta and gamma radiation. In this paper a stochastic data-driven model based on nonlinear regression and on nonhomogeneous Poisson process is suggested. In the first part of the paper, growth curves were used to establish an appropriate nonlinear regression model. For comparison we considered a nonhomogeneous Poisson process with its intensity based on growth curves. In the second part both approaches were applied to the real data and compared. Computational aspects are briefly discussed as well. The primary goal of this paper is to present an improved understanding of the distribution of environmental radiation as obtained from the measurements of the vertical radioactivity profiles by the radioactivity sonde system.
Waste sorting at the source is a vital strategy of waste management and to improve urban sustainability. If the strategy is implemented by relying solely on publicity and civic awareness, the impact is less significant. Proactive measures, such as policy regulations, supervisory guidance, and stimulating incentives, play essential roles for better management. The unknown waste-dumping behaviour of residents is a great challenge for decision-makers to allocate resources for waste-collection operations and to refine regulations. Traditional behaviour analysis methods such as questionnaire surveys and simulation methods have limitations considering the population size and the complexity of individual behaviour. This study aims to design a data-driven analytical framework to analyse household waste-dumping behaviour and facilitate policy regulations by using the Internet of Things (IoT) and data mining technologies. The analytical framework is further developed into a four-step management cycle. A case study in Shanghai is employed to demonstrate the effectiveness of the analytical framework and management cycle. The results of behaviour analyses reveal that (1) waste-dumping frequency is high in the evening but negligible in the early afternoon; (2) compared to working days, peak-value time at weekends occurs later in the morning and earlier in the evening; (3) residents require longer waste-dumping time windows than those empirically recommended by administrators. Managerial insights and decision support based on these research results have been presented for decision-makers to guide operations management and facilitate policy regulations.
- MeSH
- nakládání s odpady * MeSH
- odpadky - odstraňování * MeSH
- skládková zařízení MeSH
- trvale udržitelný růst MeSH
- velkoměsta MeSH
- Publikační typ
- časopisecké články MeSH
- Geografické názvy
- Čína MeSH
- velkoměsta MeSH
A new test of the proportional hazards assumption in the Cox model is proposed. The idea is based on Neyman's smooth tests. The Cox model with proportional hazards (i.e. time-constant covariate effects) is embedded in a model with a smoothly time-varying covariate effect that is expressed as a combination of some basis functions (e.g., Legendre polynomials, cosines). Then the smooth test is the score test for significance of these artificial covariates. Furthermore, we apply a modification of Schwarz's selection rule to choosing the dimension of the smooth model (the number of the basis functions). The score test is then used in the selected model. In a simulation study, we compare the proposed tests with standard tests based on the score process.
... Features of gait 3 -- 1.4 Model-based versus model-free gait assessment 4 -- 1.5 Applications of gait analysis ... ... Statistics and computational intelligence in clinical gait analysis 17 -- 2.1 Introduction 17 -- 2.2 ... ... Low-cost sensors for gait analysis 25 -- 3.1 Introduction 25 -- 3.2 Motion capture sensors for gait 25 ... ... A low-cost electromyography (EMG) sensor-based gait activity analysis 101 -- 8.1 Introduction 101 -- ... ... 114 -- 8.7 Gait analysis using surface electromyography sensors 118 -- 8.8 Summary 124 -- References ...
x, 179 stran : ilustrace, grafy, schémata, portréty
BACKGROUND: Antineutrophil cytoplasmic antibody (ANCA)-associated vasculitis is a heterogenous autoimmune disease. While traditionally stratified into two conditions, granulomatosis with polyangiitis (GPA) and microscopic polyangiitis (MPA), the subclassification of ANCA-associated vasculitis is subject to continued debate. Here we aim to identify phenotypically distinct subgroups and develop a data-driven subclassification of ANCA-associated vasculitis, using a large real-world dataset. METHODS: In the collaborative data reuse project FAIRVASC (Findable, Accessible, Interoperable, Reusable, Vasculitis), registry records of patients with ANCA-associated vasculitis were retrieved from six European vasculitis registries: the Czech Registry of ANCA-associated vasculitis (Czech Republic), the French Vasculitis Study Group Registry (FVSG; France), the Joint Vasculitis Registry in German-speaking Countries (GeVas; Germany), the Polish Vasculitis Registry (POLVAS; Poland), the Irish Rare Kidney Disease Registry (RKD; Ireland), and the Skåne Vasculitis Cohort (Sweden). We performed model-based clustering of 17 mixed-type clinical variables using a parsimonious mixture of two latent Gaussian variable models. Clinical validation of the optimal cluster solution was made through summary statistics of the clusters' demography, phenotypic and serological characteristics, and outcome. The predictive value of models featuring the cluster affiliations were compared with classifications based on clinical diagnosis and ANCA specificity. People with lived experience were involved throughout the FAIRVASVC project. FINDINGS: A total of 3868 patients diagnosed with ANCA-associated vasculitis between Nov 1, 1966, and March 1, 2023, were included in the study across the six registries (Czech Registry n=371, FVSG n=1780, GeVas n=135, POLVAS n=792, RKD n=439, and Skåne Vasculitis Cohort n=351). There were 2434 (62·9%) patients with GPA and 1434 (37·1%) with MPA. Mean age at diagnosis was 57·2 years (SD 16·4); 2006 (51·9%) of 3867 patients were men and 1861 (48·1%) were women. We identified five clusters, with distinct phenotype, biochemical presentation, and disease outcome. Three clusters were characterised by kidney involvement: one severe kidney cluster (555 [14·3%] of 3868 patients) with high C-reactive protein (CRP) and serum creatinine concentrations, and variable ANCA specificity (SK cluster); one myeloperoxidase (MPO)-ANCA-positive kidney involvement cluster (782 [20·2%]) with limited extrarenal disease (MPO-K cluster); and one proteinase 3 (PR3)-ANCA-positive kidney involvement cluster (683 [17·7%]) with widespread extrarenal disease (PR3-K cluster). Two clusters were characterised by relative absence of kidney involvement: one was a predominantly PR3-ANCA-positive cluster (1202 [31·1%]) with inflammatory multisystem disease (IMS cluster), and one was a cluster (646 [16·7%]) with predominantly ear-nose-throat involvement and low CRP, with mainly younger patients (YR cluster). Compared with models fitted with clinical diagnosis or ANCA status, cluster-assigned models demonstrated improved predictive power with respect to both patient and kidney survival. INTERPRETATION: Our study reinforces the view that ANCA-associated vasculitis is not merely a binary construct. Data-driven subclassification of ANCA-associated vasculitis exhibits higher predictive value than current approaches for key outcomes. FUNDING: European Union's Horizon 2020 research and innovation programme under the European Joint Programme on Rare Diseases.
- MeSH
- ANCA-asociované vaskulitidy * klasifikace diagnóza epidemiologie krev imunologie MeSH
- dospělí MeSH
- kohortové studie MeSH
- lidé středního věku MeSH
- lidé MeSH
- mikroskopická polyangiitida klasifikace epidemiologie krev diagnóza imunologie MeSH
- registrace * statistika a číselné údaje MeSH
- senioři MeSH
- shluková analýza MeSH
- Check Tag
- dospělí MeSH
- lidé středního věku MeSH
- lidé MeSH
- mužské pohlaví MeSH
- senioři MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
- Geografické názvy
- Evropa MeSH
In social interactions, each individual's brain drives an action that, in turn, elicits systematic neural responses in their partner that drive a reaction. Consequently, the brain responses of both interactants become temporally contingent upon one another through the actions they generate, and different interaction dynamics will be underpinned by distinct forms of between-brain coupling. In this study, we investigated this by "performing functional magnetic resonance imaging on two individuals simultaneously (dual-fMRI) while they competed or cooperated with one another in a turn-based or concurrent fashion." To assess whether distinct patterns of neural coupling were associated with these different interactions, we combined two data-driven, model-free analytical techniques: group-independent component analysis and inter-subject correlation. This revealed four distinct patterns of brain responses that were temporally aligned between interactants: one emerged during co-operative exchanges and encompassed brain regions involved in social cognitive processing, such as the temporo-parietal cortex. The other three were associated with competitive exchanges and comprised brain systems implicated in visuo-motor processing and social decision-making, including the cerebellum and anterior cingulate cortex. Interestingly, neural coupling was significantly stronger in concurrent relative to turn-based exchanges. These results demonstrate the utility of data-driven approaches applied to "dual-fMRI" data in elucidating the interpersonal neural processes that give rise to the two-in-one dynamic characterizing social interaction.
- MeSH
- dospělí MeSH
- kompetitivní chování * MeSH
- kooperační chování * MeSH
- lidé MeSH
- magnetická rezonanční tomografie MeSH
- mapování mozku metody MeSH
- mladý dospělý MeSH
- mozeček diagnostické zobrazování fyziologie MeSH
- mozková kůra diagnostické zobrazování fyziologie MeSH
- sociální interakce * MeSH
- sociální kognice * MeSH
- Check Tag
- dospělí MeSH
- lidé MeSH
- mladý dospělý MeSH
- mužské pohlaví MeSH
- ženské pohlaví MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Untargeted metabolomic approaches offer new opportunities for a deeper understanding of the molecular events related to toxic exposure. This study proposes a metabolomic investigation of biochemical alterations occurring in urine as a result of dioxin toxicity. Urine samples were collected from Czech chemical workers submitted to severe dioxin occupational exposure in a herbicide production plant in the late 1960s. Experiments were carried out with ultra-high pressure liquid chromatography (UHPLC) coupled to high-resolution quadrupole time-of-flight (QTOF) mass spectrometry. A chemistry-driven feature selection was applied to focus on steroid-related metabolites. Supervised multivariate data analysis allowed biomarkers, mainly related to bile acids, to be highlighted. These results supported the hypothesis of liver damage and oxidative stress for long-term dioxin toxicity. As a second step of data analysis, the information gained from the urine analysis of Victor Yushchenko after his poisoning was examined. A subset of relevant urinary markers of acute dioxin toxicity from this extreme phenotype, including glucuro- and sulfo-conjugated endogenous steroid metabolites and bile acids, was assessed for its ability to detect long-term effects of exposure. The metabolomic strategy presented in this work allowed the determination of metabolic patterns related to dioxin effects in human and the discovery of highly predictive subsets of biologically meaningful and clinically relevant compounds. These results are expected to provide valuable information for a deeper understanding of the molecular events related to dioxin toxicity. Furthermore, it presents an original methodology of data dimensionality reduction by using extreme phenotype as a guide to select relevant features prior to data modeling (biologically driven data reduction).
- MeSH
- biologické markery moč MeSH
- data mining MeSH
- játra účinky léků metabolismus MeSH
- lidé MeSH
- metabolomika metody MeSH
- monitorování životního prostředí metody MeSH
- oxidační stres účinky léků MeSH
- polychlorované dibenzodioxiny toxicita MeSH
- pracovní expozice analýza MeSH
- vysokoúčinná kapalinová chromatografie MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
Statistics for biology and health
1st ed. xix, 455 s., obr. příl.
BACKGROUND AND OBJECTIVES: Recent studies fueled doubts as to whether all currently defined central disorders of hypersomnolence are stable entities, especially narcolepsy type 2 and idiopathic hypersomnia. New reliable biomarkers are needed, and the question arises of whether current diagnostic criteria of hypersomnolence disorders should be reassessed. The main aim of this data-driven observational study was to see whether data-driven algorithms would segregate narcolepsy type 1 and identify more reliable subgrouping of individuals without cataplexy with new clinical biomarkers. METHODS: We used agglomerative hierarchical clustering, an unsupervised machine learning algorithm, to identify distinct hypersomnolence clusters in the large-scale European Narcolepsy Network database. We included 97 variables, covering all aspects of central hypersomnolence disorders such as symptoms, demographics, objective and subjective sleep measures, and laboratory biomarkers. We specifically focused on subgrouping of patients without cataplexy. The number of clusters was chosen to be the minimal number for which patients without cataplexy were put in distinct groups. RESULTS: We included 1,078 unmedicated adolescents and adults. Seven clusters were identified, of which 4 clusters included predominantly individuals with cataplexy. The 2 most distinct clusters consisted of 158 and 157 patients, were dominated by those without cataplexy, and among other variables, significantly differed in presence of sleep drunkenness, subjective difficulty awakening, and weekend-week sleep length difference. Patients formally diagnosed as having narcolepsy type 2 and idiopathic hypersomnia were evenly mixed in these 2 clusters. DISCUSSION: Using a data-driven approach in the largest study on central disorders of hypersomnolence to date, our study identified distinct patient subgroups within the central disorders of hypersomnolence population. Our results contest inclusion of sleep-onset REM periods in diagnostic criteria for people without cataplexy and provide promising new variables for reliable diagnostic categories that better resemble different patient phenotypes. Cluster-guided classification will result in a more solid hypersomnolence classification system that is less vulnerable to instability of single features.
- MeSH
- idiopatická hypersomnie * diagnóza MeSH
- kataplexie * diagnóza MeSH
- lidé MeSH
- mladiství MeSH
- narkolepsie * diagnóza farmakoterapie MeSH
- poruchy nadměrné spavosti * diagnóza epidemiologie MeSH
- shluková analýza MeSH
- Check Tag
- lidé MeSH
- mladiství MeSH
- Publikační typ
- časopisecké články MeSH
- pozorovací studie MeSH
Dementia with Lewy bodies (DLB) is a neurodegenerative disorder with a wide heterogeneity of symptoms, which suggests the existence of different subtypes. We used data-driven analysis of magnetic resonance imaging (MRI) data to investigate DLB subtypes. We included 165 DLB from the Mayo Clinic and 3 centers from the European DLB consortium and performed a hierarchical cluster analysis to identify subtypes based on gray matter (GM) volumes. To characterize the subtypes, we used demographic and clinical data, as well as β-amyloid, tau, and cerebrovascular biomarkers at baseline, and cognitive decline over three years. We identified 3 subtypes: an older subtype with reduced cortical GM volumes, worse cognition, and faster cognitive decline (n = 49, 30%); a subtype with low GM volumes in fronto-occipital regions (n = 76, 46%); and a subtype of younger patients with the highest cortical GM volumes, proportionally lower GM volumes in basal ganglia and the highest frequency of cognitive fluctuations (n = 40, 24%). This study shows the existence of MRI subtypes in DLB, which may have implications for clinical workout, research, and therapeutic decisions.
- Publikační typ
- časopisecké články MeSH