Nejvíce citovaný článek - PubMed ID 33373483
Guidelines for Speech Recording and Acoustic Analyses in Dysarthrias of Movement Disorders
Speech abnormalities in Parkinson's disease (PD) are heterogeneous and often considered resistant to levodopa. However, human hearing may miss subtle treatment-related speech changes. Digital speech biomarkers offer a sensitive alternative to measure such changes objectively. Speech was recorded in 51 PD patients during ON and OFF medication states and compared to 43 healthy controls matched for language and gender. Acute levodopa effects were significant in prosodic (F0 standard deviation, p = 0.03, effect size = 0.47), respiratory (intensity slope, p = 0.02, effect size = 0.49), and spectral domains (LTAS mean, p = 0.01, effect size = 0.35). Stepwise backward regression identified 8 biomarkers reflecting hypokinetic symptoms, 6 for dyskinetic symptoms, and 7 for medication-state transitions. Hypokinetic compound score correlated strongly with MDS-UPDRS-III changes (r = 0.70; MAE = 6.06/92), and the dyskinetic compound score with dyskinesia ratings (r = 0.50; MAE = 1.81/12). Medication-state transitions were detected with AUC = 0.86. This study highlights the potential of digital speech biomarkers to objectively measure levodopa-induced changes in PD symptoms and medication states.
- Publikační typ
- časopisecké články MeSH
Head tremor is a common symptom in both essential tremor (ET) and cervical dystonia (CD). Distinguishing between these two conditions can be challenging in clinical practice, particularly when head tremor is the dominant feature. Our goal was to explore the potential of speech assessment in recognizing the mechanisms of head tremor in patients with ET and CD. Objective acoustic vocal assessments of oral diadochokinesis, phonatory stability, vocal tremor, and speech timing were performed. Of the 93 patients assessed, 39 had cervical dystonia (CD) with head tremor, 38 had ET with head tremor (ET-HT), and 16 had ET with no head tremor (ET-nHT). Compared to both CD and ET-nHT, ET-HT showed irregular sequential motion rate, excessive pitch fluctuations, increased noise, and higher extent of vocal vibrato. Compared to CD, ET-HT also demonstrated slower sequential motion rate, prolonged pauses, and a slower articulation rate. Additionally, ET-HT had more pronounced vocal tremolo compared to ET-nHT. Speech assessment provided discrimination between the CD and ET-HT groups with an area under curve of 0.80. This study underscores the promising potential of speech analysis in recognizing mechanisms of head tremor in patients with ET or CD, revealing more severe and distinct speech impairments in ET-HT patients compared to those with CD.
- Klíčová slova
- Acoustic analysis, Cervical dystonia, Dysarthria, Essential tremor, Speech disorder,
- Publikační typ
- časopisecké články MeSH
Over the past decade, neuropsychiatric fluctuations in Parkinson's disease (PD) have been increasingly recognized for their impact on patients' quality of life. Speech, a complex function carrying motor, emotional, and cognitive information, offers potential insights into these fluctuations. While previous studies have focused on acoustic analysis to assess motor speech disorders reliably, the potential of linguistic patterns associated with neuropsychiatric fluctuations in PD remains unexplored. This study analyzed the content of spontaneous speech from 33 PD patients in ON and OFF medication states, using machine learning and large language models (LLMs) to predict medication states and a neuropsychiatric state score. The top-performing model, the LLM Gemma-2 (9B), achieved 98% accuracy in differentiating ON and OFF states and its predicted scores were highly correlated with actual scores (Spearman's ρ = 0.81). These methods could provide a more comprehensive assessment of PD treatment effects, allowing remote neuropsychiatric symptom monitoring via mobile devices.
- Publikační typ
- časopisecké články MeSH
INTRODUCTION: Dysarthria, a motor speech disorder caused by muscle weakness or paralysis, severely impacts speech intelligibility and quality of life. The condition is prevalent in motor speech disorders such as Parkinson's disease (PD), atypical parkinsonism such as progressive supranuclear palsy (PSP), Huntington's disease (HD), and amyotrophic lateral sclerosis (ALS). Improving intelligibility is not only an outcome that matters to patients but can also play a critical role as an endpoint in clinical research and drug development. This study validates a digital measure for speech intelligibility, the ki: SB-M intelligibility score, across various motor speech disorders and languages following the Digital Medicine Society (DiMe) V3 framework. METHODS: The study used four datasets: healthy controls (HCs) and patients with PD, HD, PSP, and ALS from Czech, Colombian, and German populations. Participants' speech intelligibility was assessed using the ki: SB-M intelligibility score, which is derived from automatic speech recognition (ASR) systems. Verification with inter-ASR reliability and temporal consistency, analytical validation with correlations to gold standard clinical dysarthria scores in each disease, and clinical validation with group comparisons between HCs and patients were performed. RESULTS: Verification showed good to excellent inter-rater reliability between ASR systems and fair to good consistency. Analytical validation revealed significant correlations between the SB-M intelligibility score and established clinical measures for speech impairments across all patient groups and languages. Clinical validation demonstrated significant differences in intelligibility scores between pathological groups and healthy controls, indicating the measure's discriminative capability. DISCUSSION: The ki: SB-M intelligibility score is a reliable, valid, and clinically relevant tool for assessing speech intelligibility in motor speech disorders. It holds promise for improving clinical trials through automated, objective, and scalable assessments. Future studies should explore its utility in monitoring disease progression and therapeutic efficacy as well as add data from further dysarthrias to the validation.
Approximately 90% of Parkinson's patients (PD) suffer from dysarthria. However, there is currently a lack of research on acoustic measurements and speech impairment patterns among Mandarin-speaking individuals with PD. This study aims to assess the diagnosis and disease monitoring possibility in Mandarin-speaking PD patients through the recommended speech paradigm for non-tonal languages, and to explore the anatomical and functional substrates. We examined total of 160 native Mandarin-speaking Chinese participants consisting of 80 PD patients, 40 healthy controls (HC), and 40 MRI controls. We screened the optimal acoustic metric combination for PD diagnosis. Finally, we used the objective metrics to predict the patient's motor status using the Naïve Bayes model and analyzed the correlations between cortical thickness, subcortical volumes, functional connectivity, and network properties. Comprehensive acoustic screening based on prosodic, articulation, and phonation abnormalities allows differentiation between HC and PD with an area under the curve of 0.931. Patients with slowed reading exhibited atrophy of the fusiform gyrus (FDR p = 0.010, R = 0.391), reduced functional connectivity between the fusiform gyrus and motor cortex, and increased nodal local efficiency (NLE) and nodal efficiency (NE) in bilateral pallidum. Patients with prolonged pauses demonstrated atrophy in the left hippocampus, along with decreased NLE and NE. The acoustic assessment in Mandarin proves effective in diagnosis and disease monitoring for Mandarin-speaking PD patients, generalizing standardized acoustic guidelines beyond non-tonal languages. The speech impairment in Mandarin-speaking PD patients not only involves motor aspects of speech but also encompasses the cognitive processes underlying language generation.
- Publikační typ
- časopisecké články MeSH
- přehledy MeSH
Parkinson's disease (PD) and essential tremor (ET) are prevalent movement disorders that mainly affect elderly people, presenting diagnostic challenges due to shared clinical features. While both disorders exhibit distinct speech patterns-hypokinetic dysarthria in PD and hyperkinetic dysarthria in ET-the efficacy of speech assessment for differentiation remains unexplored. Developing technology for automatic discrimination could enable early diagnosis and continuous monitoring. However, the lack of data for investigating speech behavior in these patients has inhibited the development of a framework for diagnostic support. In addition, phonetic variability across languages poses practical challenges in establishing a universal speech assessment system. Therefore, it is necessary to develop models robust to the phonetic variability present in different languages worldwide. We propose a method based on Gaussian mixture models to assess domain adaptation from models trained in German and Spanish to classify PD and ET patients in Czech. We modeled three different speech dimensions: articulation, phonation, and prosody and evaluated the models' performance in both bi-class and tri-class classification scenarios (with the addition of healthy controls). Our results show that a fusion of the three speech dimensions achieved optimal results in binary classification, with accuracies up to 81.4 and 86.2% for monologue and /pa-ta-ka/ tasks, respectively. In tri-class scenarios, incorporating healthy speech signals resulted in accuracies of 63.3 and 71.6% for monologue and /pa-ta-ka/ tasks, respectively. Our findings suggest that automated speech analysis, combined with machine learning is robust, accurate, and can be adapted to different languages to distinguish between PD and ET patients.
- Publikační typ
- časopisecké články MeSH
BACKGROUND AND OBJECTIVE: An aging society requires easy-to-use approaches for diagnosis and monitoring of neurodegenerative disorders, such as Parkinson's disease (PD), so that clinicians can effectively adjust a treatment policy and improve patients' quality of life. Current methods of PD diagnosis and monitoring usually require the patients to come to a hospital, where they undergo several neurological and neuropsychological examinations. These examinations are usually time-consuming, expensive, and performed just a few times per year. Hence, this study explores the possibility of fusing computerized analysis of hypomimia and hypokinetic dysarthria (two motor symptoms manifested in the majority of PD patients) with the goal of proposing a new methodology of PD diagnosis that could be easily integrated into mHealth systems. METHODS: We enrolled 73 PD patients and 46 age- and gender-matched healthy controls, who performed several speech/voice tasks while recorded by a microphone and a camera. Acoustic signals were parametrized in the fields of phonation, articulation and prosody. Video recordings of a face were analyzed in terms of facial landmarks movement. Both modalities were consequently modeled by the XGBoost algorithm. RESULTS: The acoustic analysis enabled diagnosis of PD with 77% balanced accuracy, while in the case of the facial analysis, we observed 81% balanced accuracy. The fusion of both modalities increased the balanced accuracy to 83% (88% sensitivity and 78% specificity). The most informative speech exercise in the multimodality system turned out to be a tongue twister. Additionally, we identified muscle movements that are characteristic of hypomimia. CONCLUSIONS: The introduced methodology, which is based on the myriad of speech exercises likewise audio and video modality, allows for the detection of PD with an accuracy of up to 83%. The speech exercise - tongue twisters occurred to be the most valuable from the clinical point of view. Additionally, the clinical interpretation of the created models is illustrated. The presented computer-supported methodology could serve as an extra tool for neurologists in PD detection and the proposed potential solution of mHealth will facilitate the patient's and doctor's life.
- Klíčová slova
- Acoustic analysis, Facial analysis, Hypokinetic dysarthria, Hypomimia, Machine learning, Parkinson's disease,
- Publikační typ
- časopisecké články MeSH
AIM: To investigate the presence and relationship of temporal speech and gait parameters in patients with postural instability/gait disorder (PIGD) and tremor-dominant (TD) motor subtypes of Parkinson's disease (PD). METHODS: Speech samples and instrumented walkway system assessments were acquired from a total of 60 de-novo PD patients (40 in TD and 20 in PIGD subtype) and 40 matched healthy controls. Objective acoustic vocal assessment of seven distinct speech timing dimensions was related to instrumental gait measures including velocity, cadence, and stride length. RESULTS: Compared to controls, PIGD subtype showed greater consonant timing abnormalities by prolonged voice onset time (VOT) while also shorter stride length during both normal walking and dual task, while decreased velocity and cadence only during dual task. Speaking rate was faster in PIGD than TD subtype. In PIGD subtype, prolonged VOT correlated with slower gait velocity (r = -0.56, p = 0.01) and shorter stride length (r = -0.59, p = 0.008) during normal walking, whereas relationships were also found between decreased cadence in dual task and irregular alternating motion rates (r = -0.48, p = 0.04) and prolonged pauses (r = -0.50, p = 0.03). No correlation between speech and gait was detected in TD subtype. CONCLUSION: Our findings suggest that speech and gait rhythm disorder share similar underlying pathomechanisms specific for PIGD subtype.
- Klíčová slova
- Parkinson's disease, dysarthria, gait, postural instability gait difficulty, speech disorder,
- MeSH
- chůze (způsob) MeSH
- chůze MeSH
- lidé MeSH
- neurologické poruchy chůze * etiologie MeSH
- Parkinsonova nemoc * komplikace MeSH
- posturální rovnováha MeSH
- řeč MeSH
- tremor MeSH
- Check Tag
- lidé MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
INTRODUCTION: While the role of physiotherapy as part of a comprehensive inpatient rehabilitation is indisputable, clear evidence concerning the effectiveness of different rehabilitation managements [interdisciplinary implementing the International Classification of Functioning, disability and health (ICF) vs. multidisciplinary model] and physiotherapy categories (neuroproprioceptive "facilitation, inhibition" vs. motor/skill acquisitions using technologies) are still lacking. In this study, four kinds of comprehensive inpatient rehabilitation with different management and content of physical therapy will be compared. Moreover, focus will be placed on the identification of novel biological molecules reflective of effective rehabilitation. Long non-coding RNAs (lncRNAs) are transcripts (>200 bps) of limited coding potential, which have recently been recognized as key factors in neuronal signaling pathways in ischemic stroke and as such, may provide a valuable readout of patient recovery and neuroprotection during therapeutic progression. METHODS AND ANALYSIS: Adults after the first ischemic stroke in an early sub-acute phase with motor disability will be randomly assigned to one of four groups and undergo a 3 weeks comprehensive inpatient rehabilitation of different types: interdisciplinary team work using ICF model as a guide; multidisciplinary teamwork implementing neuroproprioceptive "facilitation and inhibition" physiotherapy; multidisciplinary teamwork implementing technology-based physiotherapy; and standard multidisciplinary teamwork. Primary (the Goal Attainment Scale, the Patient-Reported Outcomes Measurement Information System, and the World Health Organization Disability Assessment Schedule) and secondary (motor, cognitive, psychological, speech and swallowing functions, functional independence) outcomes will be measured. A blood sample will be obtained upon consent (20 mls; representing pre-rehabilitation molecular) before and after the inpatient program. Primary outcomes will be followed up again 3 and 12 months after the end of the program. The overarching aim of this study is to determine the effectiveness of various rehabilitation managements and physiotherapeutic categories implemented by patients post ischemic stroke via analysis of primary, secondary and long non-coding RNA readouts. This clinical trial will offer an innovative approach not previously tested and will provide new complex analysis along with public assessable molecular biological evidence of various rehabilitation methodology for the alleviation of the effects of ischemic stroke. CLINICAL TRIAL REGISTRATION: NCT05323916, https://clinicaltrials.gov/ct2/show/NCT05323916.
While speech disorder represents an early and prominent clinical feature of atypical parkinsonian syndromes such as multiple system atrophy (MSA) and progressive supranuclear palsy (PSP), little is known about the sensitivity of speech assessment as a potential diagnostic tool. Speech samples were acquired from 215 subjects, including 25 MSA, 20 PSP, 20 Parkinson's disease participants, and 150 healthy controls. The accurate differential diagnosis of dysarthria subtypes was based on the quantitative acoustic analysis of 26 speech dimensions related to phonation, articulation, prosody, and timing. A semi-supervised weighting-based approach was then applied to find the best feature combinations for separation between PSP and MSA. Dysarthria was perceptible in all PSP and MSA patients and consisted of a combination of hypokinetic, spastic, and ataxic components. Speech features related to respiratory dysfunction, imprecise consonants, monopitch, slow speaking rate, and subharmonics contributed to worse performance in PSP than MSA, whereas phonatory instability, timing abnormalities, and articulatory decay were more distinctive for MSA compared to PSP. The combination of distinct speech patterns via objective acoustic evaluation was able to discriminate between PSP and MSA with very high accuracy of up to 89% as well as between PSP/MSA and PD with up to 87%. Dysarthria severity in MSA/PSP was related to overall disease severity. Speech disorders reflect the differing underlying pathophysiology of tauopathy in PSP and α-synucleinopathy in MSA. Vocal assessment may provide a low-cost alternative screening method to existing subjective clinical assessment and imaging diagnostic approaches.
- Publikační typ
- časopisecké články MeSH