BACKGROUND AND OBJECTIVES: Patients with synucleinopathies such as multiple system atrophy (MSA) and Parkinson's disease (PD) frequently display speech and language abnormalities. We explore the diagnostic potential of automated linguistic analysis of natural spontaneous speech to differentiate MSA and PD. METHODS: Spontaneous speech of 39 participants with MSA compared to 39 drug-naive PD and 39 healthy controls matched for age and sex was transcribed and linguistically annotated using automatic speech recognition and natural language processing. A quantitative analysis was performed using 6 lexical and syntactic and 2 acoustic features. Results were compared with human-controlled analysis to assess the robustness of the approach. Diagnostic accuracy was evaluated using sensitivity analysis. RESULTS: Despite similar disease duration, linguistic abnormalities were generally more severe in MSA than in PD, leading to high diagnostic accuracy with an area under the curve of 0.81. Compared to controls, MSA showed decreased grammatical component usage, more repetitive phrases, shorter sentences, reduced sentence development, slower articulation rate, and increased duration of pauses, whereas PD had only shorter sentences, reduced sentence development, and longer pauses. Only slower articulation rate was distinctive for MSA while unchanged for PD relative to controls. The highest correlation was found between bulbar/pseudobulbar clinical score and sentence length (r = -0.49, p = 0.002). Despite the relatively high severity of dysarthria in MSA, a strong agreement between manually and automatically computed results was achieved. DISCUSSION: Automated linguistic analysis may offer an objective, cost-effective, and widely applicable biomarker to differentiate synucleinopathies with similar clinical manifestations.
- MeSH
- Diagnosis, Differential MeSH
- Middle Aged MeSH
- Humans MeSH
- Multiple System Atrophy * diagnosis physiopathology complications MeSH
- Parkinson Disease * diagnosis complications physiopathology MeSH
- Speech physiology MeSH
- Aged MeSH
- Natural Language Processing MeSH
- Check Tag
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
OBJECTIVE: This study assessed the relationship between speech and language impairment and outcome in a multicenter cohort of isolated/idiopathic rapid eye movement (REM) sleep behavior disorder (iRBD). METHODS: Patients with iRBD from 7 centers speaking Czech, English, German, French, and Italian languages underwent a detailed speech assessment at baseline. Story-tale narratives were transcribed and linguistically annotated using fully automated methods based on automatic speech recognition and natural language processing algorithms, leading to the 3 distinctive linguistic and 2 acoustic patterns of language deterioration and associated composite indexes of their overall severity. Patients were then prospectively followed and received assessments for parkinsonism or dementia during follow-up. The Cox proportional hazard was performed to evaluate the predictive value of language patterns for phenoconversion over a follow-up period of 5 years. RESULTS: Of 180 patients free of parkinsonism or dementia, 156 provided follow-up information. After a mean follow-up of 2.7 years, 42 (26.9%) patients developed neurodegenerative disease. Patients with higher severity of linguistic abnormalities (hazard ratio [HR = 2.35]) and acoustic abnormalities (HR = 1.92) were more likely to develop a defined neurodegenerative disease, with converters having lower content richness (HR = 1.74), slower articulation rate (HR = 1.58), and prolonged pauses (HR = 1.46). Dementia-first (n = 16) and parkinsonism-first with mild cognitive impairment (n = 9) converters had higher severity of linguistic abnormalities than parkinsonism-first with normal cognition converters (n = 17). INTERPRETATION: Automated language analysis might provide a predictor of phenoconversion from iRBD into synucleinopathy subtypes with cognitive impairment, and thus can be used to stratify patients for neuroprotective trials. ANN NEUROL 2024;95:530-543.
Cieľom výskumu bolo analyzovať heterogenitu kognitívneho deficitu u ľudí závislých od alkoholu a identifikovať empirické typy, ktoré sa líšia v miere oslabenia kognitívnych funkcií v doménach pamäť, pozornosť, jazyk a reč, exekutívne funkcie a psychomotorické tempo. Výskumu sa zúčastnilo 53 pacientov v procese liečby závislosti od alkoholu hospitalizovaných v Odbornom liečebnom ústave psychiatrickom n.o. na Prednej Hore vo veku od 19 do 55 rokov. Na posúdenie kognitívnych výkonov boli použité Test verbálnej fluencie, Pamäťový test učenia slov, Test kódovania symbolov, Test opakovania čísel odpredu a odzadu, Test cesty a Batéria frontálnych funkcií. Pomocou klastrovej analýzy sme identifikovali nasledujúce 4 typy participantov: 1. participanti so zachovanými kognitívnymi funkciami a s kognitívnou rezervou, 2. participanti bez kognitívneho deficitu, 3. participanti s miernym oslabením exekutívnych funkcií, 4. participanti s globálnym kognitívnym deficitom. Z hľadiska vecnej významnosti boli medzi skupinami zistené nezanedbateľné rozdiely z hľadiska veku, vzdelania a dĺžky excesívneho pitia. Výsledky výskumu poukazujú na heterogenitu kognitívneho deficitu u ľudí závislých od alkoholu a možnosť identifikácie viacerých podskupín, ktoré sa z kvantitatívneho aj kvalitatívneho hľadiska líšia v miere oslabenia kognitívnych funkcií.
The aim of the research was to analyze the heterogeneity of cognitive deficit in people with alcohol use disorders and to identify empirical types that differ in the degree of cognitive impairment across the domains of memory, attention, language and speech, executive function, and psychomotor speed. The study involved 53 patients in the process of treatment of alcohol use disorders hospitalized in the Specialized Psychiatric Institute in Predná Hora aged 19 to 55 years. Word Fluency Test, Auditory Verbal Learning Test, Symbol Encoding Test, Forward and Backward Digit Span Test, Trail Making Test and Frontal Assessment Battery were used to quantify cognitive performance. Using cluster analysis, we identified the following 4 types of participants: 1. participants with preserved cognitive functions and cognitive reserve, 2. participants without cognitive deficit, 3. participants with an incipient mild impairment of executive functioning, 4. participants with a global cognitive deficit. In terms of substantive significance, significant differences were found between the groups in terms of age, education and duration of excessive drinking. The results of this study shed light on the heterogeneity of cognitive deficits in people with alcohol use disorders, and the possibility of identifying several subgroups that differ quantitatively and qualitatively in their degree of cognitive impairment
- MeSH
- Alcoholism psychology MeSH
- Adult MeSH
- Empirical Research MeSH
- Inpatients * classification MeSH
- Cognitive Dysfunction * MeSH
- Middle Aged MeSH
- Humans MeSH
- Young Adult MeSH
- Neuropsychological Tests statistics & numerical data MeSH
- Alcohol-Related Disorders * psychology MeSH
- Check Tag
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Young Adult MeSH
BACKGROUND: Speech dysfunction represents one of the initial motor manifestations to develop in Parkinson's disease (PD) and is measurable through smartphone. OBJECTIVE: The aim was to develop a fully automated and noise-resistant smartphone-based system that can unobtrusively screen for prodromal parkinsonian speech disorder in subjects with isolated rapid eye movement sleep behavior disorder (iRBD) in a real-world scenario. METHODS: This cross-sectional study assessed regular, everyday voice call data from individuals with iRBD compared to early PD patients and healthy controls via a developed smartphone application. The participants also performed an active, regular reading of a short passage on their smartphone. Smartphone data were continuously collected for up to 3 months after the standard in-person assessments at the clinic. RESULTS: A total of 3525 calls that led to 5990 minutes of preprocessed speech were extracted from 72 participants, comprising 21 iRBD patients, 26 PD patients, and 25 controls. With a high area under the curve of 0.85 between iRBD patients and controls, the combination of passive and active smartphone data provided a comparable or even more sensitive evaluation than laboratory examination using a high-quality microphone. The most sensitive features to induce prodromal neurodegeneration in iRBD included imprecise vowel articulation during phone calls (P = 0.03) and monopitch in reading (P = 0.05). Eighteen minutes of speech corresponding to approximately nine calls was sufficient to obtain the best sensitivity for the screening. CONCLUSION: We consider the developed tool widely applicable to deep longitudinal digital phenotyping data with future applications in neuroprotective trials, deep brain stimulation optimization, neuropsychiatry, speech therapy, population screening, and beyond. © 2024 The Author(s). Movement Disorders published by Wiley Periodicals LLC on behalf of International Parkinson and Movement Disorder Society.
- MeSH
- Biomarkers MeSH
- Smartphone * MeSH
- Voice physiology MeSH
- Middle Aged MeSH
- Humans MeSH
- Parkinson Disease * physiopathology complications MeSH
- Parkinsonian Disorders physiopathology MeSH
- REM Sleep Behavior Disorder * physiopathology diagnosis MeSH
- Speech Disorders etiology MeSH
- Prodromal Symptoms MeSH
- Cross-Sectional Studies MeSH
- Aged MeSH
- Check Tag
- Middle Aged MeSH
- Humans MeSH
- Male MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
Verifying the speaker of a speech fragment can be crucial in attributing a crime to a suspect. The question can be addressed given disputed and reference speech material, adopting the recommended and scientifically accepted likelihood ratio framework for reporting evidential strength in court. In forensic practice, usually, auditory and acoustic analyses are performed to carry out such a verification task considering a diversity of features, such as language competence, pronunciation, or other linguistic features. Automated speaker comparison systems can also be used alongside those manual analyses. State-of-the-art automatic speaker comparison systems are based on deep neural networks that take acoustic features as input. Additional information, though, may be obtained from linguistic analysis. In this paper, we aim to answer if, when and how modern acoustic-based systems can be complemented by an authorship technique based on frequent words, within the likelihood ratio framework. We consider three different approaches to derive a combined likelihood ratio: using a support vector machine algorithm, fitting bivariate normal distributions, and passing the score of the acoustic system as additional input to the frequent-word analysis. We apply our method to the forensically relevant dataset FRIDA and the FISHER corpus, and we explore under which conditions fusion is valuable. We evaluate our results in terms of log likelihood ratio cost (Cllr) and equal error rate (EER). We show that fusion can be beneficial, especially in the case of intercepted phone calls with noise in the background.
- MeSH
- Speech Acoustics MeSH
- Algorithms MeSH
- Humans MeSH
- Linguistics MeSH
- Likelihood Functions MeSH
- Speech MeSH
- Forensic Sciences * methods MeSH
- Support Vector Machine MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
Parkinson's disease (PD) and essential tremor (ET) are prevalent movement disorders that mainly affect elderly people, presenting diagnostic challenges due to shared clinical features. While both disorders exhibit distinct speech patterns-hypokinetic dysarthria in PD and hyperkinetic dysarthria in ET-the efficacy of speech assessment for differentiation remains unexplored. Developing technology for automatic discrimination could enable early diagnosis and continuous monitoring. However, the lack of data for investigating speech behavior in these patients has inhibited the development of a framework for diagnostic support. In addition, phonetic variability across languages poses practical challenges in establishing a universal speech assessment system. Therefore, it is necessary to develop models robust to the phonetic variability present in different languages worldwide. We propose a method based on Gaussian mixture models to assess domain adaptation from models trained in German and Spanish to classify PD and ET patients in Czech. We modeled three different speech dimensions: articulation, phonation, and prosody and evaluated the models' performance in both bi-class and tri-class classification scenarios (with the addition of healthy controls). Our results show that a fusion of the three speech dimensions achieved optimal results in binary classification, with accuracies up to 81.4 and 86.2% for monologue and /pa-ta-ka/ tasks, respectively. In tri-class scenarios, incorporating healthy speech signals resulted in accuracies of 63.3 and 71.6% for monologue and /pa-ta-ka/ tasks, respectively. Our findings suggest that automated speech analysis, combined with machine learning is robust, accurate, and can be adapted to different languages to distinguish between PD and ET patients.
- Publication type
- Journal Article MeSH
BACKGROUND: Impairment of higher language functions associated with natural spontaneous speech in multiple sclerosis (MS) remains underexplored. OBJECTIVES: We presented a fully automated method for discriminating MS patients from healthy controls based on lexical and syntactic linguistic features. METHODS: We enrolled 120 MS individuals with Expanded Disability Status Scale ranging from 1 to 6.5 and 120 age-, sex-, and education-matched healthy controls. Linguistic analysis was performed with fully automated methods based on automatic speech recognition and natural language processing techniques using eight lexical and syntactic features acquired from the spontaneous discourse. Fully automated annotations were compared with human annotations. RESULTS: Compared with healthy controls, lexical impairment in MS consisted of an increase in content words (p = 0.037), a decrease in function words (p = 0.007), and overuse of verbs at the expense of noun (p = 0.047), while syntactic impairment manifested as shorter utterance length (p = 0.002), and low number of coordinate clause (p < 0.001). A fully automated language analysis approach enabled discrimination between MS and controls with an area under the curve of 0.70. A significant relationship was detected between shorter utterance length and lower symbol digit modalities test score (r = 0.25, p = 0.008). Strong associations between a majority of automatically and manually computed features were observed (r > 0.88, p < 0.001). CONCLUSION: Automated discourse analysis has the potential to provide an easy-to-implement and low-cost language-based biomarker of cognitive decline in MS for future clinical trials.
- Publication type
- Journal Article MeSH
BACKGROUND AND PURPOSE: Motor speech alterations are a prominent feature of clinically manifest Huntington's disease (HD). Objective acoustic analysis of speech can quantify speech alterations. It is currently unknown, however, at what stage of HD speech alterations can be reliably detected. We aimed to explore the patterns and extent of speech alterations using objective acoustic analysis in HD and to assess correlations with both rater-assessed phenotypical features and biological determinants of HD. METHODS: Speech samples were acquired from 44 premanifest (29 pre-symptomatic and 15 prodromal) and 25 manifest HD gene expansion carriers, and 25 matched healthy controls. A quantitative automated acoustic analysis of 10 speech dimensions was performed. RESULTS: Automated speech analysis allowed us to differentiate between participants with HD and controls, with areas under the curve of 0.74 for pre-symptomatic, 0.92 for prodromal, and 0.97 for manifest stages. In addition to irregular alternating motion rates and prolonged pauses seen only in manifest HD, both prodromal and manifest HD displayed slowed articulation rate, slowed alternating motion rates, increased loudness variability, and unstable steady-state position of articulators. In participants with premanifest HD, speech alteration severity was associated with cognitive slowing (r = -0.52, p < 0.001) and the extent of bradykinesia (r = 0.43, p = 0.004). Speech alterations correlated with a measure of exposure to mutant gene products (CAG-age-product score; r = 0.60, p < 0.001). CONCLUSION: Speech abnormalities in HD are associated with other motor and cognitive deficits and are measurable already in premanifest stages of HD. Therefore, automated speech analysis might represent a quantitative HD biomarker with potential for assessing disease progression.
BACKGROUND: The mechanisms underlying speech abnormalities in Parkinson's disease (PD) remain poorly understood, with most of the available evidence based on male patients. This study aimed to estimate the occurrence and characteristics of speech disorder in early, drug-naive PD patients with relation to gender and dopamine transporter imaging. METHODS: Speech samples from 60 male and 40 female de novo PD patients as well as 60 male and 40 female age-matched healthy controls were analyzed. Quantitative acoustic vocal assessment of 10 distinct speech dimensions related to phonation, articulation, prosody, and speech timing was performed. All patients were evaluated using [123]I-2b-carbomethoxy-3b-(4-iodophenyl)-N-(3-fluoropropyl) nortropane single-photon emission computed tomography and Montreal Cognitive Assessment. RESULTS: The prevalence of speech abnormalities in the de novo PD cohort was 56% for male and 65% for female patients, mainly manifested with monopitch, monoloudness, and articulatory decay. Automated speech analysis enabled discrimination between PD and controls with an area under the curve of 0.86 in men and 0.93 in women. No gender-specific speech dysfunction in de novo PD was found. Regardless of disease status, females generally showed better performance in voice quality, consonant articulation, and pauses production than males, who were better only in loudness variability. The extent of monopitch was correlated to nigro-putaminal dopaminergic loss in men (r = 0.39, p = 0.003) and the severity of imprecise consonants was related to cognitive deficits in women (r = -0.44, p = 0.005). CONCLUSIONS: Speech abnormalities represent a frequent and early marker of motor abnormalities in PD. Despite some gender differences, our findings demonstrate that speech difficulties are associated with nigro-putaminal dopaminergic deficits.
- MeSH
- Dopamine MeSH
- Tomography, Emission-Computed, Single-Photon MeSH
- Humans MeSH
- Parkinson Disease * complications diagnostic imaging MeSH
- Speech Disorders diagnostic imaging etiology MeSH
- Speech * MeSH
- Check Tag
- Humans MeSH
- Male MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Even though hypomimia is a hallmark of Parkinson's disease (PD), objective and easily interpretable tools to capture the disruption of spontaneous and deliberate facial movements are lacking. This study aimed to develop a fully automatic video-based hypomimia assessment tool and estimate the prevalence and characteristics of hypomimia in de-novo PD patients with relation to clinical and dopamine transporter imaging markers. For this cross-sectional study, video samples of spontaneous speech were collected from 91 de-novo, drug-naïve PD participants and 75 age and sex-matched healthy controls. Twelve facial markers covering areas of forehead, nose root, eyebrows, eyes, lateral canthal areas, cheeks, mouth, and jaw were used to quantitatively describe facial dynamics. All patients were evaluated using Movement Disorder Society-Unified PD Rating Scale and Dopamine Transporter Single-Photon Emission Computed Tomography. Newly developed automated facial analysis tool enabled high-accuracy discrimination between PD and controls with area under the curve of 0.87. The prevalence of hypomimia in de-novo PD cohort was 57%, mainly associated with dysfunction of mouth and jaw movements, and decreased variability in forehead and nose root wrinkles (p < 0.001). Strongest correlation was found between reduction of lower lip movements and nigro-putaminal dopaminergic loss (r = 0.32, p = 0.002) as well as limb bradykinesia/rigidity scores (r = -0.37 p < 0.001). Hypomimia represents a frequent, early marker of motor impairment in PD that can be robustly assessed via automatic video-based analysis. Our results support an association between striatal dopaminergic deficit and hypomimia in PD.
- Publication type
- Journal Article MeSH