Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis
T. Haderlein, C. Schwemmle, M. Döllinger, V. Matoušek, M. Ptok, E. Nöth
Language English
Country United States
Document type Journal Article, Research Support, Non-U.S. Gov't
NLK
Free Medical Journals, from 2011
PubMed Central, from 2011
Europe PubMed Central, from 2011
Open Access Digital Library, from 1997-01-01
Open Access Digital Library, from 2006-01-01
Open Access Digital Library, from 2011-01-01
Medline Complete (EBSCOhost), from 2006-03-01 to 2023-06-29
Wiley-Blackwell Open Access Titles, from 1997
PubMed 26136813
DOI 10.1155/2015/316325
Knihovny.cz E-resources
- MeSH
- Hoarseness diagnosis MeSH
- Child MeSH
- Adult MeSH
- Voice Quality * MeSH
- Middle Aged MeSH
- Humans MeSH
- Adolescent MeSH
- Young Adult MeSH
- Speech Perception MeSH
- Signal Processing, Computer-Assisted * MeSH
- Voice Disorders diagnosis MeSH
- Speech * MeSH
- Speech Therapy MeSH
- Regression Analysis MeSH
- Reproducibility of Results MeSH
- Aged, 80 and over MeSH
- Aged MeSH
- Software MeSH
- Sound Spectrography methods MeSH
- Check Tag
- Child MeSH
- Adult MeSH
- Middle Aged MeSH
- Humans MeSH
- Adolescent MeSH
- Young Adult MeSH
- Male MeSH
- Aged, 80 and over MeSH
- Aged MeSH
- Female MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
Due to low intra- and interrater reliability, perceptual voice evaluation should be supported by objective, automatic methods. In this study, text-based, computer-aided prosodic analysis and measurements of connected speech were combined in order to model perceptual evaluation of the German Roughness-Breathiness-Hoarseness (RBH) scheme. 58 connected speech samples (43 women and 15 men; 48.7 ± 17.8 years) containing the German version of the text "The North Wind and the Sun" were evaluated perceptually by 19 speech and voice therapy students according to the RBH scale. For the human-machine correlation, Support Vector Regression with measurements of the vocal fold cycle irregularities (CFx) and the closed phases of vocal fold vibration (CQx) of the Laryngograph and 33 features from a prosodic analysis module were used to model the listeners' ratings. The best human-machine results for roughness were obtained from a combination of six prosodic features and CFx (r = 0.71, ρ = 0.57). These correlations were approximately the same as the interrater agreement among human raters (r = 0.65, ρ = 0.61). CQx was one of the substantial features of the hoarseness model. For hoarseness and breathiness, the human-machine agreement was substantially lower. Nevertheless, the automatic analysis method can serve as the basis for a meaningful objective support for perceptual analysis.
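The human-machine agreement reported in the abstract is given as Pearson's r and Spearman's ρ between the listeners' ratings and the model output. The following is a minimal sketch of how such a pair of coefficients can be computed, not the authors' implementation. The function names and the sample values are hypothetical and are not taken from the study. The regression model itself (Support Vector Regression) is not reproduced here.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """Ranks starting at 1; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rank correlation: Pearson's r computed on average ranks."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical data: mean listener roughness ratings per speaker and the
# corresponding model predictions (illustrative values only).
human_ratings = [0.4, 1.2, 2.1, 0.9, 2.6, 1.7]
machine_scores = [0.6, 1.0, 1.8, 1.1, 2.2, 2.0]

r = pearson_r(human_ratings, machine_scores)
rho = spearman_rho(human_ratings, machine_scores)
```

Reporting both coefficients is useful because r measures linear agreement with the ratings while ρ only measures agreement in rank order, so the two can differ for the same model, as they do in the abstract.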
- 000
- 00000naa a2200000 a 4500
- 001
- bmc16020596
- 003
- CZ-PrNML
- 005
- 20160725121734.0
- 007
- ta
- 008
- 160722s2015 xxu f 000 0|eng||
- 009
- AR
- 024 7_
- $a 10.1155/2015/316325 $2 doi
- 035 __
- $a (PubMed)26136813
- 040 __
- $a ABA008 $b cze $d ABA008 $e AACR2
- 041 0_
- $a eng
- 044 __
- $a xxu
- 100 1_
- $a Haderlein, Tino $u Lehrstuhl für Mustererkennung, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Martensstraße 3, 91058 Erlangen, Germany.
- 245 10
- $a Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis / $c T. Haderlein, C. Schwemmle, M. Döllinger, V. Matoušek, M. Ptok, E. Nöth,
- 520 9_
- $a Due to low intra- and interrater reliability, perceptual voice evaluation should be supported by objective, automatic methods. In this study, text-based, computer-aided prosodic analysis and measurements of connected speech were combined in order to model perceptual evaluation of the German Roughness-Breathiness-Hoarseness (RBH) scheme. 58 connected speech samples (43 women and 15 men; 48.7 ± 17.8 years) containing the German version of the text "The North Wind and the Sun" were evaluated perceptually by 19 speech and voice therapy students according to the RBH scale. For the human-machine correlation, Support Vector Regression with measurements of the vocal fold cycle irregularities (CFx) and the closed phases of vocal fold vibration (CQx) of the Laryngograph and 33 features from a prosodic analysis module were used to model the listeners' ratings. The best human-machine results for roughness were obtained from a combination of six prosodic features and CFx (r = 0.71, ρ = 0.57). These correlations were approximately the same as the interrater agreement among human raters (r = 0.65, ρ = 0.61). CQx was one of the substantial features of the hoarseness model. For hoarseness and breathiness, the human-machine agreement was substantially lower. Nevertheless, the automatic analysis method can serve as the basis for a meaningful objective support for perceptual analysis.
- 650 _2
- $a mladiství $7 D000293
- 650 _2
- $a dospělí $7 D000328
- 650 _2
- $a senioři $7 D000368
- 650 _2
- $a senioři nad 80 let $7 D000369
- 650 _2
- $a dítě $7 D002648
- 650 _2
- $a ženské pohlaví $7 D005260
- 650 _2
- $a chrapot $x diagnóza $7 D006685
- 650 _2
- $a lidé $7 D006801
- 650 _2
- $a mužské pohlaví $7 D008297
- 650 _2
- $a lidé středního věku $7 D008875
- 650 _2
- $a regresní analýza $7 D012044
- 650 _2
- $a reprodukovatelnost výsledků $7 D015203
- 650 12
- $a počítačové zpracování signálu $7 D012815
- 650 _2
- $a software $7 D012984
- 650 _2
- $a zvuková spektrografie $x metody $7 D013018
- 650 12
- $a řeč $7 D013060
- 650 _2
- $a percepce řeči $7 D013067
- 650 _2
- $a řečová terapie $7 D013070
- 650 _2
- $a poruchy hlasu $x diagnóza $7 D014832
- 650 12
- $a kvalita hlasu $7 D014833
- 650 _2
- $a mladý dospělý $7 D055815
- 655 _2
- $a časopisecké články $7 D016428
- 655 _2
- $a práce podpořená grantem $7 D013485
- 700 1_
- $a Schwemmle, Cornelia $u Klinik für Hals-, Nasen-, Ohrenheilkunde, Universitätsklinikum Magdeburg, Leipziger Straße 44, 39120 Magdeburg, Germany.
- 700 1_
- $a Döllinger, Michael $u Phoniatrische und Pädaudiologische Abteilung, Klinikum der Universität Erlangen-Nürnberg, Bohlenplatz 21, 91054 Erlangen, Germany.
- 700 1_
- $a Matoušek, Václav $u Department of Computer Science and Engineering, University of West Bohemia in Pilsen, Univerzitní 8, 306 14 Plzeň, Czech Republic.
- 700 1_
- $a Ptok, Martin $u Klinik für Phoniatrie und Pädaudiologie, Medizinische Hochschule Hannover, Carl-Neuberg-Straße 1, 30625 Hannover, Germany.
- 700 1_
- $a Nöth, Elmar $u Lehrstuhl für Mustererkennung, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Martensstraße 3, 91058 Erlangen, Germany.
- 773 0_
- $w MED00173439 $t Computational and mathematical methods in medicine $x 1748-6718 $g Roč. 2015, č. - (2015), s. 316325
- 856 41
- $u https://pubmed.ncbi.nlm.nih.gov/26136813 $y Pubmed
- 910 __
- $a ABA008 $b sig $c sign $y a $z 0
- 990 __
- $a 20160722 $b ABA008
- 991 __
- $a 20160725121952 $b ABA008
- 999 __
- $a ok $b bmc $g 1155266 $s 945124
- BAS __
- $a 3
- BAS __
- $a PreBMC
- BMC __
- $a 2015 $b 2015 $c - $d 316325 $e 20150602 $i 1748-6718 $m Computational and mathematical methods in medicine $n Comput Math Methods Med $x MED00173439
- LZP __
- $a Pubmed-20160722