Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals

. 2024 Nov 12 ; 11 (1) : 1221. [epub] 20241112

Jazyk angličtina Země Velká Británie, Anglie Médium electronic

Typ dokumentu dataset, časopisecké články

Perzistentní odkaz   https://www.medvik.cz/link/pmid39532912

Grantová podpora
FIT-S-23-8278 Vysoké Učení Technické v Brně (Brno University of Technology)

Odkazy

PubMed 39532912
PubMed Central PMC11557825
DOI 10.1038/s41597-024-03991-w
PII: 10.1038/s41597-024-03991-w
Knihovny.cz E-zdroje

Early identification of cognitive or physical overload is critical in fields where human decision making matters when preventing threats to safety and property. Pilots, drivers, surgeons, and operators of nuclear plants are among those affected by this challenge, as acute stress can impair their cognition. In this context, the significance of paralinguistic automatic speech processing increases for early stress detection. The intensity, intonation, and cadence of an utterance are examples of paralinguistic traits that determine the meaning of a sentence and are often lost in the verbatim transcript. To address this issue, tools are being developed to recognize paralinguistic traits effectively. However, a data bottleneck still exists in the training of paralinguistic speech traits, and the lack of high-quality reference data for the training of artificial systems persists. Regarding this, we present an original empirical dataset collected using the BESST experimental protocol for capturing speech signals under induced stress. With this data, our aim is to promote the development of pre-emptive intervention systems based on stress estimation from speech.

Zobrazit více v PubMed

Baratta, A. & Colletta, T.

Bernard, C.

Cannon, W. B. Organization for physiological homeostasis. DOI

CANNON, W. B. “voodoo” death. PubMed DOI PMC

Hansen, J. H. L. & Clements, M. A. Evaluation of speech under stress and emotional conditions. DOI

Yap, T. F.

Hansen, J. H. L. Susas ldc99s78. https://catalog.ldc.upenn.edu/LDC99S78 Last accessed on 2022-04-15 (1999).

Hansen, J. H. L. Susas transcripts ldc99t33. https://catalog.ldc.upenn.edu/LDC99T33 Last accessed on 2022-04-15 (1999).

Pešán, J.

Smeets, T. PubMed DOI

Cohen, S., Kamarck, T. & Mermelstein, R. A global measure of perceived stress. PubMed DOI

Pešán, J., Juřík, V., Kolářová, J., Chudý, P. & Černocký, J. Besst: An experimental protocol for collecting speech stress datasets for machine learning (2023). Manuscript submitted for publication.

Hedberg, A. G. Review of state-trait anxiety inventory. DOI

Likert, R.

Hart, S. G. & Staveland, L. E. Development of NASA-TLX (task load index): Results of empirical and theoretical research. In

Smital, L., Marsanova, L., Smisek, R., Nemcova, A. & Vitek, M. Robust qrs detection using combination of three independent methods. In

Plesinger, F., Jurco, J., Halamek, J. & Jurak, P. Signalplant: an open signal processing software platform. PubMed DOI

Pešán, J., Kesiraju, S., Burget, L. & Černocký, J. Beyond the labels: Unveiling text-dependency in paralinguistic speech recognition datasets (2024).

Najít záznam

Citační ukazatele

Pouze přihlášení uživatelé

Možnosti archivace

Nahrávání dat ...