Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals
Language English Country Great Britain, England Medium electronic
Document type dataset, journal articles
Grant support
FIT-S-23-8278
Vysoké Učení Technické v Brně (Brno University of Technology)
PubMed
39532912
PubMed Central
PMC11557825
DOI
10.1038/s41597-024-03991-w
PII: 10.1038/s41597-024-03991-w
- MeSH
- Humans MeSH
- Stress, Psychological MeSH
- Speech * MeSH
- Machine Learning * MeSH
- Check Tag
- Humans MeSH
- Publication type
- Journal Article MeSH
- Dataset MeSH
Early identification of cognitive or physical overload is critical in fields where human decision making matters when preventing threats to safety and property. Pilots, drivers, surgeons, and operators of nuclear plants are among those affected by this challenge, as acute stress can impair their cognition. In this context, the significance of paralinguistic automatic speech processing increases for early stress detection. The intensity, intonation, and cadence of an utterance are examples of paralinguistic traits that determine the meaning of a sentence and are often lost in the verbatim transcript. To address this issue, tools are being developed to recognize paralinguistic traits effectively. However, a data bottleneck still exists in the training of paralinguistic speech traits, and the lack of high-quality reference data for the training of artificial systems persists. Regarding this, we present an original empirical dataset collected using the BESST experimental protocol for capturing speech signals under induced stress. With this data, our aim is to promote the development of pre-emptive intervention systems based on stress estimation from speech.
Department of Psychology Faculty of Arts Masaryk University Brno Czech Republic
Faculty of Information Technology Brno University of Technology Brno Czech Republic
Institute of Computer Science University of Würzburg Würzburg Germany