Article

PubMed

This record comes from PubMed

Fusing linguistic and acoustic information for automated forensic speaker comparison

Authors:
Sergidou, E K: Netherlands Forensic Institute, PO Box 24044, 2490 AA The Hague, the Netherlands; University of Amsterdam, Science Park 904, 1098 XH Amsterdam, the Netherlands. Electronic address: e.sergidou@nfi.nl
Ypma, Rolf: Netherlands Forensic Institute, PO Box 24044, 2490 AA The Hague, the Netherlands
Rohdin, Johan: Brno University of Technology, Božetěchova 2, Brno 61266, Czech Republic
Worring, Marcel: University of Amsterdam, Science Park 904, 1098 XH Amsterdam, the Netherlands
Geradts, Zeno: Netherlands Forensic Institute, PO Box 24044, 2490 AA The Hague, the Netherlands; University of Amsterdam, Science Park 904, 1098 XH Amsterdam, the Netherlands
Bosma, Wauter: Netherlands Forensic Institute, PO Box 24044, 2490 AA The Hague, the Netherlands

Science & justice. 2024 Sep; 64(5): 485-497. [epub] 20240709

Sci Justice
ISSN 1876-4452 | 1355-0306
Source

Language English Country Great Britain, England Media print-electronic

Document type Journal Article

Persistent link https://www.medvik.cz/link/pmid39277331

PubMed 39277331
DOI 10.1016/j.scijus.2024.07.001
PII: S1355-0306(24)00056-X

Keywords
Forensic speaker comparison, Frequent-word analysis, Information fusion, Likelihood ratio framework, Multi-modal analysis
MeSH
Speech Acoustics MeSH
Algorithms MeSH
Humans MeSH
Linguistics MeSH
Likelihood Functions MeSH
Speech MeSH
Forensic Sciences * methods MeSH
Support Vector Machine MeSH
Check Tag
Humans MeSH
Publication type
Journal Article MeSH

Verifying the speaker of a speech fragment can be crucial in attributing a crime to a suspect. Given disputed and reference speech material, the question can be addressed within the likelihood ratio framework, the recommended and scientifically accepted way of reporting evidential strength in court. In forensic practice, such verification usually relies on auditory and acoustic analyses that consider a range of features, such as language competence, pronunciation, or other linguistic features. Automated speaker comparison systems can be used alongside those manual analyses. State-of-the-art automatic speaker comparison systems are based on deep neural networks that take acoustic features as input. Linguistic analysis, however, may provide additional information. In this paper, we investigate whether, when, and how modern acoustic-based systems can be complemented by an authorship technique based on frequent words, within the likelihood ratio framework. We consider three approaches to deriving a combined likelihood ratio: using a support vector machine algorithm, fitting bivariate normal distributions, and passing the score of the acoustic system as additional input to the frequent-word analysis. We apply our method to the forensically relevant dataset FRIDA and to the FISHER corpus, and we explore the conditions under which fusion is valuable. We evaluate our results in terms of log-likelihood-ratio cost (Cllr) and equal error rate (EER). We show that fusion can be beneficial, especially for intercepted phone calls with background noise.
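To make the bivariate-normal fusion and the Cllr metric mentioned in the abstract more concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes each comparison yields a pair (acoustic score, frequent-word score), fits one bivariate normal to same-speaker pairs and one to different-speaker pairs, and takes the density ratio as the fused likelihood ratio. Cllr follows its standard definition. All function names and the toy score values are hypothetical and chosen only for illustration.

```python
import math
from statistics import mean


def fit_bivariate_normal(points):
    """Estimate the mean and unbiased covariance of 2-D score pairs."""
    n = len(points)
    ma = mean(p[0] for p in points)
    mb = mean(p[1] for p in points)
    saa = sum((p[0] - ma) ** 2 for p in points) / (n - 1)
    sbb = sum((p[1] - mb) ** 2 for p in points) / (n - 1)
    sab = sum((p[0] - ma) * (p[1] - mb) for p in points) / (n - 1)
    return (ma, mb, saa, sbb, sab)


def bivariate_normal_pdf(x, params):
    """Density of a bivariate normal at the point x = (a, b)."""
    ma, mb, saa, sbb, sab = params
    det = saa * sbb - sab ** 2
    da, db = x[0] - ma, x[1] - mb
    # Quadratic form (x - mu)^T Sigma^{-1} (x - mu), written out for 2x2.
    q = (sbb * da ** 2 - 2 * sab * da * db + saa * db ** 2) / det
    return math.exp(-0.5 * q) / (2 * math.pi * math.sqrt(det))


def fused_lr(x, same_params, diff_params):
    """Likelihood ratio: density under same-speaker vs different-speaker."""
    return bivariate_normal_pdf(x, same_params) / bivariate_normal_pdf(x, diff_params)


def cllr(lrs_same, lrs_diff):
    """Log-likelihood-ratio cost: 0 is ideal, 1 matches always reporting LR = 1."""
    penalty_same = mean(math.log2(1 + 1 / lr) for lr in lrs_same)
    penalty_diff = mean(math.log2(1 + lr) for lr in lrs_diff)
    return 0.5 * (penalty_same + penalty_diff)


# Invented toy scores (acoustic, frequent-word); not taken from the paper.
same_scores = [(2.0, 1.0), (3.0, 1.5), (2.5, 0.5), (3.5, 2.0)]
diff_scores = [(-2.0, -1.0), (-3.0, -0.5), (-2.5, -1.5), (-1.5, 0.0)]

same_params = fit_bivariate_normal(same_scores)
diff_params = fit_bivariate_normal(diff_scores)

lrs_same = [fused_lr(x, same_params, diff_params) for x in same_scores]
lrs_diff = [fused_lr(x, same_params, diff_params) for x in diff_scores]
print("Cllr on toy data:", cllr(lrs_same, lrs_diff))
```

For brevity, the sketch scores the same pairs it was fitted on. A real evaluation would compute likelihood ratios on held-out comparisons, since reusing the training pairs gives an optimistic Cllr.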
