Complex interaction analysis, and information extraction from text and speech are an active research field, based on linguistic theories and NLP techniques. In this paper we present a conceptual model that aims at generating a rich description of forensic examinations, exploiting –in a novel way– forensic, psychological, and linguistic theories. Such description is then translated into profiles and a report. Profiles describe examiners and the person under examination, while the report evaluates the whole examination. Our main goal is to provide a didactical tool for improving forensic examination techniques. The model is based on a multi-layered set of HMMs, which leverage and combine speech (from audio recordings) and textual (from related transcriptions) features, classifying the examination at several granularity levels. Then, a rule-based expert system generates profiles and an evaluation of the examination. We also created a new audio/textual corpus based on real examinations collected from Italian trials. DIKE is the prototype we are currently implementing and that we plan to use for model validation.

Forensic Examinations: Computational Analysis and Information Extraction

SBATTELLA, LICIA;TEDESCO, ROBERTO;TRIVILINI, ALESSANDRO
2014-01-01

Abstract

Complex interaction analysis, and information extraction from text and speech are an active research field, based on linguistic theories and NLP techniques. In this paper we present a conceptual model that aims at generating a rich description of forensic examinations, exploiting –in a novel way– forensic, psychological, and linguistic theories. Such description is then translated into profiles and a report. Profiles describe examiners and the person under examination, while the report evaluates the whole examination. Our main goal is to provide a didactical tool for improving forensic examination techniques. The model is based on a multi-layered set of HMMs, which leverage and combine speech (from audio recordings) and textual (from related transcriptions) features, classifying the examination at several granularity levels. Then, a rule-based expert system generates profiles and an evaluation of the examination. We also created a new audio/textual corpus based on real examinations collected from Italian trials. DIKE is the prototype we are currently implementing and that we plan to use for model validation.
2014
Proceedings of International Conference on Forensic Science - Criminalistics Research (FSCR)
Information extraction; Natural Language Processing; HMM
File in questo prodotto:
File Dimensione Formato  
paper-fscr-2014-polimi-v3.1.pdf

Accesso riservato

: Pre-Print (o Pre-Refereeing)
Dimensione 9.41 MB
Formato Adobe PDF
9.41 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/867148
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact