RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

This paper describes an audio event detection system which automatically classifies an audio event as ambient noise, scream or gunshot. The classification system uses two parallel GMM classifiers for discriminating screams from noise and gunshots from noise. Each classifier is trained using different features, appropriately chosen from a set of 47 audio features, which are selected according to a 2-step process. First, feature subsets of increasing size are assembled using filter selection heuristics. Then, a classifier is trained and tested with each feature subset. The obtained classification performance is used to determine the optimal feature vector dimension. This allows a noticeable speed-up w.r.t. wrapper feature selection methods. In order to validate the proposed detection algorithm, we carried out extensive experiments on a rich set of gunshots and screams mixed with ambient noise at different SNRs. Our results demonstrate that the system is able to guarantee a precision of 90% at a false rejection rate of 8%.

Scream and Gunshot Detection in Noisy Environments

ANTONACCI, FABIO;L. Gerosa;SARTI, AUGUSTO;TAGLIASACCHI, MARCO;VALENZISE, GIUSEPPE

2007-01-01

Abstract

This paper describes an audio event detection system which automatically classifies an audio event as ambient noise, scream or gunshot. The classification system uses two parallel GMM classifiers for discriminating screams from noise and gunshots from noise. Each classifier is trained using different features, appropriately chosen from a set of 47 audio features, which are selected according to a 2-step process. First, feature subsets of increasing size are assembled using filter selection heuristics. Then, a classifier is trained and tested with each feature subset. The obtained classification performance is used to determine the optimal feature vector dimension. This allows a noticeable speed-up w.r.t. wrapper feature selection methods. In order to validate the proposed detection algorithm, we carried out extensive experiments on a rich set of gunshots and screams mixed with ambient noise at different SNRs. Our results demonstrate that the system is able to guarantee a precision of 90% at a false rejection rate of 8%.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2007
			
	Titolo del libro
	
				15th European Signal Processing Conference, EUSIPCO 2007
			
	Titolo della collana
	
				EUROPEAN SIGNAL PROCESSING CONFERENCE
			
	ISBN (International Standard Book Number)
	
				9788392134022
			
	Parole chiave
	
				INF; TEL
			
	Appare nelle tipologie:
	
				04.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
07098998.pdf Accesso riservato Descrizione: Articolo principale : Publisher’s version Dimensione 239.63 kB Formato Adobe PDF Visualizza/Apri	239.63 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/252010

Citazioni

ND

84

ND

social impact