RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

The present work contributes to the field of generalized sound classification. We extensively examine the performance of the next three feature sets: a) MPEG-7 Audio Spectrum Projection, b) MFCC (using an alternative method for their extraction) and c) a group derived utilizing critical band based wavelet packets. Subsequently three types of tem poral feature integration strategies are applied on the extracted instant values: a) short-term statistics, b) spectral moments and c) two autoregressive functions. During the experimental phase, we organize ten sound classes using professional sound effects collections of high quality. The density of each category is approximated with left-right hidden Markov models. Comparable results with respect to all the feature sets as well as integration methods are provided, which demonstrate the superiority of the short-term statistics method. ©2010 IEEE.

Sound classification based on temporal feature integration

NTALAMPIRAS, STAVROS;Potamitis, Ilyas;Fakotakis, Nikos

2010-01-01

Abstract

The present work contributes to the field of generalized sound classification. We extensively examine the performance of the next three feature sets: a) MPEG-7 Audio Spectrum Projection, b) MFCC (using an alternative method for their extraction) and c) a group derived utilizing critical band based wavelet packets. Subsequently three types of tem poral feature integration strategies are applied on the extracted instant values: a) short-term statistics, b) spectral moments and c) two autoregressive functions. During the experimental phase, we organize ten sound classes using professional sound effects collections of high quality. The density of each category is approximated with left-right hidden Markov models. Comparable results with respect to all the feature sets as well as integration methods are provided, which demonstrate the superiority of the short-term statistics method. ©2010 IEEE.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2010
			
	Titolo del libro
	
				Final Program and Abstract Book - 4th International Symposium on Communications, Control, and Signal Processing, ISCCSP 2010
			
	ISBN (International Standard Book Number)
	
				9781424462858
9781424462858
			
	Parole chiave
	
				Computer Networks and Communications; Signal Processing; Electrical and Electronic Engineering
			
	Appare nelle tipologie:
	
				04.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
13 ISCCSP2009 05463315.pdf Accesso riservato : Publisher’s version Dimensione 2.84 MB Formato Adobe PDF Visualizza/Apri	2.84 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1004430

Citazioni

ND

2

ND

social impact