The mismatch between the data distributions of training and test data acquired under different recording conditions and using different devices is known to severely impair the performance of acoustic scene classification (ASC) systems. To address this issue, we propose an unsupervised domain adaptation method for ASC based on the projection of spectro-temporal features extracted from both the source and target domain onto the principal subspace spanned by the eigen-vectors of the sample covariance matrix of source-domain training data. Using the TUT Urban Acoustic Scenes 2018 Mobile Development dataset we show that the proposed method outperforms state-of-the-art unsupervised domain adaptation techniques when applied jointly with a convolutional ASC model and can also be practically employed as a feature extraction procedure for shallower artificial neural networks.

Feature projection-based unsupervised domain adaptation for acoustic scene classification

Alessandro Ilic Mezza;Augusto Sarti
2020-01-01

Abstract

The mismatch between the data distributions of training and test data acquired under different recording conditions and using different devices is known to severely impair the performance of acoustic scene classification (ASC) systems. To address this issue, we propose an unsupervised domain adaptation method for ASC based on the projection of spectro-temporal features extracted from both the source and target domain onto the principal subspace spanned by the eigen-vectors of the sample covariance matrix of source-domain training data. Using the TUT Urban Acoustic Scenes 2018 Mobile Development dataset we show that the proposed method outperforms state-of-the-art unsupervised domain adaptation techniques when applied jointly with a convolutional ASC model and can also be practically employed as a feature extraction procedure for shallower artificial neural networks.
2020
2020 IEEE 30th International Workshop on Machine Learning for Signal Processing (MLSP)
978-1-7281-6662-9
unsupervised domain adaptation
mismatched recording devices
acoustic scene classification
File in questo prodotto:
File Dimensione Formato  
Feature_Projection-Based_Unsupervised_Domain_Adaptation_for_Acoustic_Scene_Classification.pdf

Accesso riservato

: Publisher’s version
Dimensione 496.51 kB
Formato Adobe PDF
496.51 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1250200
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 1
social impact