Given a sample of unlabeled observations, the goal of a novelty detec- tion method is to identify which units substantially deviate from the observed la- beled patterns. Therefore, in a model-based framework, it is firstly of paramount importance to learn the components that correspond to the manifest groups in the training set. Secondly, one needs to take into account the lack of knowledge regard- ing the statistical novelties. Thirdly, contaminated elements in the known classes could greatly jeopardize the identification of new groups. Motivated by these chal- lenges, we propose a two-stage Bayesian non-parametric novelty detector. At stage one, robust estimates are extracted from the training set and, subsequently, such in- formation is employed to elicit informative priors within a flexible semiparametric mixture. This general paradigm can be easily adapted to complex modeling frame- works: we provide here an application to functional data from a food authenticity study.

Outlier and novelty detection for Functional data: a semiparametric Bayesian approach

Andrea Cappozzo;
2021-01-01

Abstract

Given a sample of unlabeled observations, the goal of a novelty detec- tion method is to identify which units substantially deviate from the observed la- beled patterns. Therefore, in a model-based framework, it is firstly of paramount importance to learn the components that correspond to the manifest groups in the training set. Secondly, one needs to take into account the lack of knowledge regard- ing the statistical novelties. Thirdly, contaminated elements in the known classes could greatly jeopardize the identification of new groups. Motivated by these chal- lenges, we propose a two-stage Bayesian non-parametric novelty detector. At stage one, robust estimates are extracted from the training set and, subsequently, such in- formation is employed to elicit informative priors within a flexible semiparametric mixture. This general paradigm can be easily adapted to complex modeling frame- works: we provide here an application to functional data from a food authenticity study.
2021
Book of Short Papers of the 5th international workshop on Models and Learning for Clustering and Classification
9788855265393
Bayesian mixture model, Dirichlet Process Mixture Model, Functional data, Minimum Regularized Covariance Determinant
File in questo prodotto:
File Dimensione Formato  
Book-Short-Papers-MBC2-2020.pdf

accesso aperto

: Publisher’s version
Dimensione 1.77 MB
Formato Adobe PDF
1.77 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1237383
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact