In recent years an increasing number of applications, IoT sensors and websites have produced endless streams of data. These data streams are not only unbounded, but their characteristics dynamically change over time, generating a phenomenon called concept drift. The standard machine learning models do not work properly in this context and new techniques have been developed in order to tackle these challenges. In this paper we present a new Naïve Bayes algorithm that exploits Kalman Filter, namely KalmanNB, to manage automatically concept drift. Furthermore, we want to investigate when this new approach, which directly follows the values of data's attributes, is better than the standard strategy, which monitors the performance of the model in order to detect a drift. Extensive experiments on both artificial and real datasets with concept drifts reveal that KalmanNB is a valid alternative to the state-of-the-art algorithms, outperforming the latter especially in case of recurring concept drifts.

Fast incremental naïve bayes with kalman filtering

Ziffer, Giacomo;Bernardo, Alessio;Della Valle, Emanuele;
2020-01-01

Abstract

In recent years an increasing number of applications, IoT sensors and websites have produced endless streams of data. These data streams are not only unbounded, but their characteristics dynamically change over time, generating a phenomenon called concept drift. The standard machine learning models do not work properly in this context and new techniques have been developed in order to tackle these challenges. In this paper we present a new Naïve Bayes algorithm that exploits Kalman Filter, namely KalmanNB, to manage automatically concept drift. Furthermore, we want to investigate when this new approach, which directly follows the values of data's attributes, is better than the standard strategy, which monitors the performance of the model in order to detect a drift. Extensive experiments on both artificial and real datasets with concept drifts reveal that KalmanNB is a valid alternative to the state-of-the-art algorithms, outperforming the latter especially in case of recurring concept drifts.
2020
20th International Conference on Data Mining Workshops, ICDM Workshops 2020, Sorrento, Italy, November 17-20, 2020
978-1-7281-9012-9
Naïve Bayes , Kalman Filter , Incremental Learning , Concept Drift Management , Recurring Drift
File in questo prodotto:
File Dimensione Formato  
Fast_Incremental_Naive_Bayes_with_Kalman_Filtering.pdf

accesso aperto

: Publisher’s version
Dimensione 203.68 kB
Formato Adobe PDF
203.68 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1161880
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
social impact