We present Kernel-QuantTree Exponentially Weighted Moving Average (KQT-EWMA), a non-parametric change-detection algorithm that combines the Kernel-QuantTree (KQT) histogram and the EWMA statistic to monitor multivariate data streams online. The resulting monitoring scheme is very flexible, since histograms can be used to model any stationary distribution, and practical, since the distribution of test statistics does not depend on the distribution of datastream in stationary conditions (non-parametric monitoring). KQT-EWMA enables controlling false alarms by operating at a pre-determined Average Run Length (ARL0), which measures the expected number of stationary samples to be monitored before triggering a false alarm. The latter peculiarity is in contrast with most non-parametric change-detection tests, which rarely can control the ARL0 a priori. Our experiments on synthetic and real-world datasets demonstrate that KQT-EWMA can control ARL0 while achieving detection delays comparable to or lower than state-of-the-art methods designed to work in the same conditions.

Change Detection in Multivariate Data Streams: Online Analysis with Kernel-QuantTree

Nogara Notarianni, Michelangelo Olmo;Leveni, Filippo;Stucchi, Diego;Frittoli, Luca;Boracchi, Giacomo
2025-01-01

Abstract

We present Kernel-QuantTree Exponentially Weighted Moving Average (KQT-EWMA), a non-parametric change-detection algorithm that combines the Kernel-QuantTree (KQT) histogram and the EWMA statistic to monitor multivariate data streams online. The resulting monitoring scheme is very flexible, since histograms can be used to model any stationary distribution, and practical, since the distribution of test statistics does not depend on the distribution of datastream in stationary conditions (non-parametric monitoring). KQT-EWMA enables controlling false alarms by operating at a pre-determined Average Run Length (ARL0), which measures the expected number of stationary samples to be monitored before triggering a false alarm. The latter peculiarity is in contrast with most non-parametric change-detection tests, which rarely can control the ARL0 a priori. Our experiments on synthetic and real-world datasets demonstrate that KQT-EWMA can control ARL0 while achieving detection delays comparable to or lower than state-of-the-art methods designed to work in the same conditions.
2025
Advanced Analytics and Learning on Temporal Data. AALTD 202: 9th ECML PKDD Workshop, AALTD 2024, Vilnius, Lithuania, September 9–13, 2024, Revised Selected Papers
9783031770654
9783031770661
File in questo prodotto:
File Dimensione Formato  
2410.13778v1.pdf

Accesso riservato

Dimensione 878.63 kB
Formato Adobe PDF
878.63 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1296905
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact