The problem of detecting outliers has been investigated in several research areas such as database, machine learning, knowledge discovery, and logic programming, with the aim of identifying objects of a given population whose behavior is different from that of the other data objects of the dataset. Outliers represent semantically correct situations, albeit infrequent with respect to the majority of cases. Detecting them allows extracting useful and actionable knowledge of interest to domain experts. In this paper, we focus our attention on the analysis of outlier detection in temporal databases. We propose a method, based on association rules, to infer the normal behavior of objects by extracting frequent rules from a given dataset. To include the time information, we define the concept of temporal association rules. Then, temporal association rules are combined to generate temporal quasi-functional dependencies, which define relationships among attributes over time which hold frequently. Once such dependencies have been inferred from data, outliers are retrieved with respect to them. Given a temporal quasi-functional dependency, it is possible to discover the outliers by querying the temporal association rules stored previously. Our method is independent of the considered database and infers rules, used to highlight outliers, directly from data. The applicability of the proposed approach is validated through a set of experiments which show its effectiveness and efficiency.

TOD: Temporal outlier detection by using quasi-functional temporal dependencies

GARZA, PAOLO
2010

Abstract

The problem of detecting outliers has been investigated in several research areas such as database, machine learning, knowledge discovery, and logic programming, with the aim of identifying objects of a given population whose behavior is different from that of the other data objects of the dataset. Outliers represent semantically correct situations, albeit infrequent with respect to the majority of cases. Detecting them allows extracting useful and actionable knowledge of interest to domain experts. In this paper, we focus our attention on the analysis of outlier detection in temporal databases. We propose a method, based on association rules, to infer the normal behavior of objects by extracting frequent rules from a given dataset. To include the time information, we define the concept of temporal association rules. Then, temporal association rules are combined to generate temporal quasi-functional dependencies, which define relationships among attributes over time which hold frequently. Once such dependencies have been inferred from data, outliers are retrieved with respect to them. Given a temporal quasi-functional dependency, it is possible to discover the outliers by querying the temporal association rules stored previously. Our method is independent of the considered database and infers rules, used to highlight outliers, directly from data. The applicability of the proposed approach is validated through a set of experiments which show its effectiveness and efficiency.
DATA & KNOWLEDGE ENGINEERING
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11311/665674
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 12
  • ???jsp.display-item.citation.isi??? 9
social impact