The research in IoT and Smart Homes fields is rapidly growing, leading to the emergence of new services to improve the health and lifestyle of people based on the analysis of data that they produce performing their daily activities. However, researchers report a lack of high-quality publicly-available datasets: conducting experiments gathering such data is long and expensive, especially if the annotation of meaningful information (environment, person’s activity, health status) is required. Moreover, there are even more specific settings (e.g., dementia detection) where data must be related to a change in inhabitants’ behavior. We present a collection of new publicly-available datasets generated with the SHARON simulator. Thanks to this software, researchers can obtain synthetic data suiting their specific requirements. Two classes of datasets are described: one extends existing datasets preserving the original statistical properties, the other is composed of simulations of virtual inhabitant-environment systems. Moreover, we induced behavioral drifts compatible with dementia symptoms, generating further datasets. We believe that these resources may help the progress of research, as long as new real-life high-quality datasets are not available.

Disseminating Synthetic Smart Home Data for Advanced Applications

Andrea Masciadri;Fabio Veronese;Sara Comai;Ilaria Carlini;Fabio Salice
2018-01-01

Abstract

The research in IoT and Smart Homes fields is rapidly growing, leading to the emergence of new services to improve the health and lifestyle of people based on the analysis of data that they produce performing their daily activities. However, researchers report a lack of high-quality publicly-available datasets: conducting experiments gathering such data is long and expensive, especially if the annotation of meaningful information (environment, person’s activity, health status) is required. Moreover, there are even more specific settings (e.g., dementia detection) where data must be related to a change in inhabitants’ behavior. We present a collection of new publicly-available datasets generated with the SHARON simulator. Thanks to this software, researchers can obtain synthetic data suiting their specific requirements. Two classes of datasets are described: one extends existing datasets preserving the original statistical properties, the other is composed of simulations of virtual inhabitant-environment systems. Moreover, we induced behavioral drifts compatible with dementia symptoms, generating further datasets. We believe that these resources may help the progress of research, as long as new real-life high-quality datasets are not available.
2018
Proceedings of the CIKM 2018 Workshops co-located with 27th ACM International Conference on Information and Knowledge Management (CIKM2018), Torino, Italy, October 22, 2018
File in questo prodotto:
File Dimensione Formato  
CameraR__Disseminating_Synthetic_Smart_Home_Data_for_Advanced_Applications.pdf

Accesso riservato

Descrizione: Articolo principale
: Pre-Print (o Pre-Refereeing)
Dimensione 182.66 kB
Formato Adobe PDF
182.66 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1122985
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact