Web applications that combine dynamic data stream with distributed background data are getting a growing attention in recent years. Answering in a timely fashion, i.e., reactiveness, is one of the most important performance indicators for those applications. The Semantic Web community showed that RDF Stream Processing (RSP) is an adequate framework to develop this type of applications. However, RSP engines may lose their reactiveness due to the time necessary to access the background data when it is distributed over the Web. State-of-the-art RSP engines remain reactive using a local replica of the background data, but it progressively becomes stale if not updated to reflect the changes in the remote background data. For this reason, recently, the RSP community has investigated maintenance policies of the local replica that guarantee reactiveness while maximizing the freshness of the replica. Previous works simplified the problem with several assumptions. In this paper, we investigate how to remove some of those simplification assumptions. In particular, we target a class of queries for which multiple policies may be used simultaneously and we show that rank aggregation can be effectively used to fairly consider their alternative suggestions. We provide extensive empirical evidence that rank aggregation is key to move a step forward to the practical solution of this problem in the RSP context.

Using rank aggregation in continuously answering SPARQL queries on streaming and quasi-static linked data

Zahmatkesh, Shima;Della Valle, Emanuele;Dell Aglio, Daniele
2017-01-01

Abstract

Web applications that combine dynamic data stream with distributed background data are getting a growing attention in recent years. Answering in a timely fashion, i.e., reactiveness, is one of the most important performance indicators for those applications. The Semantic Web community showed that RDF Stream Processing (RSP) is an adequate framework to develop this type of applications. However, RSP engines may lose their reactiveness due to the time necessary to access the background data when it is distributed over the Web. State-of-the-art RSP engines remain reactive using a local replica of the background data, but it progressively becomes stale if not updated to reflect the changes in the remote background data. For this reason, recently, the RSP community has investigated maintenance policies of the local replica that guarantee reactiveness while maximizing the freshness of the replica. Previous works simplified the problem with several assumptions. In this paper, we investigate how to remove some of those simplification assumptions. In particular, we target a class of queries for which multiple policies may be used simultaneously and we show that rank aggregation can be effectively used to fairly consider their alternative suggestions. We provide extensive empirical evidence that rank aggregation is key to move a step forward to the practical solution of this problem in the RSP context.
2017
DEBS 2017 - Proceedings of the 11th ACM International Conference on Distributed Event-Based Systems
9781450350655
Continuous SPARQL query processing; Linked Data; Rank aggregation; RDF stream; Computer Networks and Communications; Hardware and Architecture; Control and Systems Engineering; Software
File in questo prodotto:
File Dimensione Formato  
p170-Zahmatkesh.pdf

accesso aperto

: Publisher’s version
Dimensione 1.16 MB
Formato Adobe PDF
1.16 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1038743
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact