Today's social web platforms, such as Facebook, Twitter, Google+, and LinkedIn, increasingly have to process large volumes of user-generated data on the fly. As the role of such platforms shifts from being portals for largely historic data towards providing platforms for real-time data analytics, we observe that their architectures incrementally move from storage-centric designs, based on distributed data management technologies, towards event-based models exploiting queueing and stream processing systems. We believe that it is time to rethink fundamentally the software architecture for social web platforms and base them on a content-based communication model, that is explicitly designed to disseminate and partition incoming request flows on a cluster of servers. A content-based publish/subscribe system thus acts as a scalable and elastic, highly responsive data distribution backbone. By focusing on fresh data, such an architecture can optimize the routing of data to match the topology of the data center, dynamically adapt data flows to alleviate hot spots, and elastically scale to more servers when required by computationally expensive on-the-fly data analytics applications. © 2012 ACM.

Living in the present: On-the-fly information processing in scalable web architectures

MARGARA, ALESSANDRO;
2012-01-01

Abstract

Today's social web platforms, such as Facebook, Twitter, Google+, and LinkedIn, increasingly have to process large volumes of user-generated data on the fly. As the role of such platforms shifts from being portals for largely historic data towards providing platforms for real-time data analytics, we observe that their architectures incrementally move from storage-centric designs, based on distributed data management technologies, towards event-based models exploiting queueing and stream processing systems. We believe that it is time to rethink fundamentally the software architecture for social web platforms and base them on a content-based communication model, that is explicitly designed to disseminate and partition incoming request flows on a cluster of servers. A content-based publish/subscribe system thus acts as a scalable and elastic, highly responsive data distribution backbone. By focusing on fresh data, such an architecture can optimize the routing of data to match the topology of the data center, dynamically adapt data flows to alleviate hot spots, and elastically scale to more servers when required by computationally expensive on-the-fly data analytics applications. © 2012 ACM.
2012
Proceedings of the 2nd International Workshop on Cloud Computing Platforms, CloudCP 2012 - Co-located with EuroSys 2012
1595930361
1595930361
Software
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/999269
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 13
  • ???jsp.display-item.citation.isi??? ND
social impact