A distributed file system over heterogeneous SaaS storage platforms

SCAVUZZO, MARCO
2015-01-01

Abstract

Thanks to the widespread adoption of the Cloud, many providers now offer storage-as-a-service solutions. These solutions differ in storage capacity and features, and they are offered under various business models: typically, users can choose between free plans (with a limited amount of space) and paid plans. When free-plan users run low on storage capacity, they tend to subscribe to new free plans from other providers, thus increasing so-called data fragmentation. This phenomenon greatly increases the complexity of file management. This paper proposes a solution to the data fragmentation problem by describing an innovative approach that deploys a distributed file system on top of multiple SaaS storage accounts offered by different providers. This approach not only lowers the complexity of data management by providing the user with a single transparent storage solution, but also provides features such as full-text search, file classification and categorization, and data analytics (MapReduce) on top of these SaaS storage accounts. Furthermore, it proposes a new way to address the data privacy and security issues typically associated with SaaS storage accounts.
2015
Proceedings - 16th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, SYNASC 2014
9781479984480
Big Data; Hadoop Distributed File System (HDFS); Storage as a Service; Computational Theory and Mathematics; Theoretical Computer Science; Applied Mathematics
Use this identifier to cite or link to this document: https://hdl.handle.net/11311/988399