Data products are emerging as architectural elements of data mesh with the aim of improving data management within organizations. Among the main aspects in the design of a data product is the need to define the policies that regulate its access. These policies concern not only security aspects but, more generally, compliance with regulations (e.g., GDPR, HIPAA) or organizational regulations. Taking advantage of similarities with microservice-based solutions, policy-as-code is also used in the data mesh to regulate access to data products. Following the proposed models for the service mesh which regulates a microservice-based environment, the performance of the Policy Decision Point (PDP) becomes crucial to ensure efficient access to data managed by data products. Since this may require a replication of PDPs, in order to achieve true scalability, each of them will have to take into account only the policies associated with the data products under their responsibilities. This work provides a solution to automate the replication of PDPs and the distribution of policies to be controlled according to an approach that balances policy consistency and system scalability. The work has been validated through an extension of the Kubernetes orchestrator that, based on the information collected from the monitoring system, it is able to define which is the right number of PDPs and which data product, and related policies, should be assigned to each of them. Notably, the policy management is based on a Open Policy Agent (OPA) implementation.

Scalable Policy-as-Code Decision Points for Data Products

Brambilla, Matteo;Plebani, Pierluigi
2025-01-01

Abstract

Data products are emerging as architectural elements of data mesh with the aim of improving data management within organizations. Among the main aspects in the design of a data product is the need to define the policies that regulate its access. These policies concern not only security aspects but, more generally, compliance with regulations (e.g., GDPR, HIPAA) or organizational regulations. Taking advantage of similarities with microservice-based solutions, policy-as-code is also used in the data mesh to regulate access to data products. Following the proposed models for the service mesh which regulates a microservice-based environment, the performance of the Policy Decision Point (PDP) becomes crucial to ensure efficient access to data managed by data products. Since this may require a replication of PDPs, in order to achieve true scalability, each of them will have to take into account only the policies associated with the data products under their responsibilities. This work provides a solution to automate the replication of PDPs and the distribution of policies to be controlled according to an approach that balances policy consistency and system scalability. The work has been validated through an extension of the Kubernetes orchestrator that, based on the information collected from the monitoring system, it is able to define which is the right number of PDPs and which data product, and related policies, should be assigned to each of them. Notably, the policy management is based on a Open Policy Agent (OPA) implementation.
2025
2025 IEEE International Conference on Web Services
979-8-3315-5564-1
authorization
data mesh
Kubernetes
OPA
policy
policy-as-code
scalability
File in questo prodotto:
File Dimensione Formato  
ICWS2025_ScalablePolicies_Short.pdf

accesso aperto

: Post-Print (DRAFT o Author’s Accepted Manuscript-AAM)
Dimensione 518.32 kB
Formato Adobe PDF
518.32 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1307670
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact