Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Computer Science and Information Systems 2024 Volume 21, Issue 3, Pages: 1033-1054
https://doi.org/10.2298/CSIS230426026B
Full text ( 395 KB)


Graph rewriting primitives for semantic graph databases sanitization

Boiret Adrien (INSA Centre Val de Loire, Laboratoire d’Informatique Fondamentale d’Orléans, Bourges, France)
Eichler Cédric (INSA Centre Val de Loire, Laboratoire d’Informatique Fondamentale d’Orléans, Bourges, France)
Nguyen Benjamin (INSA Centre Val de Loire, Laboratoire d’Informatique Fondamentale d’Orléans, Bourges, France)
Taki Sara (INSA Centre Val de Loire, Laboratoire d’Informatique Fondamentale d’Orléans, Bourges, France)

Due to the rapid proliferation of data online, an important quantity of private or sensitive informations is being stored as linked data in graph databases (e.g., represented as RDF). For such databases to be shared without jeopardizing privacy, they must first undergo a process known as database sanitization. During this process, databases are transformed following graph transformations that are usually described informally or through ad-hoc processes. However, a more thourough formalization of these transformations would aid in analysing the sanitization process, ensuring its correctness, and demonstrating the resulting privacy guarantees. This paper is an effort toward bridging the gap between the rigorous graph rewriting approaches and graph sanitization. We propose a graph transformation language to serve as a basis for constructing various sanitization mechanisms. This language relies on a set of elementary transformation operators formalized using a generic algebraic graph rewriting approach. Our language takes into account semantic and supports the equivalent of WHERE and EXCEPT clauses. As a proof of concept, we use these operators to implement two mechanisms from the literature, one generic (Local Differential Privacy) and one specifically introduced for semantic graph databases (sensitive attribute masking through anatomization). We propose an open-sourced tool implementing the elementary operators and the privacy mechanisms we derive from them relying on the Attributed Graph Grammar System (AGG) and its java API, providing a concrete tool implementing formal graph rewriting mechanisms to sanitize semantic graph databases. We present experimental results on this implementation regarding both proposed schemes and discuss its efficiency and scalability.


Show references