Ultimate Power of Inference Attacks: Privacy Risks of High-Dimensional Models

Murakonda, Sasi Kumar; Shokri, Reza; Theodorakopoulos, George

arXiv:1905.12774v1 (stat)

[Submitted on 29 May 2019 (this version), latest version 17 Feb 2021 (v3)]

Abstract: Models leak information about their training data. This enables attackers to infer sensitive information about the training set, notably to determine whether a data sample was part of it. Existing work shows empirically that these tracing (membership inference) attacks are possible against complex models with a large number of parameters. However, the attack results depend on the specific training data, can be obtained only after the tedious process of training the model and performing the attack, and lack any measure of the attack's confidence or its unused potential power. A model designer is interested in identifying which model structures leak more information, how adding new parameters to the model increases its privacy risk, and how much adding new data points reduces the overall information leakage. The privacy analysis should also enable designing the most powerful inference attack.
In this paper, we design a theoretical framework to analyze the maximum power of tracing attacks against high-dimensional models, with a focus on probabilistic graphical models. We provide a tight upper bound on the power (true positive rate) of these attacks with respect to their error (false positive rate). The bound, as it should be, is independent of the knowledge and algorithm of any specific attack, as well as of the values of particular samples in the training set. It provides a measure of the potential leakage of a model given its structure, as a function of the structure's complexity and the size of the training set.
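
To make the two quantities in the bound concrete, the following is a minimal sketch of how the power (true positive rate) of a threshold-based tracing attack could be measured empirically at a given error (false positive rate). It is not the paper's bound or method; the function name `tpr_at_fpr`, the score lists, and the thresholding rule are all hypothetical and chosen only for illustration. A theoretical upper bound of the kind described above would cap the TPR that any such attack can reach at each FPR.

```python
def tpr_at_fpr(member_scores, nonmember_scores, max_fpr):
    """Empirical power (TPR) of a threshold attack whose FPR is at most max_fpr.

    Hypothetical setup: a higher score means the attack believes the sample
    was in the training set. The attack flags a sample when score > threshold.
    """
    # Sort non-member scores from largest to smallest.
    sorted_non = sorted(nonmember_scores, reverse=True)
    # Number of non-members that may be (wrongly) flagged.
    k = int(max_fpr * len(sorted_non))
    # Use the (k+1)-th largest non-member score as the threshold, so at most
    # k non-members lie strictly above it.
    threshold = sorted_non[k] if k < len(sorted_non) else float("-inf")
    tpr = sum(s > threshold for s in member_scores) / len(member_scores)
    fpr = sum(s > threshold for s in nonmember_scores) / len(nonmember_scores)
    return tpr, fpr, threshold


# Toy, made-up scores (not data from the paper).
members = [0.9, 0.8, 0.3]
nonmembers = [0.7, 0.2, 0.1, 0.05]
print(tpr_at_fpr(members, nonmembers, max_fpr=0.25))  # (1.0, 0.25, 0.2)
```

Sweeping `max_fpr` from 0 to 1 traces out the attack's empirical TPR-versus-FPR curve, which is the curve a bound on power with respect to error would sit above.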

Subjects:	Machine Learning (stat.ML); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:1905.12774 [stat.ML]
	(or arXiv:1905.12774v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1905.12774

Submission history

From: Sasi Kumar Murakonda
[v1] Wed, 29 May 2019 23:14:45 UTC (436 KB)
[v2] Fri, 11 Oct 2019 08:03:34 UTC (521 KB)
[v3] Wed, 17 Feb 2021 05:51:25 UTC (144 KB)
