DOI: 10.1145/3133956.3134097
Research article · Public Access

Use Privacy in Data-Driven Systems: Theory and Experiments with Machine Learnt Programs

Published: 30 October 2017

Abstract

This paper presents an approach to formalizing and enforcing a class of use privacy properties in data-driven systems. In contrast to prior work, we focus on use restrictions on proxies (i.e., strong predictors) of protected information types. Our definition relates proxy use to intermediate computations that occur in a program, and identifies two essential properties that characterize this behavior: 1) its result is strongly associated with the protected information type in question, and 2) it is likely to causally affect the final output of the program. For a specific instantiation of this definition, we present a program analysis technique that detects instances of proxy use in a model and provides a witness identifying which parts of the corresponding program exhibit the behavior. Recognizing that not all instances of proxy use of a protected information type are inappropriate, we rely on a normative judgment oracle to make this inappropriateness determination for a given witness. Our repair algorithm uses the witness of an inappropriate proxy use to transform the model into one that provably does not exhibit proxy use, while avoiding changes that unduly affect classification accuracy. Using a corpus of social datasets, our evaluation shows that these algorithms detect proxy use instances that would be difficult to find with existing techniques, and subsequently remove them while maintaining acceptable classification performance.
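The abstract's two-part characterization of proxy use can be sketched in code. The toy model, data, thresholds, and function names below are all illustrative assumptions, not the paper's actual instantiation: an "intermediate computation" is a boolean input value, association is measured as agreement with the protected attribute, and causal influence is estimated by intervening on the intermediate value and checking whether the output flips.

```python
# Hypothetical sketch of the two-part proxy-use test described in the abstract:
# a component is flagged if (1) its value is strongly associated with the
# protected attribute AND (2) it causally influences the model's output.
import random

random.seed(0)

# Toy population: z is the protected attribute; x1 is a strong proxy for z,
# x2 is independent noise. (All names and thresholds here are illustrative.)
data = []
for _ in range(1000):
    z = random.random() < 0.5
    x1 = z if random.random() < 0.95 else not z   # strongly associated with z
    x2 = random.random() < 0.5                    # unrelated to z
    data.append((z, x1, x2))

def model(x1, x2):
    # The "program": its output depends on the intermediate value x1.
    return x1 and x2

def association(data, idx):
    # Agreement with z, rescaled to [0, 1] (0 = no association, 1 = perfect).
    agree = sum(1 for row in data if row[idx] == row[0]) / len(data)
    return abs(agree - 0.5) * 2

def causal_influence(data, idx):
    # Fraction of inputs where intervening on this value flips the output.
    flips = 0
    for z, x1, x2 in data:
        inputs = [x1, x2]
        out = model(*inputs)
        inputs[idx - 1] = not inputs[idx - 1]     # intervention: toggle value
        flips += out != model(*inputs)
    return flips / len(data)

def is_proxy_use(data, idx, assoc_thresh=0.8, infl_thresh=0.1):
    return association(data, idx) > assoc_thresh and \
           causal_influence(data, idx) > infl_thresh

print(is_proxy_use(data, 1))  # x1: associated with z and influential -> True
print(is_proxy_use(data, 2))  # x2: not associated with z -> False
```

Note how the conjunction matters: x2 also causally affects the output, but it is not associated with z, so it is not flagged; a value associated with z but never used would fail the influence test instead.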




    Published In

    CCS '17: Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security
    October 2017
    2682 pages
    ISBN:9781450349468
    DOI:10.1145/3133956
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. causal analysis
    2. privacy
    3. proxy
    4. use privacy

    Qualifiers

    • Research-article

    Conference

CCS '17

    Acceptance Rates

CCS '17 paper acceptance rate: 151 of 836 submissions (18%)
Overall acceptance rate: 1,261 of 6,999 submissions (18%)

    Article Metrics

• Downloads (last 12 months): 125
• Downloads (last 6 weeks): 13
    Reflects downloads up to 15 Oct 2024


    Cited By

• (2024) Goal Orientation for Fair Machine Learning Algorithms. Production and Operations Management. DOI: 10.1177/10591478241234998. Online publication date: 18-Mar-2024.
• (2024) MirrorFair: Fixing Fairness Bugs in Machine Learning Software via Counterfactual Predictions. Proceedings of the ACM on Software Engineering, 1:FSE, 2121-2143. DOI: 10.1145/3660801. Online publication date: 12-Jul-2024.
• (2024) Approaching the Information-Theoretic Limit of Privacy Disclosure With Utility Guarantees. IEEE Transactions on Information Forensics and Security, 19, 3339-3352. DOI: 10.1109/TIFS.2024.3354412. Online publication date: 1-Jan-2024.
• (2024) Attack Risk Analysis in Data Anonymization in Internet of Things. IEEE Transactions on Computational Social Systems, 11:4, 4986-4993. DOI: 10.1109/TCSS.2023.3243089. Online publication date: Aug-2024.
• (2024) Discrimination for the sake of fairness by design and its legal framework. Computer Law & Security Review, 52, 105916. DOI: 10.1016/j.clsr.2023.105916. Online publication date: Apr-2024.
• (2024) MBFair: a model-based verification methodology for detecting violations of individual fairness. Software and Systems Modeling. DOI: 10.1007/s10270-024-01184-y. Online publication date: 10-Jun-2024.
• (2023) Fair densities via boosting the sufficient statistics of exponential families. Proceedings of the 40th International Conference on Machine Learning, 32105-32144. DOI: 10.5555/3618408.3619739. Online publication date: 23-Jul-2023.
• (2023) A Review of Partial Information Decomposition in Algorithmic Fairness and Explainability. Entropy, 25:5, 795. DOI: 10.3390/e25050795. Online publication date: 13-May-2023.
• (2023) Algorithmic Fairness in Police Investigative Work: An Ethical Analysis of Machine Learning Methods for Facial Recognition. TATuP - Zeitschrift für Technikfolgenabschätzung in Theorie und Praxis, 32:1, 24-29. DOI: 10.14512/tatup.32.1.24. Online publication date: 23-Mar-2023.
• (2023) Disambiguating Algorithmic Bias: From Neutrality to Justice. Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 691-704. DOI: 10.1145/3600211.3604695. Online publication date: 8-Aug-2023.
