Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3571306.3571400acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicdcnConference Proceedingsconference-collections
research-article

Distributed Data Minimization for Decentralized Collaborative Filtering Systems

Published: 04 January 2023 Publication History

Abstract

Data minimization is a legal principle that mandates the limitation of personal data to a necessary minimum in order to protect the privacy of individuals. In this light, we address ourselves to decentralized collaborative filtering systems in which individual users interact with each other instead of institutional recommendation providers to obtain recommendations. Decentralized collaborative filtering systems, in which data is distributed over all users, bear a privacy benefit over traditional centralized systems in which data is kept at a single authoritative node. We propose the first formal definition of data minimization for use in distributed systems in order to combine the privacy benefits of data minimization and data distribution. On the basis of this definition, we propose a distributed data minimization scheme based on Distributed Gradient Descent (DGD) that solves the data minimization problem for decentralized collaborative filtering systems. Here, data minimization represents a collection minimization problem in which users aim to collect necessary minimum amounts of rating data from other users in the system. We find by means of simulation that the overall amount of data collected by all users can be reduced significantly (> 14.65%) without jeopardizing recommendation performance, when users coordinate data collection. We thus prove that users collect unnecessarily much data, when they do not coordinate data collection.

References

[1]
Arno Bakker, Elth Ogston, and Maarten van Steen. 2009. Collaborative Filtering Using Random Neighbours in Peer-to-Peer Networks. In Proceedings of the 1st ACM International Workshop on Complex Networks Meet Information & Knowledge Management. ACM, 67–75.
[2]
Ranieri Baraglia, Patrizio Dazzi, Matteo Mordacchini, and Laura Ricci. 2013. A Peer-to-Peer Recommender System for Self-emerging User Communities Based on Gossip Overlays. J. Comput. System Sci. 79, 2 (2013), 291–308.
[3]
Lucas Nunes Barbosa, Jonathan Gemmell, Miller Horvath, and Tales Heimfarth. 2018. Distributed User-Based Collaborative Filtering on an Opportunistic Network. In Proceedings of the 2018 IEEE 32nd International Conference on Advanced Information Networking and Applications. IEEE, 266–273.
[4]
Yacine Belal, Aurélien Bellet, Sonia Ben Mokhtar, and Vlad Nitu. 2022. PEPPER: Empowering User-Centric Recommender Systems over Gossip Learning. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 6, 3, Article 101 (2022), 27 pages.
[5]
Alon Ben Horin and Tamir Tassa. 2021. Privacy Preserving Collaborative Filtering by Distributed Mediation. In Proceedings of the 15th ACM Conference on Recommender Systems. ACM, 332–341.
[6]
Asia J. Biega, Peter Potash, Hal Daumé, Fernando Diaz, and Michèle Finck. 2020. Operationalizing the Legal Principle of Data Minimization for Personalization. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 399–408.
[7]
Di Chai, Leye Wang, Kai Chen, and Qiang Yang. 2021. Secure Federated Matrix Factorization. IEEE Intelligent Systems 36, 5 (2021), 11–20.
[8]
Richard Chow, Hongxia Jin, Bart Knijnenburg, and Gokay Saldamli. 2013. Differential Data Analysis for Recommender Systems. In Proceedings of the 7th ACM Conference on Recommender Systems. ACM, 323–326.
[9]
Amir Daneshmand, Gesualdo Scutari, and Vyacheslav Kungurtsev. 2018. Second-order Guarantees of Gradient Algorithms over Networks. In Proceedings of the 56th Annual Allerton Conf. on Comm., Control, and Computing. IEEE, 359–365.
[10]
Alexandre de Spindler, Moira C. Norrie, and Michael Grossniklaus. 2007. Collaborative Filtering Based on Opportunistic Information Sharing in Mobile Ad-Hoc Networks. In Proceedings of the 6th OTM Confederated International Conferences "On the Move to Meaningful Internet Systems"(LNCS, 4803). Springer, 408–416.
[11]
Jürgen Dunkel and Ramón Hermoso. 2021. Towards MANET-based Recommender Systems for Open Facilities. Applied Intelligence 52, 8 (2021), 9045–9066.
[12]
Erika Duriakova, Elias Z. Tragos, Barry Smyth, Neil Hurley, Francisco J. Peña, Panagiotis Symeonidis, James Geraci, and Aonghus Lawlor. 2019. PDMFRec: A Decentralised Matrix Factorisation with Tunable User-Centric Privacy. In Proceedings of the 13th ACM Conference on Recommender Systems. ACM, 457–461.
[13]
Tobias Eichinger. 2021. Reviews Are Gold!? On the Link between Item Reviews and Item Preferences. In Joint Workshop Proceedings of the 3rd Edition of Knowledge-aware and Conversational Recommender Systems and the 5th Edition of Recommendation in Complex Environments. CEUR-WS, 1–9.
[14]
Tobias Eichinger, Felix Beierle, Robin Papke, Lucas Rebscher, Hong Chinh Tran, and Magdalena Trzeciak. 2019. On Gossip-based Information Dissemination in Pervasive Recommender Systems. In Proceedings of the 13th ACM Conference on Recommender Systems. ACM, 442–446.
[15]
Michael D. Ekstrand, John T. Riedl, and Joseph A. Konstan. 2011. Collaborative Filtering Recommender Systems. Found. and Trends in HCI 4, 2 (2011), 81–173.
[16]
Michèle Finck and Asia J. Biega. 2021. Reviving Purpose Limitation and Data Minimisation in Data-Driven Systems. Tech. and Regulation 2021 (2021), 44–61.
[17]
Gemma Galdon Clavell, Mariano Martín Zamorano, Carlos Castillo, Oliver Smith, and Aleksandar Matic. 2020. Auditing Algorithms: On Lessons Learned and the Risks of Data Minimization. In Proceedings of the 3rd AAAI/ACM Conference on AI, Ethics, and Society. ACM, 265–271.
[18]
Patrick Gratz and Tom Leclerc. 2009. Delay-Tolerant Collaborative Filtering. In Proceedings of the 7th ACM International Symposium on Mobility Management and Wireless Access. ACM, 109–113.
[19]
F. Maxwell Harper and Joseph A. Konstan. 2015. The MovieLens Datasets: History and Context. ACM TIIS 5, 4, Article 19 (2015), 19 pages.
[20]
István Hegedűs, Gábor Danner, and Márk Jelasity. 2020. Decentralized Recommendation Based on Matrix Factorization: A Comparison of Gossip and Federated Learning. In Machine Learning and Knowledge Discovery in Databases. Springer, 317–332.
[21]
Damon Horowitz and Sepandar D. Kamvar. 2010. The Anatomy of a Large-Scale Social Search Engine. In Proceedings of the 19th International Conference on World Wide Web. ACM, 431–440.
[22]
Anne-Marie Kermarrec, Vincent Leroy, Afshin Moin, and Christopher Thraves. 2010. Application of Random Walks to Decentralized Recommender Systems. In Proceedings of the 14th Int. Conf. on Principles of Distr. Systems. Springer, 48–63.
[23]
Martha Larson, Alessandro Zito, Babak Loni, and Paolo Cremonesi. 2017. Towards Minimal Necessary Data: The Case for Analyzing Training Data Requirements of Recommender Algorithms. In Proceedings of the 1st FATREC Workshop on Responsible Recommendation. 1–6.
[24]
Sangsu Lee, Xi Zheng, Jie Hua, Haris Vikalo, and Christine Julien. 2021. Opportunistic Federated Learning: An Exploration of Egocentric Collaboration for Pervasive Computing Applications. In Proceedings of the 19th IEEE International Conference on Pervasive Computing and Communications. IEEE, 1–8.
[25]
Bradley N. Miller, Joseph A. Konstan, and John Riedl. 2004. PocketLens: Toward a Personal Recommender System. ACM TIIS 22, 3 (2004), 437–476.
[26]
Lorenzo Minto, Moritz Haller, Benjamin Livshits, and Hamed Haddadi. 2021. Stronger Privacy for Federated Collaborative Filtering With Implicit Feedback. In Proceedings of the 15th ACM Conference on Recommender Systems. ACM, 342–350.
[27]
UK Information Commissioner’s Office. 2018. Guide to the General Data Protection Regulation (GDPR). Principle (c): Data minimisation. https://ico.org.uk/for-organisations/guide-to-data-protection/guide-to-the-general-data-protection-regulation-gdpr/principles/data-minimisation/ (accessed Jan 25th, 2022).
[28]
Giancarlo Ruffo and Rossano Schifanella. 2009. A Peer-to-Peer Recommender System Based on Spontaneous Affinities. ACM TOIT 9, 1, Article 4 (2009), 34 pages.
[29]
Rossano Schifanella, André Panisson, Cristina Gena, and Giancarlo Ruffo. 2008. MobHinter: Epidemic Collaborative Filtering and Self-organization in Mobile Ad-hoc Networks. In Proceedings of the 2nd ACM Conference on Recommender Systems. ACM, 27–34.
[30]
Awanthika Senarath and Nalin Asanka Gamagedara Aarachchilage. 2018. Understanding Software Developers’ Approach towards Implementing Data Minimization. arxiv:1808.01479
[31]
Divya Shanmugam, Fernando Diaz, Samira Shabanian, Michèle Finck, and Asia J. Biega. 2022. Learning to Limit Data Collection via Scaling Laws: A Computational Interpretation for the Legal Principle of Data Minimization. In Proceedings of the 5th ACM Conf. on Fairness, Accountability, and Transparency. ACM, 839–849.
[32]
Amit Sharma and Dan Cosley. 2015. Studying and Modeling the Connection between People’s Preferences and Content Sharing. In Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing. Association for Computing Machinery, 1246–1257.
[33]
Reza Shokri, Pedram Pedarsani, George Theodorakopoulos, and Jean-Pierre Hubaux. 2009. Preserving Privacy in Collaborative Filtering through Distributed Aggregation of Offline Profiles. In Proceedings of the 3rd ACM Conference on Recommender Systems. ACM, 157–164.
[34]
Quoc-Tuan Truong, Aghiles Salah, and Hady Lauw. 2021. Multi-Modal Recommender Systems: Hands-On Exploration. In Proceedings of the 15th ACM Conference on Recommender Systems. ACM, 834–837.
[35]
European Union. 2012. Charter of Fundamental Rights of the European Union. Official Journal of the European Union 55, C 326 (2012), 391–407.
[36]
European Union. 2016. REGULATION (EU) 2016/679 OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation). Official Journal of the European Union 59, L 199 (2016), 1–88.
[37]
Bart van der Sloot. 2013. Discrimination and Privacy in the Information Society: Data Mining and Profiling in Large Databases. Springer, Chapter From Data Minimization to Data Minimummization, 273–287.
[38]
Aidmar Wainakh, Tim Grube, Jörg Daubert, and Max Mühlhäuser. 2019. Efficient Privacy-preserving Recommendations Based on Social Graphs. In Proceedings of the 13th ACM Conference on Recommender Systems. ACM, 78–86.
[39]
Hongyi Wen, Longqi Yang, Michael Sobolev, and Deborah Estrin. 2018. Exploring Recommendations under User-Controlled Data Filtering. In Proceedings of the 12th ACM Conference on Recommender Systems. ACM, 72–76.
[40]
Alan F. Westin. 2015. Privacy and Freedom. Ig Publishing.
[41]
Ligeng Zhu, Zhijian Liu, and Song Han. 2019. Deep Leakage from Gradients. In Adv. in Neural Information Processing Systems, Vol. 32. Curran Associates, Inc.
[42]
Minghui Zhu and Sonia Martínez. 2010. Discrete-time dynamic average consensus. Automatica 46, 2 (2010), 322–329.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICDCN '23: Proceedings of the 24th International Conference on Distributed Computing and Networking
January 2023
461 pages
ISBN:9781450397964
DOI:10.1145/3571306
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 January 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. data minimization
  2. decentralized collaborative filtering
  3. distributed gradient descent
  4. distributed systems
  5. recommender systems
  6. redundancy minimization

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICDCN 2023

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 148
    Total Downloads
  • Downloads (Last 12 months)37
  • Downloads (Last 6 weeks)1
Reflects downloads up to 26 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media