Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3487553.3524216acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
short-paper

Privacy-Preserving Methods for Repeated Measures Designs

Published: 16 August 2022 Publication History

Abstract

Evolving privacy practices have led to increasing restrictions around the collection and storage of user level data. In turn, this has resulted in analytical challenges, such as properly estimating experimental statistics, especially in the case of long-running tests with repeated measurements. We propose a method for analyzing A/B tests which avoids aggregating and storing data at the unit-level. The approach utilizes a unit-level hashing mechanism which generates and stores the first and second moments of random subsets of the original population, thus allowing estimation of statistics, such as the variance of the average treatment effect (ATE), by bootstrap. Across a sample of past A/B tests at Netflix, we provide empirical results that demonstrate the effectiveness of the approach, and show how techniques to improve the sensitivity of experiments, such as regression adjustment, are still feasible under this new design.

References

[1]
Jan Philipp Albrecht. 2016. How the GDPR will change the world. Eur. Data Prot. L. Rev. 2 (2016), 287.
[2]
Nicholas Chamandy, Omkar Muralidharan, Amir Najmi, and Siddartha Naidu. 2012. Estimating Uncertainty for Massive Data Streams. Technical Report. Google.
[3]
Alex Deng, Ulf Knoblich, and Jiannan Lu. 2018. Applying the Delta method in metric analytics: A practical guide with novel ideas. arXiv preprint arXiv:1803.06336(2018).
[4]
Alex Deng, Ya Xu, Ron Kohavi, and Toby Walker. 2013. Improving the sensitivity of online controlled experiments by utilizing pre-experiment data. In Proceedings of the sixth ACM international conference on Web search and data mining. 123–132.
[5]
Bradley Efron. 1981. Nonparametric estimates of standard error: the jackknife, the bootstrap and other methods. Biometrika 68, 3 (1981), 589–599.
[6]
Bradley Efron and Robert J Tibshirani. 1994. An introduction to the bootstrap. CRC press.
[7]
Aleksander Fabijan, Pavel Dmitriev, Colin McFarland, Lukas Vermeer, Helena Holmström Olsson, and Jan Bosch. 2018. Experimentation growth: Evolving trustworthy A/B testing capabilities in online software companies. Journal of Software: Evolution and Process 30, 12 (2018), e2113.
[8]
Michelle Goddard. 2017. The EU General Data Protection Regulation (GDPR): European regulation that has a global impact. International Journal of Market Research 59, 6 (2017), 703–705.
[9]
Allegra Hobbs. 2021. Facebook Battles Apple Over User Privacy. (2021).
[10]
Kimberly A Houser and W Gregory Voss. 2018. GDPR: The end of Google and Facebook or a new paradigm in data privacy. Rich. JL & Tech. 25(2018), 1.
[11]
Ron Kohavi, Alex Deng, Brian Frasca, Toby Walker, Ya Xu, and Nils Pohlmann. 2013. Online controlled experiments at large scale. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining. 1168–1176.
[12]
Ron Kohavi, Diane Tang, and Ya Xu. 2020. Trustworthy online controlled experiments: A practical guide to a/b testing. Cambridge University Press.
[13]
David Letson and BD McCullough. 1998. Better confidence intervals: The double bootstrap with no pivot. American journal of agricultural economics 80, 3 (1998), 552–559.
[14]
Winston Lin. 2013. Agnostic notes on regression adjustments to experimental data: Reexamining Freedman’s critique. The Annals of Applied Statistics 7, 1 (2013), 295–318.
[15]
Kevin Liou and Sean J Taylor. 2020. Variance-Weighted Estimators to Improve Sensitivity in Online Experiments. In Proceedings of the 21st ACM Conference on Economics and Computation. 837–850.
[16]
Art B Owen, Dean Eckles, 2012. Bootstrapping data arrays of arbitrary order. The Annals of Applied Statistics 6, 3 (2012), 895–927.
[17]
Giovanni Maria Riva, Alexandr Vasenev, and Nicola Zannone. 2020. SoK: Engineering privacy-aware high-tech systems. In Proceedings of the 15th International Conference on Availability, Reliability and Security. 1–10.
[18]
Donald B Rubin. 1973. The use of matched sampling and regression adjustment to remove bias in observational studies. Biometrics (1973), 185–203.
[19]
Julio Sánchez-Meca and Fulgencio Marin-Martinez. 1998. Weighting by inverse variance or by sample size in meta-analysis: A simulation study. Educational and Psychological Measurement 58, 2 (1998), 211–220.
[20]
David Stites and Katie Skinner. 2014. User privacy on iOS and OS X. In The Apple Worldwide Developers Conference.
[21]
Huizhi Xie and Juliette Aurisset. 2016. Improving the sensitivity of online controlled experiments: Case studies at netflix. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 645–654.

Cited By

View all
  • (2024)Budget Recycling Differential Privacy2024 IEEE Symposium on Security and Privacy (SP)10.1109/SP54263.2024.00212(1028-1046)Online publication date: 19-May-2024
  • (2024)Responsible retrospection: adapting responsible innovation to the liminal innovation of ICTsJournal of Responsible Innovation10.1080/23299460.2024.232623211:1Online publication date: 24-Mar-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '22: Companion Proceedings of the Web Conference 2022
April 2022
1338 pages
ISBN:9781450391306
DOI:10.1145/3487553
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 August 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. A/B testing
  2. controlled experiments
  3. privacy

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

WWW '22
Sponsor:
WWW '22: The ACM Web Conference 2022
April 25 - 29, 2022
Virtual Event, Lyon, France

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)30
  • Downloads (Last 6 weeks)6
Reflects downloads up to 14 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Budget Recycling Differential Privacy2024 IEEE Symposium on Security and Privacy (SP)10.1109/SP54263.2024.00212(1028-1046)Online publication date: 19-May-2024
  • (2024)Responsible retrospection: adapting responsible innovation to the liminal innovation of ICTsJournal of Responsible Innovation10.1080/23299460.2024.232623211:1Online publication date: 24-Mar-2024

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media