Trustworthy Experimentation Under Telemetry Loss

Gupchup, Jayant; Hosseinkashi, Yasaman; Dmitriev, Pavel; Schneider, Daniel; Cutler, Ross; Jefremov, Andrei; Ellis, Martin

doi:10.1145/3269206.3271747

Computer Science > Other Computer Science

arXiv:1903.12470 (cs)

[Submitted on 22 Jan 2019]

Title:Trustworthy Experimentation Under Telemetry Loss

Authors:Jayant Gupchup, Yasaman Hosseinkashi, Pavel Dmitriev, Daniel Schneider, Ross Cutler, Andrei Jefremov, Martin Ellis

View PDF

Abstract:Failure to accurately measure the outcomes of an experiment can lead to bias and incorrect conclusions. Online controlled experiments (aka AB tests) are increasingly being used to make decisions to improve websites as well as mobile and desktop applications. We argue that loss of telemetry data (during upload or post-processing) can skew the results of experiments, leading to loss of statistical power and inaccurate or erroneous conclusions. By systematically investigating the causes of telemetry loss, we argue that it is not practical to entirely eliminate it. Consequently, experimentation systems need to be robust to its effects. Furthermore, we note that it is nontrivial to measure the absolute level of telemetry loss in an experimentation system. In this paper, we take a top-down approach towards solving this problem. We motivate the impact of loss qualitatively using experiments in real applications deployed at scale, and formalize the problem by presenting a theoretical breakdown of the bias introduced by loss. Based on this foundation, we present a general framework for quantitatively evaluating the impact of telemetry loss, and present two solutions to measure the absolute levels of loss. This framework is used by well-known applications at Microsoft, with millions of users and billions of sessions. These general principles can be adopted by any application to improve the overall trustworthiness of experimentation and data-driven decision making.

Comments:	Proceedings of the 27th ACM International Conference on Information and Knowledge Management, October 2018
Subjects:	Other Computer Science (cs.OH)
Cite as:	arXiv:1903.12470 [cs.OH]
	(or arXiv:1903.12470v1 [cs.OH] for this version)
	https://doi.org/10.48550/arXiv.1903.12470
Related DOI:	https://doi.org/10.1145/3269206.3271747

Submission history

From: Jayant Gupchup A [view email]
[v1] Tue, 22 Jan 2019 01:36:01 UTC (1,545 KB)

Computer Science > Other Computer Science

Title:Trustworthy Experimentation Under Telemetry Loss

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Other Computer Science

Title:Trustworthy Experimentation Under Telemetry Loss

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators