Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
extended-abstract

Device analyzer: large-scale mobile data collection

Published: 17 April 2014 Publication History

Abstract

We collected usage information from 12,500 Android devices in the wild over the course of nearly 2 years. Our dataset contains 53 billion data points from 894 models of devices running 687 versions of Android. Processing the collected data presents a number of challenges ranging from scalability to consistency and privacy considerations. We present our system architecture for collection and analysis of this highly-distributed dataset, discuss how our system can reliably collect time-series data in the presence of unreliable timing information, and discuss issues and lessons learned that we believe apply to many other big data collection projects.

References

[1]
D. P. Anderson, J. Cobb, E. Korpela, M. Lebofsky, and D. Werthimer. SETI@home: an experiment in public-resource computing. Comms of the ACM, 45(11), 2002.
[2]
M. Böhmer, B. Hecht, J. Schöning, A. Krüger, and G. Bauer. Falling Asleep with Angry Birds, Facebook and Kindle A Large Scale Study on Mobile Application Usage. MobileHCI, 2011.
[3]
L. Carata, R. Sohan, A. Rice, and A. Hopper. IPAPI: Designing an Improved Provenance API. TaPP, 2013.
[4]
N. Eagle and A. S. Pentland. Reality mining: sensing complex social systems. Personal and Ubiquitous Computing, 10(4), 2005.
[5]
A. Girardello and F. Michahelles. AppAware: which mobile applications are hot? MobileHCI, 2010.
[6]
F. Khatib, S. Cooper, M. D. Tyka, K. Xu, I. Makedon, Z. Popovic, D. Baker, and F. Players. Algorithm discovery by protein folding game players. PNAS, 108(47), 2011.
[7]
M. Langheinrich. Privacy by Design - Principles of Privacy-Aware Ubiquitous Systems. UbiComp, 2001.
[8]
J. K. Laurila, J. Blom, O. Dousse, D. Gatica-perez, O. Bornet, and M. Miettinen. The Mobile Data Challenge : Big Data for Mobile Computing Research. PerCom, 2012.
[9]
E. Oliver. The challenges in large-scale smartphone user studies. HotPlanet, 2010.

Cited By

View all
  • (2024)A Data-Driven Evaluation of the Current Security State of Android Devices2024 IEEE Conference on Communications and Network Security (CNS)10.1109/CNS62487.2024.10735682(1-9)Online publication date: 30-Sep-2024
  • (2024)The digital pheromone: Building digital identity of smartphone users based on time-varying multivariatesICT Express10.1016/j.icte.2024.07.00810:5(981-988)Online publication date: Oct-2024
  • (2023)Smart City Based on Open Data: A SurveyIEEE Access10.1109/ACCESS.2023.328343611(56726-56748)Online publication date: 2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGMETRICS Performance Evaluation Review
ACM SIGMETRICS Performance Evaluation Review  Volume 41, Issue 4
March 2014
104 pages
ISSN:0163-5999
DOI:10.1145/2627534
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 April 2014
Published in SIGMETRICS Volume 41, Issue 4

Check for updates

Qualifiers

  • Extended-abstract

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)17
  • Downloads (Last 6 weeks)1
Reflects downloads up to 25 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Data-Driven Evaluation of the Current Security State of Android Devices2024 IEEE Conference on Communications and Network Security (CNS)10.1109/CNS62487.2024.10735682(1-9)Online publication date: 30-Sep-2024
  • (2024)The digital pheromone: Building digital identity of smartphone users based on time-varying multivariatesICT Express10.1016/j.icte.2024.07.00810:5(981-988)Online publication date: Oct-2024
  • (2023)Smart City Based on Open Data: A SurveyIEEE Access10.1109/ACCESS.2023.328343611(56726-56748)Online publication date: 2023
  • (2022)Understanding Privacy Risks and Perceived Benefits in Open Dataset Collection for Mobile Affective ComputingProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/35346236:2(1-26)Online publication date: 7-Jul-2022
  • (2022)Me in the Wild: An Exploratory Study Using Smartphones to Detect the Onset of DepressionWireless Mobile Communication and Healthcare10.1007/978-3-031-06368-8_9(121-145)Online publication date: 7-Jun-2022
  • (2021)Predicting Depression From Smartphone Behavioral Markers Using Machine Learning Methods, Hyperparameter Optimization, and Feature Importance Analysis: Exploratory StudyJMIR mHealth and uHealth10.2196/265409:7(e26540)Online publication date: 12-Jul-2021
  • (2021)Mining and Construction of Information Opportunity Cooperation Mode Based on Big Data Fusion Internet of ThingsIEEE Access10.1109/ACCESS.2021.30583579(29401-29415)Online publication date: 2021
  • (2021)Sensor-Based Human Activity and Behavior ComputingVision, Sensing and Analytics: Integrative Approaches10.1007/978-3-030-75490-7_6(147-176)Online publication date: 6-Jun-2021
  • (2020)Privacy-Preserving Sensor-Based Continuous Authentication and User Profiling: A ReviewSensors10.3390/s2101009221:1(92)Online publication date: 25-Dec-2020
  • (2020)Which App Features Are Being Used? Learning App Feature Usages from Interaction Data2020 IEEE 28th International Requirements Engineering Conference (RE)10.1109/RE48521.2020.00019(66-77)Online publication date: Aug-2020
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media