Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2815675.2815677acmconferencesArticle/Chapter ViewAbstractPublication PagesimcConference Proceedingsconference-collections
research-article

Dissecting UbuntuOne: Autopsy of a Global-scale Personal Cloud Back-end

Published: 28 October 2015 Publication History

Abstract

Personal Cloud services, such as Dropbox or Box, have been widely adopted by users. Unfortunately, very little is known about the internal operation and general characteristics of Personal Clouds since they are proprietary services.
In this paper, we focus on understanding the nature of Personal Clouds by presenting the internal structure and a measurement study of UbuntuOne (U1). We first detail the U$1$ architecture, core components involved in the U1 metadata service hosted in the datacenter of Canonical, as well as the interactions of U$1$ with Amazon S3 to outsource data storage. To our knowledge, this is the first research work to describe the internals of a large-scale Personal Cloud.
Second, by means of tracing the U$1$ servers, we provide an extensive analysis of its back-end activity for one month. Our analysis includes the study of the storage workload, the user behavior and the performance of the U1 metadata store. Moreover, based on our analysis, we suggest improvements to U1 that can also benefit similar Personal Cloud systems.
Finally, we contribute our dataset to the community, which is the first to contain the back-end activity of a large-scale Personal Cloud. We believe that our dataset provides unique opportunities for extending research in the field.

References

[1]
F. Research, "The personal cloud: Transforming personal computing, mobile, and web markets." http://www.forrester.com, 2011.
[2]
I. Drago, M. Mellia, M. M Munafo, A. Sperotto, R. Sadre, and A. Pras, "Inside dropbox: understanding personal cloud storage services," in ACM IMC'12, 2012, pp. 481--494.
[3]
I. Drago, E. Bocchi, M. Mellia, H. Slatman, and A. Pras, "Benchmarking personal cloud storage," in ACM IMC'13, 2013, pp. 205--212.
[4]
Z. Li, C. Wilson, Z. Jiang, Y. Liu, B. Y. Zhao, C. Jin, Z.-L. Zhang, and Y. Dai, "Efficient batched synchronization in dropbox-like cloud storage services," in ACM/IFIP/USENIX Middleware'13, 2013, pp. 307--327.
[5]
Z. Li, C. Jin, T. Xu, C. Wilson, Y. Liu, L. Cheng, Y. Liu, Y. Dai, and Z.-L. Zhang, "Towards network-level efficiency for cloud storage services," in ACM IMC'14, 2014.
[6]
W. Hu, T. Yang, and J. Matthews, "The good, the bad and the ugly of consumer cloud storage," ACM SIGOPS Operating Systems Review, vol. 44, no. 3, pp. 110--115, 2010.
[7]
R. Gracia-Tinedo, M. Sánchez-Artigas, A. Moreno-Martínez, C. Cotes, and P. García-López, "Actively measuring personal cloud storage," in IEEE CLOUD'13, 2013, pp. 301--308.
[8]
R. Sears, C. Van Ingen, and J. Gray, "To blob or not to blob: Large object storage in a database or a filesystem?" Microsoft Research, Tech. Rep., 2007.
[9]
J. Li, N. K. Sharma, D. R. Ports, and S. D. Gribble, "Tales of the tail: Hardware, os, and application-level sources of tail latency," in ACM SoCC'14, 2014.
[10]
P. García-López, S. Toda-Flores, C. Cotes-González, M. Sánchez-Artigas, and J. Lenton, "Stacksync: Bringing elasticity to dropbox-like file synchronization," in ACM/IFIP/USENIX Middleware'14, 2014, pp. 49--60.
[11]
"FP7 cloudspaces EU project," http://cloudspaces.eu.
[12]
R. Gracia-Tinedo, M. Sánchez-Artigas, and P. García-López, "Cloud-as-a-gift: Effectively exploiting personal cloud free accounts via REST APIs," in IEEE CLOUD'13, 2013, pp. 621--628.
[13]
E. Hammer-Lahav, "The OAuth 1.0 Protocol," http://tools.ietf.org/html/rfc5849, 2010.
[14]
M. G. Baker, J. H. Hartman, M. D. Kupfer, K. W. Shirriff, and J. K. Ousterhout, "Measurements of a distributed file system," in ACM SIGOPS Operating Systems Review, vol. 25, no. 5, 1991, pp. 198--212.
[15]
N. Agrawal, W. J. Bolosky, J. R. Douceur, and J. R. Lorch, "A five-year study of file-system metadata," ACM Transactions on Storage, vol. 3, no. 3, p. 9, 2007.
[16]
A. W. Leung, S. Pasupathy, G. R. Goodson, and E. L. Miller, "Measurement and analysis of large-scale network file system workloads." in USENIX ATC'08, vol. 1, no. 2, 2008, pp. 5--2.
[17]
W. Hsu and A. Smith, "Characteristics of i/o traffic in personal computer and server workloads," IBM Systems Journal, vol. 42, no. 2, pp. 347--372, 2003.
[18]
S. Liu, X. Huang, H. Fu, and G. Yang, "Understanding data characteristics and access patterns in a cloud storage system," in IEEE/ACM CCGrid'13, 2013, pp. 327--334.
[19]
S. Muralidhar, W. Lloyd, S. Roy, C. Hill, E. Lin, W. Liu, S. Pan, S. Shankar, V. Sivakumar, L. Tang, and S. Kumar, "f4: Facebook warm blob storage system," in USENIX OSDI'14, 2014, pp. 383--398.
[20]
J. Mirkovic and P. Reiher, "A taxonomy of DDoS attack and DDoS defense mechanisms," ACM SIGCOMM Computer Communication Review, vol. 34, no. 2, pp. 39--53, 2004.
[21]
A.-L. Barabasi, "The origin of bursts and heavy tails in human dynamics," Nature, vol. 435, no. 7039, pp. 207--211, 2005.
[22]
M. E. Crovella and A. Bestavros, "Self-similarity in world wide web traffic: evidence and possible causes," IEEE/ACM Transactions on Networking, vol. 5, no. 6, pp. 835--846, 1997.
[23]
S. Hätönen, A. Nyrhinen, L. Eggert, S. Strowes, P. Sarolahti, and M. Kojo, "An experimental study of home gateway characteristics," in ACM IMC'10, 2010, pp. 260--266.
[24]
P. Deolasee, A. Katkar, A. Panchbudhe, K. Ramamritham, and P. Shenoy, "Adaptive push-pull: disseminating dynamic web data," in ACM WWW'01, 2001, pp. 265--274.
[25]
Z. Hill, J. Li, M. Mao, A. Ruiz-Alvarez, and M. Humphrey, "Early observations on the performance of windows azure," in ACM HPDC'10, 2010, pp. 367--376.
[26]
M. R. Palankar, A. Iamnitchi, M. Ripeanu, and S. Garfinkel, "Amazon S3 for science grids: a viable solution?" in ACM DADC'08, 2008, pp. 55--64.
[27]
A. Bergen, Y. Coady, and R. McGeer, "Client bandwidth: The forgotten metric of online storage providers," in IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, 2011, pp. 543--548.
[28]
T. Mager, E. Biersack, and P. Michiardi, "A measurement study of the wuala on-line storage service," in IEEE P2P'12, 2012, pp. 237--248.
[29]
G. Gonçalves, I. Drago, A. P. C. da Silva, A. B. Vieira, and J. M. Almeida, "Modeling the dropbox client behavior," in IEEE ICC'14, vol. 14, 2014.
[30]
M. Mulazzani, S. Schrittwieser, M. Leithner, M. Huber, and E. Weippl, "Dark clouds on the horizon: Using cloud storage as attack vector and online slack space." in USENIX Security, 2011.
[31]
J. Silber, "Shutting down Ubuntu One file services," http://blog.canonical.com/2014/04/02/shutting- down-ubuntu-one-file-services/, April 2014.

Cited By

View all
  • (2024)SkyPIE: A Fast & Accurate Oracle for Object PlacementProceedings of the ACM on Management of Data10.1145/36393102:1(1-27)Online publication date: 26-Mar-2024
  • (2024)BTQoS: A Tenant Relationship-Aware QoS Framework for Multi-tenant Distributed Storage SystemWeb and Big Data10.1007/978-981-97-7241-4_16(245-260)Online publication date: 28-Aug-2024
  • (2023)Practical Storage-Compute Elasticity for Stream Data ProcessingProceedings of the 24th International Middleware Conference: Industrial Track10.1145/3626562.3626828(1-7)Online publication date: 11-Dec-2023
  • Show More Cited By

Index Terms

  1. Dissecting UbuntuOne: Autopsy of a Global-scale Personal Cloud Back-end

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      IMC '15: Proceedings of the 2015 Internet Measurement Conference
      October 2015
      550 pages
      ISBN:9781450338486
      DOI:10.1145/2815675
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 28 October 2015

      Permissions

      Request permissions for this article.

      Check for updates

      Badges

      • Best Dataset

      Author Tags

      1. measurement
      2. performance analysis
      3. personal cloud

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      IMC '15
      Sponsor:
      IMC '15: Internet Measurement Conference
      October 28 - 30, 2015
      Tokyo, Japan

      Acceptance Rates

      IMC '15 Paper Acceptance Rate 31 of 96 submissions, 32%;
      Overall Acceptance Rate 277 of 1,083 submissions, 26%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)10
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 13 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)SkyPIE: A Fast & Accurate Oracle for Object PlacementProceedings of the ACM on Management of Data10.1145/36393102:1(1-27)Online publication date: 26-Mar-2024
      • (2024)BTQoS: A Tenant Relationship-Aware QoS Framework for Multi-tenant Distributed Storage SystemWeb and Big Data10.1007/978-981-97-7241-4_16(245-260)Online publication date: 28-Aug-2024
      • (2023)Practical Storage-Compute Elasticity for Stream Data ProcessingProceedings of the 24th International Middleware Conference: Industrial Track10.1145/3626562.3626828(1-7)Online publication date: 11-Dec-2023
      • (2023)PravegaProceedings of the 24th International Middleware Conference10.1145/3590140.3629113(165-177)Online publication date: 27-Nov-2023
      • (2023)Characterization of I/O Behaviors in Cloud Storage WorkloadsIEEE Transactions on Computers10.1109/TC.2023.326372672:10(2726-2739)Online publication date: Oct-2023
      • (2022)A New Upper Bound on Cache Hit Probability for Non-Anticipative Caching PoliciesACM Transactions on Modeling and Performance Evaluation of Computing Systems10.1145/35473327:2-4(1-24)Online publication date: 26-Nov-2022
      • (2021)A New Upper Bound on Cache Hit Probability for Non-anticipative Caching PoliciesACM SIGMETRICS Performance Evaluation Review10.1145/3453953.345398548:3(138-143)Online publication date: 5-Mar-2021
      • (2021)Enabling Conflict-free Collaborations with Cloud Storage Services2021 IEEE 27th International Conference on Parallel and Distributed Systems (ICPADS)10.1109/ICPADS53394.2021.00082(615-621)Online publication date: Dec-2021
      • (2021)A Study of Costs and Benefits of Content Sharing in Personal Cloud StorageJournal of Network and Systems Management10.1007/s10922-021-09598-529:3Online publication date: 3-Apr-2021
      • (2020)Analyzing the distribution fit for storage workload and Internet traffic tracesPerformance Evaluation10.1016/j.peva.2020.102121(102121)Online publication date: Jun-2020
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media