Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2398776.2398827acmconferencesArticle/Chapter ViewAbstractPublication PagesimcConference Proceedingsconference-collections
research-article

Inside dropbox: understanding personal cloud storage services

Published: 14 November 2012 Publication History

Abstract

Personal cloud storage services are gaining popularity. With a rush of providers to enter the market and an increasing offer of cheap storage space, it is to be expected that cloud storage will soon generate a high amount of Internet traffic. Very little is known about the architecture and the performance of such systems, and the workload they have to face. This understanding is essential for designing efficient cloud storage systems and predicting their impact on the network.
This paper presents a characterization of Dropbox, the leading solution in personal cloud storage in our datasets. By means of passive measurements, we analyze data from four vantage points in Europe, collected during 42 consecutive days. Our contributions are threefold: Firstly, we are the first to study Dropbox, which we show to be the most widely-used cloud storage system, already accounting for a volume equivalent to around one third of the YouTube traffic at campus networks on some days. Secondly, we characterize the workload users in different environments generate to the system, highlighting how this reflects on network traffic. Lastly, our results show possible performance bottlenecks caused by both the current system architecture and the storage protocol. This is exacerbated for users connected far from storage data-centers.
All measurements used in our analyses are publicly available in anonymized form at the SimpleWeb trace repository: http://traces.simpleweb.org/dropbox/

Supplementary Material

PDF File (140.pdf)
Summary Review Documentation for "Inside Dropbox: Understanding Personal Cloud Storage Services", Authors: I. Drago, M. Mellia, M. Munafo, A. Sperotto, R. Sadre, A. Pras

References

[1]
A. Bergen, Y. Coady, and R. McGeer. Client Bandwidth: The Forgotten Metric of Online Storage Providers. In Proceedings of the 2011 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, PacRim'2011, pages 543--548, 2011.
[2]
I. Bermudez, M. Mellia, M. M. Munafò. R. Keralapura, and A. Nucci. DNS to the Rescue: Discerning Content and Services in a Tangled Web. In Proceedings of the 12th ACM SIGCOMM Conference on Internet Measurement, IMC'12, 2012.
[3]
M. Cha, H. Kwak, P. Rodriguez, Y.-Y. Ahn, and S. Moon. I Tube, You Tube, Everybody Tubes: Analyzing the World's Largest User Generated Content Video System. In Proceedings of the 7th ACM SIGCOMM Conference on Internet Measurement, IMC'07, pages 1--14, 2007.
[4]
N. Dukkipati, T. Refice, Y. Cheng, J. Chu, T. Herbert, A. Agarwal, A. Jain, and N. Sutin. An Argument for Increasing TCP's Initial Congestion Window. SIGCOMM Comput. Commun. Rev., 40(3):26--33, 2010.
[5]
A. Finamore, M. Mellia, M. Meo, M. M. Munafò and D. Rossi. Experiences of Internet Traffic Monitoring with Tstat. IEEE Network, 25(3):8--14, 2011.
[6]
A. Finamore, M. Mellia, M. M. Munafò, R. Torres, and S. G. Rao. YouTube Everywhere: Impact of Device and Infrastructure Synergies on User Experience. In Proceedings of the 11th ACM SIGCOMM Conference on Internet Measurement, IMC'11, pages 345--360, 2011.
[7]
M. Gjoka, M. Sirivianos, A. Markopoulou, and X. Yang. Poking Facebook: Characterization of OSN Applications. In Proceedings of the First Workshop on Online Social Networks, WOSN'08, pages 31--36, 2008.
[8]
S. Halevi, D. Harnik, B. Pinkas, and A. Shulman-Peleg. Proofs of Ownership in Remote Storage Systems. In Proceedings of the 18th ACM Conference on Computer and Communications Security, CCS'11, pages 491--500, 2011.
[9]
D. Harnik, B. Pinkas, and A. Shulman-Peleg. Side Channels in Cloud Services: Deduplication in Cloud Storage. IEEE Security and Privacy, 8(6):40--47, 2010.
[10]
S. Hätönen, A. Nyrhinen, L. Eggert, S. Strowes, P. Sarolahti, and M. Kojo. An Experimental Study of Home Gateway Characteristics. In Proceedings of the 10th ACM SIGCOMM Conference on Internet Measurement, IMC'10, pages 260--266, 2010.
[11]
W. Hu, T. Yang, and J. N. Matthews. The Good, the Bad and the Ugly of Consumer Cloud Storage. ACM SIGOPS Operating Systems Review, 44(3):110--115, 2010.
[12]
A. Lenk, M. Klems, J. Nimis, S. Tai, and T. Sandholm. What's Inside the Cloud? An Architectural Map of the Cloud Landscape. In Proceedings of the 2009 ICSE Workshop on Software Engineering Challenges of Cloud Computing, CLOUD'09, pages 23--31, 2009.
[13]
A. Li, X. Yang, S. Kandula, and M. Zhang. CloudCmp: Comparing Public Cloud Providers. In Proceedings of the 10th ACM SIGCOMM Conference on Internet Measurement, IMC'10, pages 1--14, 2010.
[14]
M. Mellia, M. Meo, L. Muscariello, and D. Rossi. Passive Analysis of TCP Anomalies. Computer Networks, 52(14):2663--2676, 2008.
[15]
A. Mislove, M. Marcon, K. P. Gummadi, P. Druschel, and B. Bhattacharjee. Measurement and Analysis of Online Social Networks. In Proceedings of the 7th ACM SIGCOMM Conference on Internet Measurement, IMC'07, pages 29--42, 2007.
[16]
M. Mulazzani, S. Schrittwieser, M. Leithner, M. Huber, and E. Weippl. Dark Clouds on the Horizon: Using Cloud Storage as Attack Vector and Online Slack Space. In Proceedings of the 20th USENIX Conference on Security, SEC'11, 2011.
[17]
G. Wang and T. E. Ng. The Impact of Virtualization on Network Performance of Amazon EC2 Data Center. In Proceedings of the 29th IEEE INFOCOM, pages 1--9, 2010.
[18]
Q. Zhang, L. Cheng, and R. Boutaba. Cloud Computing: State-of-the-Art and Research Challenges.Journal of Internet Services and Applications, 1:7--18, 2010.
[19]
M. Zhou, R. Zhang, W. Xie, W. Qian, and A. Zhou. Security and Privacy in Cloud Computing: A Survey. In Sixth International Conference on Semantics Knowledge and Grid, SKG'10, pages 105--112, 2010.

Cited By

View all
  • (2024)The Design of Fast Delta Encoding for Delta Compression Based Storage SystemsACM Transactions on Storage10.1145/3664817Online publication date: 14-May-2024
  • (2024)Secure Data Integrity Check Based on Verified Public Key Encryption With Equality Test for Multi-Cloud StorageIEEE Transactions on Dependable and Secure Computing10.1109/TDSC.2024.337536921:6(5359-5373)Online publication date: Nov-2024
  • (2024)An Evaluation of the Effect of Network Cost Optimization for Leadership Class SupercomputersSC24: International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC41406.2024.00037(1-16)Online publication date: 17-Nov-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
IMC '12: Proceedings of the 2012 Internet Measurement Conference
November 2012
572 pages
ISBN:9781450317054
DOI:10.1145/2398776
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 November 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cloud storage
  2. dropbox
  3. internet measurement

Qualifiers

  • Research-article

Conference

IMC '12
Sponsor:
IMC '12: Internet Measurement Conference
November 14 - 16, 2012
Massachusetts, Boston, USA

Acceptance Rates

Overall Acceptance Rate 277 of 1,083 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)312
  • Downloads (Last 6 weeks)21
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)The Design of Fast Delta Encoding for Delta Compression Based Storage SystemsACM Transactions on Storage10.1145/3664817Online publication date: 14-May-2024
  • (2024)Secure Data Integrity Check Based on Verified Public Key Encryption With Equality Test for Multi-Cloud StorageIEEE Transactions on Dependable and Secure Computing10.1109/TDSC.2024.337536921:6(5359-5373)Online publication date: Nov-2024
  • (2024)An Evaluation of the Effect of Network Cost Optimization for Leadership Class SupercomputersSC24: International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC41406.2024.00037(1-16)Online publication date: 17-Nov-2024
  • (2024)A Compact and Accurate Sketch for Estimating a Large Range of Set Difference Cardinalities2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00110(1338-1351)Online publication date: 13-May-2024
  • (2024)LearnedSync: A Learning-Based Sync Optimization for Cloud StorageAlgorithms and Architectures for Parallel Processing10.1007/978-981-97-0801-7_1(1-21)Online publication date: 1-Mar-2024
  • (2023)An Active File Mode Transition Mechanism Based on Directory Activation Ratio in File Synchronization ServiceApplied Sciences10.3390/app1310597013:10(5970)Online publication date: 12-May-2023
  • (2023)Study on Artificial Intelligence (AI) Assisted English Linguistics Teaching SystemProceedings of the 2023 2nd International Conference on Educational Innovation and Multimedia Technology (EIMT 2023)10.2991/978-94-6463-192-0_67(511-520)Online publication date: 5-Jul-2023
  • (2023)Sanare: Pluggable Intrusion Recovery for Web ApplicationsIEEE Transactions on Dependable and Secure Computing10.1109/TDSC.2021.313947220:1(590-605)Online publication date: 1-Jan-2023
  • (2023)PuppetStack: A tool for building high-availability private cloud infrastructures2023 IEEE International Conference on Cloud Computing Technology and Science (CloudCom)10.1109/CloudCom59040.2023.00044(224-231)Online publication date: 4-Dec-2023
  • (2023)Blockchain-Based Verifiable and Reliable File Access Control Layer for Cloud Storages2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386940(2303-2310)Online publication date: 15-Dec-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media