Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3326285.3329052acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiwqosConference Proceedingsconference-collections
research-article

Understanding I/O performance of IPFS storage: a client's perspective

Published: 24 June 2019 Publication History

Abstract

IPFS has surged into popularity in recent years. It organizes user data as multiple objects where users can obtain the objects according to their Content IDentifiers (CIDs). As a storage system, it is of great importance to understand its data I/O performance. But existing work still lacks such a comprehensive study. In this work, we deploy an IPFS storage system with geographically-distributed storage nodes on Amazon EC2. We then conduct extensive experiments to evaluate the performance of data I/O operations from a client's perspective. We find that the access patterns of I/O operations (e.g., request size) severely affect the I/O performance, since IPFS typically uses multiple I/O strategies to perform different I/O requests. Moreover, for the read operations, IPFS requires to resolve remote nodes and downloading objects via the internet. Our experimental study reveals that both resolving and downloading operations can become bottlenecks. Our results can shed light to optimizing IPFS in avoiding high-latency I/O operations.

References

[1]
Juan Benet. IPFS-content addressed, versioned, P2P file system. arXiv preprint arXiv:1407.3561, pages 1--11, 2014.
[2]
Self-describing content-addressed identifiers for distributed systems. https://github.com/multiformats/cid.
[3]
Petar Maymounkov and David Mazieres. Kademlia: A peer-to-peer information system based on the XOR metric. In Proc. of International Workshop on Peer-to-Peer Systems (IPTPS 02), pages 53--65, 2002.
[4]
Bastien Confais, Adrien Lebre, and Benoit Parrein. An object store service for a Fog/Edge computing infrastructure based on IPFS and a scale-out NAS. In Proc. of IEEE International Conf. on Fog and Edge Computing (ICFEC 17), pages 41--50, 2017.
[5]
Trent McConaghy, Rodolphe Marques, Andreas Müller, et al. BigchainDB 2.0 Whitepaper. https://www.bigchaindb.com/whitepaper/, 2016.
[6]
Antonio Tenorio-Fornés, Viktor Jacynycz, David Llop-Vila, et al. Towards a decentralized process for scientific publication and peer review using blockchain and IPFS. In Proc. of the 52nd Hawaii International Conf. on System Sciences, pages 1--10, 2019.
[7]
R. Fielding, UC Irvine, J. Gettys, et al. Hypertext Transfer Protocol - HTTP/1.1. https://tools.ietf.org/html/rfc2616.
[8]
Amazon EC2. https://aws.amazon.com/.
[9]
Ion Stoica, Robert Morris, David Liben-Nowell, et al. Chord: A scalable peer-to-peer lookup protocol for internet applications. IEEE/ACM Transactions on Networking (TON), 11(1):17--32, 2003.
[10]
David Karger, Eric Lehman, Tom Leighton, et al. Consistent hashing and random trees: Distributed caching protocols for relieving hot spots on the World Wide Web. In Proc. of the twenty-ninth annual ACM Symp. on Theory of Computing (STOC 97), pages 654--663, 1997.
[11]
Ben Y Zhao, Ling Huang, Jeremy Stribling, et al. Tapestry: A resilient global-scale overlay for service deployment. IEEE Journal on Selected Areas in Communications (JSAC), 22(1):41--53, 2004.
[12]
Anjali Gupta, Barbara Liskov, and Rodrigo Rodrigues. One hop lookups for peer-to-peer overlays. In Proc. of USENIX Workshop on Hot Topics in Operating Systems (HotOS 03), pages 1--6, 2003.
[13]
Jinyang Li, Jeremy Stribling, Thomer M Gil, et al. Comparing the performance of distributed hash tables under churn. In Proc. of International Workshop on Peer-to-Peer Systems (IPTPS 04), pages 87--99, 2004.
[14]
Michael J Freedman, Eric Freudenthal, and David Mazieres. Democratizing content publication with Coral. In Proc. of USENIX Symp. on Networked Systems Design and Implementation (NSDI 04), pages 1--18, 2004.
[15]
Sylvia Ratnasamy, Paul Francis, Mark Handley, et al. A scalable content-addressable network. In Proc. of ACM Special Interest Group on Data Communication (SIGCOMM 01), pages 161--172, 2001.
[16]
Antony Rowstron and Peter Druschel. Pastry: Scalable, decentralized object location, and routing for large-scale peer-to-peer systems. In Proc. of IFIP/ACM International Conf. on Distributed Systems Platforms and Open Distributed Processing, pages 329--350, 2001.
[17]
Bram Cohen. Incentives build robustness in bittorrent. In Proc. of Workshop on Economics of Peer-to-Peer systems, volume 6, pages 68--72, 2003.
[18]
Ingmar Baumgart and Sebastian Mies. S/Kademlia: A practicable approach towards secure key-based routing. In Proc. of IEEE International Conf. on Parallel and Distributed Systems (IPDPS 07), pages 1--8, 2007.
[19]
Dahlia Malkhi, Moni Naor, and David Ratajczak. Viceroy: A scalable and dynamic emulation of the butterfly. In Proc. of annual ACM Symp. on Principles of distributed computing (PODC 02), pages 183--192, 2002.
[20]
Dimitri Pertin, Sylvain David, Pierre Evenou, et al. Distributed file system based on erasure coding for I/O-intensive applications. In Proc. of International Conf. on Cloud Computing and Service Science (CLOSER 14), pages 451--456, 2014.
[21]
Bruno Produit. Using blockchain technology in distributed storage systems. https://courses.cs.ut.ee/MTAT.07.022/2018_spring/uploads/Main/bruno-report-s17-18.pdf, 2018.
[22]
Bokang Jia, Chenhao Xu, Rehan Gotla, et al. Opus-decentralized music distribution using interplanetary file systems (IPFS) on the Ethereum blockchain v0.8.3. https://foxico.io/static/uploads/wpfile-f132f54e911e7ae32712890f4e448030.pdf, 2016.
[23]
Mat Kelly, Sawood Alam, Michael L Nelson, et al. InterPlanetary Wayback: Peer-to-Peer permanence of web archives. In Proc. of International Conf. on Theory and Practice of Digital Libraries, pages 411--416, 2016.
[24]
Yongle Chen, Hui Li, Kejiao Li, and Jiyang Zhang. An improved P2P file system scheme based on IPFS and Blockchain. In Proc. of 2017 IEEE International Conf. on Big Data (Big Data 17), pages 2652--2657, 2017.
[25]
Shangping Wang, Yinglong Zhang, and Yaling Zhang. A blockchain-based framework for data sharing with fine-grained access control in decentralized storage systems. IEEE Access, 6:38437--38450, 2018.
[26]
Guangyuan Wu, Fangming Liu, Haowen Tang, et al. On the performance of cloud storage applications with global measurement. In Proc. of IEEE/ACM International Symp. on Quality of Service (IWQoS 16), 2016.
[27]
Li Li, Xu Ke, Wang Dan, et al. A measurement study on skype voice and video calls in LTE networks on high speed rails. In Proc. of IEEE/ACM International Symp. on Quality of Service (IWQoS 17), pages 1--10, 2017.
[28]
Jun Li and Baochun Li. Zebra: Demand-aware erasure coding for distributed storage systems. In IEEE/ACM International Symp. on Quality of Service (IWQoS 16), pages 1--10, 2016.
[29]
Jiajie Shen, Yi Li, Yangfan Zhou, and Xin Wang. Mobile cloud-of-clouds storage made efficient: A network coding based approach. In IEEE Symp. on Reliable Distributed Systems (SRDS), pages 72--82, 2018.
[30]
Jiajie Shen, Kai Zhang, Jiazhen Gu, et al. Efficient scheduling for multi-block updates in erasure coding based storage systems. IEEE Trans. on Computers (TC), 67(4):573--581, 2018.
[31]
Node.js toolkit. https://nodejs.org/.
[32]
Binbing Hou, Feng Chen, Zhonghong Ou, et al. Understanding I/O performance behaviors of cloud storage from a client's perspective. In Proc. of IEEE Conf. on MASS Storage Systems and Technologies (MSST 17), pages 1--12, 2017.
[33]
Joel Alcântara, Tiago Oliveira, and Alysson Bessani. Ginja: One-dollar cloud-based disaster recovery for databases. In Proc. of the 18th ACM/IFIP/USENIX Middleware Conf. (Middleware 17), pages 248--260, 2017.
[34]
Ana Klimovic, Yawen Wang, Christos Kozyrakis, et al. Understanding ephemeral storage for serverless analytics. In Proc. of 2018 USENIX Annual Technical Conf. (ATC 18), pages 789--794, 2018.
[35]
Song Li, Yangfan Zhou, Lei Jiao, et al. Towards operational cost minimization in hybrid clouds for dynamic resource provisioning with delay-aware optimization. IEEE Trans. on Services Computing (TSC), 8(3):398--409, 2015.
[36]
Dushyanth Narayanan, Austin Donnelly, and Antony Rowstron. Write offloading: practical power management for enterprise storage. In Proc. of USENIX Conf. on File and Storage Technologies (FAST 08), pages 256--267, 2008.

Cited By

View all
  • (2024)Blockchain-Based Caching Architecture for DApp Data Security and DeliverySensors10.3390/s2414455924:14(4559)Online publication date: 14-Jul-2024
  • (2024)Secrets are Forever: Characterizing Sensitive File Leaks on IPFS2024 IFIP Networking Conference (IFIP Networking)10.23919/IFIPNetworking62109.2024.10619838(522-528)Online publication date: 3-Jun-2024
  • (2024)A Closer Look into IPFS: Accessibility, Content, and PerformanceACM SIGMETRICS Performance Evaluation Review10.1145/3673660.365504052:1(77-78)Online publication date: 13-Jun-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
IWQoS '19: Proceedings of the International Symposium on Quality of Service
June 2019
420 pages
ISBN:9781450367783
DOI:10.1145/3326285
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 June 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. IPFS
  2. content addressable technology
  3. data I/O performance
  4. performance measurement
  5. storage system

Qualifiers

  • Research-article

Conference

IWQoS '19

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)77
  • Downloads (Last 6 weeks)0
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Blockchain-Based Caching Architecture for DApp Data Security and DeliverySensors10.3390/s2414455924:14(4559)Online publication date: 14-Jul-2024
  • (2024)Secrets are Forever: Characterizing Sensitive File Leaks on IPFS2024 IFIP Networking Conference (IFIP Networking)10.23919/IFIPNetworking62109.2024.10619838(522-528)Online publication date: 3-Jun-2024
  • (2024)A Closer Look into IPFS: Accessibility, Content, and PerformanceACM SIGMETRICS Performance Evaluation Review10.1145/3673660.365504052:1(77-78)Online publication date: 13-Jun-2024
  • (2024)A Closer Look into IPFS: Accessibility, Content, and PerformanceProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/36560158:2(1-31)Online publication date: 29-May-2024
  • (2024)A Closer Look into IPFS: Accessibility, Content, and PerformanceAbstracts of the 2024 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems10.1145/3652963.3655040(77-78)Online publication date: 10-Jun-2024
  • (2024)Web-based Visualization and Analytics of Petascale data: Equity as a Tide that Lifts All Boats2024 IEEE 14th Symposium on Large Data Analysis and Visualization (LDAV)10.1109/LDAV64567.2024.00009(1-11)Online publication date: 13-Oct-2024
  • (2024)Plausibly Deniable Content Discovery for Bitswap Using Random Walks2024 IEEE 49th Conference on Local Computer Networks (LCN)10.1109/LCN60385.2024.10639726(1-9)Online publication date: 8-Oct-2024
  • (2024)Resilient, Auditable, and Secure IoT-Enabled Smart Inverter Firmware Amendments With BlockchainIEEE Internet of Things Journal10.1109/JIOT.2023.332195411:5(8945-8960)Online publication date: 1-Mar-2024
  • (2024)IPFS in the Fast Lane: Accelerating Record Storage with Optimistic ProvideIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621404(1920-1929)Online publication date: 20-May-2024
  • (2024)Blockchain-Based Decentralized Storage Design for Data Confidence Over Cloud-Native Edge InfrastructureIEEE Access10.1109/ACCESS.2024.338301012(50083-50099)Online publication date: 2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media