article

Review Article: Lifetime and availability of data stored on a P2P system: Evaluation of redundancy and recovery schemes

Authors:

Abdulhalim Dandoush,

Sara Alouf,

Philippe NainAuthors Info & Claims

Computer Networks: The International Journal of Computer and Telecommunications Networking, Volume 64

Pages 243 - 260

https://doi.org/10.1016/j.comnet.2014.02.015

Published: 01 May 2014 Publication History

Abstract

This paper studies the performance of Peer-to-Peer storage and backup systems (P2PSS). These systems are based on three pillars: data fragmentation and dissemination among the peers, redundancy mechanisms to cope with peers churn and repair mechanisms to recover lost or temporarily unavailable data. Usually, redundancy is achieved either by using replication or by using erasure codes. A new class of network coding (regenerating codes) has been proposed recently. Therefore, we will adapt our work to these three redundancy schemes. We introduce two mechanisms for recovering lost data and evaluate their performance by modeling them through absorbing Markov chains. Specifically, we evaluate the quality of service provided to users in terms of durability and availability of stored data for each recovery mechanism and deduce the impact of its parameters on the system performance. The first mechanism is centralized and based on the use of a single server that can recover multiple losses at once. The second mechanism is distributed: reconstruction of lost fragments is iterated sequentially on many peers until that the required level of redundancy is attained. The key assumptions made in this work, in particular, the assumptions made on the recovery process and peer on-times distribution, are in agreement with the analysis in [1] and in [2] respectively. The models are thereby general enough to be applicable to many distributed environments as shown through numerical computations. We find that, in stable environments such as local area or research institute networks where machines are usually highly available, the distributed-repair scheme in erasure-coded systems offers a reliable, scalable and cheap storage/backup solution. For the case of highly dynamic environments, in general, the distributed-repair scheme is inefficient, in particular to maintain high data availability, unless the data redundancy is high. Using regenerating codes overcomes this limitation of the distributed-repair scheme. P2PSS with centralized-repair scheme are efficient in any environment but have the disadvantage of relying on a centralized authority. However, the analysis of the overhead cost (e.g. computation, bandwidth and complexity cost) resulting from the different redundancy schemes with respect to their advantages (e.g. simplicity), is left for future work.

References

[1]

A. Dandoush, S. Alouf, P. Nain, Simulation analysis of download and recovery processes in P2P storage systems, in: Proc. of 21st International Teletraffic Congress (ITC), Paris, France, 2009.

Abstract

References

Recommendations

Research and performance evaluation of data replication technology in distributed storage systems

A free-rider aware reputation system for peer-to-peer file-sharing networks

Reducing data replication overhead in DHT based peer-to-peer system

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

View options

Share

Share this Publication link

Share on social media

Affiliations