Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

TBFR: a threshold-based file replication approach for increased file availability and its formal verification

Published: 01 April 2013 Publication History

Abstract

This work proposes an active replication mechanism, i.e., threshold-based file replication TBFR approach that replicates the file, from one node to the other node, when the total number of request for a particular file, reaches the threshold value. Significant performance improvement has been observed in terms of total number of interactions required for file replication and performance comparison of the TBFR has been carried out with the request reply RR and request reply acknowledgement RRA protocol. Results indicate that, the proposed approach reduces the number of messages exchanged for file replication by 33% to 55% and 25% to 50% in case when unnecessary file replication is avoided. With this approach service time is reduced by about 36% as compared to no replication. A relationship between the formal aspect of the static file replication and dynamic TBFR model is established using process algebraic approach for stability analysis of the proposed model.

References

[1]
Baru, C., Moore, R., Rajasekar, A. and Wan, M. (1998) 'The SDSC storage resource broker', Proceedings of CASCON'98, IBM Press, Boston, MA.
[2]
Baumgartel, P. (2002) Oracle Replication: An Introduction, [online] http://www.nyoug.org/ 200212baumgartel.pdf (accessed 31 July 2012).
[3]
Bell, W.H., Cameron, D.G., Carvajal, S., Millar, A.P., Stockinger, K. and Zini, F. (2003) 'Evaluation of an economy-based file replication strategy for a data grid', Proceedings of the 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003 (CCGrid'03), Tokyo, Japan, IEEE Press, Los Alamitos, CA.
[4]
Birman, K. and Cooper, R. (1991) 'The ISIS project: real experience with a fault tolerant programming system', SIGOPS Oper. Syst. Rev., Vol. 25, pp.103-107.
[5]
Buyya, R. (2008) High Performance Cluster Computing, Vol. 1, Pearson Education, New Delhi.
[6]
Cabri, G., Corradi, A. and Zambonelli, F. (1996) 'Experience of adaptive replication in distributed file systems', IEEE Proc. of 22nd EUROMICRO Conf. on Beyond 2000: Hardware and Software Design Strategies, pp.459-466.
[7]
Cheng, H.Y. and King, C.T. (1999) 'File replication for enhancing the availability of parallel I/O systems on clusters', 1st IEEE Computer Society Int. Workshop on Cluster Computing, pp.137-144.
[8]
Chervenak, A., Deelman, E., Foster, I., Guy, L., Hoschek, W., Iamnitchi, A., Kesselman, C., Kunst, P., Ripeanu, M., Schwartzkopf, B., Stockinger, H., Stockinger, K. and Tierney, B. (2002) 'Giggle: a framework for constructing scalable replica location services', Proceedings of the IEEE/ACM Conference on Supercomputing (SC'02), Baltimore, MD.
[9]
Clarke, I., Sandberg, O., Wiley, B. and Hong, T.W. (2001) 'Freenet: a distributed anonymous information storage and retrieval system', in International Workshop on Designing Privacy Enhancing Technologies: Design Issues in Anonymity and Unobservability, pp.46-66.
[10]
Cohen, E. and Shenker, S. (2002) 'Replication strategies in unstructured peer-to-peer networks', The ACM SIGCOMM'02 Conference.
[11]
Dabek, F. et al. (2001) 'Wide-area cooperative storage with CFS', Proc. of the 18th ACM Symposium on Operating Systems Principles, 21-24 October, Banff, Alberta, Canada.
[12]
DeCandia, G. et al. (2007) 'Dynamo: Amazon's highly available key-value store', Proc. of 21st ACM Symposium on Operating Systems Principles.
[13]
Demers, A. et al. (1994) 'The Bayou architecture: support for data sharing among mobile users', Proc. of IEEE Workshop on Mobile Computing Systems & Applications, December.
[14]
Ghemawat, S. and Gobiol, H. (2003) 'The Google file system', Proc. of the 19th ACM Symposium on Operating Systems Principles, October, Bolton Landing, NY, ACM.
[15]
Gifford, D. (1979) 'Weighted voting for replicated data', Proc. of the 7th ACM Symposium on Operating Systems Principles, Pacific Crove, CA.
[16]
Gwertzman, J. and Seltzer, M. (1995) 'The case for geographical push-caching', Presented at 5th Annual Workshop on Hot Operating Systems.
[17]
Helal, A.A., Hedaya, A.A. and Bhargava, B.B. (1996) Replication Techniques in Distributed Systems, Kluwer Academic Publishers, Boston.
[18]
Hou, C. and Shin, K. (1994) 'Replication and allocation of task modules in distributed real-time systems', Proc. 24th Int'l Symp. Fault Tolerant Computing (FTCS '94), Manu Vardhan and Dharmender Singh Kushwaha.
[19]
Jerian, C. et al. (1993) Availability in the Echo File System, SRC Research Report 112, Systems Research Center, Digital Co., Palo Alto, CA.
[20]
Kistler, J.J. and Satyanarayanan, M. (1992) 'Disconnected operation in the coda file system', ACM Transactions on Computer Systems, Vol. 10, No. 1, pp.3-25.
[21]
Lamehamedi, H., Shentu, Z., Szymanski, B. and Deelman, E. (2003) 'Simulation of dynamic data replication strategies in data grids', Proceedings of the 17th International Symposium on Parallel and Distributed Processing (IPDPS'03), Nice, France, IEEE Press, Los Alamitos, CA.
[22]
Lamehamedi, H., Szymanski, B., Shentu, Z. and Deelman, E. (2002) 'Data replication strategies in grid environments', Proceedings of the 5th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP'02), IEEE Press, Los Alamitos, CA.
[23]
Lee, B.D. and Weissman, J.B. (2001) 'Dynamic replica management in the service grid', Proceedings of the 10th IEEE International Symposium on High Performance Distributed Computing (HPDC10'), San Francisco, CA, IEEE Press, Los Alamitos, CA.
[24]
Li, Y. and Mascagni, M. (2003) 'Improving performance via computational replication on a large-scale computational grid', Proc. Third Int'l. Symp. Cluster Computing and the Grid (CCGrid '03).
[25]
Liskov, B., Ghemawat, S., Gruber, R., Johnson, P., Shrira, L. and Williams, M. (1991) 'Replication in the harp file system', Proceedings of 13th ACM Symposium on Operating Systems Principles, October, ACM Press, pp.226-238.
[26]
Lv, Q. et al. (2002) 'Search and replication in unstructured peer-to-peer networks', Proceedings of the 16th ACM International Conference on Supercomputing, New York, USA.
[27]
Milner, R. (1989) Communication and Concurrency, Prentice Hall, Upper Saddle River, NJ, USA.
[28]
Milner, R.A. (1980) 'Calculus for communicating system', Lecture Notes in Computer Science, Vol. 92, VI, 176p., Springer.
[29]
Morris, J.H. et al. (1986) 'Andrew: a distributed personal computing environment', Commun. ACM, pp.184-201.
[30]
Parrington, G.D., Shrivastava, S.K., Wheater, S.M. and Little, M.C. (1995) 'The design and implementation of Arjuna', USENIX Computing Systems Journal, Vol. 8, No. 2, pp.255-308, Pegasus, [online] http://www.bkent.net/Doc/usobpeg.html (accessed 31 July 2012).
[31]
Ranganathan, K., Iamnitchi, A. and Foster, I. (2002) 'Improving data availability through dynamic model driven replication in large peer-to-peer communities', Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02), Berlin, Germany, IEEE Press, Los Alamitos, CA.
[32]
Samar, A. and Stockinger, H. (2001) 'Grid data management pilot (GDMP): a tool for wide area replication', Proceedings of the IASTED International Conference on Applied Informatics (AI'01), Innsbruck, Austria, ACTA Press, Calgary, Canada.
[33]
Sato, H., Matsuoka, S. and Endo, T. (2009) 'File clustering based replication algorithm in a grid environment', 9th IEEE/ACM Int. Sym. on Cluster Computing and the Grid, pp.204-211.
[34]
Sidell, J., Aoki, P.M., Barr, S., Sah, A., Staelin, C., Stonebraker, M. and Yu, A. (1996) 'Data replication in Mariposa', Proceedings of 17th International Conference on Data Engineering, February 1996, New Orleans, USA, pp.485-494.
[35]
Siegel, A., Birman, K. and Marzullo, K. (1990) Deceit: A Flexible Distributed File System, Technical Report No. 89-1042, Department of Computer Science, Cornell Univ., November 1989 (also in Proceedings of USENIX Conference, Anaheim USA, pp.51-62).
[36]
Silva, D., Cirne, W. and Brasileiro, F. (2003) 'Trading cycles for information: using replication to schedule bag-of-tasks applications on computational grids', Proc. Int'l. Conf. Parallel and Distributed Computing (Euro-Par'03), pp.169-180.
[37]
Spector, A.Z. (1982) 'Performing remote operation efficiently on a local computer network', Communications of the ACM, Vol. 25, No. 4, pp.246-259.
[38]
Tang, X., Chi, H. and Chanson, S.T. (2007) 'Optimal replica placement under TTL-based consistency', IEEE Transactions on Parallel and Distributed Systems, Vol. 18, No. 3, pp.351-363.
[39]
Tatebe, O., Morita, Y., Matsuoka, S., Soda, N. and Sekiguchi, S. (2002) 'Grid data farm architecture for petascale data intensive computing', Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid'02), Berlin, Germany, IEEE Press, Los Alamitos, CA.
[40]
Triantafillou, P. and Neilson, C. (1997) 'Achieving strong consistency in a distributed file system', IEEE Trans. Softw. Eng., Vol. 23, pp.35-55.
[41]
Venugopal, S., Buyya, R. and Ramamohanarao, K. (2006) 'A taxonomy of data grids for distributed data sharing, management, and processing', Journal of ACM Computing Surveys (CSUR) Surveys, Vol. 38, No. 1, ACM New York, NY, USA.
[42]
Weil, S.A. et al. (2006a) 'Ceph: a scalable, high-performance distributed file system', Proc. of the 7th Symposium on Operating Systems Design and Implementation, November, USENIX, Seattle, WA.
[43]
Weil, S.A. et al. (2006b) 'CRUSH: controlled, scalable, decentralized placement of replicated data', Proc. of the 2006 ACM/IEEE Conference on Supercomputing, November, ACM, Tampa, FL.
[44]
Wolfson, O., Jajodia, S. and Huang, Y. (1997) 'An adaptive data replication algorithm', ACM Transactions on Database Systems, Vol. 22, No. 2, pp.255-314.
[45]
Zomaya, A.Y. (1995) Parallel and Distributed Handbook, pp.60-68, McGraw Hill Professionals, New York.
  1. TBFR: a threshold-based file replication approach for increased file availability and its formal verification

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image International Journal of Information and Communication Technology
      International Journal of Information and Communication Technology  Volume 5, Issue 2
      April 2013
      119 pages
      ISSN:1466-6642
      EISSN:1741-8070
      Issue’s Table of Contents

      Publisher

      Inderscience Publishers

      Geneva 15, Switzerland

      Publication History

      Published: 01 April 2013

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 0
        Total Downloads
      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 23 Jan 2025

      Other Metrics

      Citations

      View Options

      View options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media