Abstract
Disaster recovery solutions have gained popularity in the past few years because of their ability to tolerate disasters and to achieve the reliability and availability. Data replication is one of the most key disaster recovery solutions. While there are a number of mechanisms to restore data after disasters, the efficiency of the recovery process is not ideal yet. Providing the efficiency guarantee in replication systems is important and complex because the services must not be interrupted and the availability and continuity of businesses must be kept after disasters. To recover the data efficiently, we (1) present a fast disaster recovery mechanism, (2) implement it in a volume replication system, and (3) report an evaluation for the recovery efficiency of the volume replication system. It’s proved that our disaster recovery mechanism can recover the data at the primary system as fast as possible and achieve the ideal recovery efficiency. Fast disaster recovery mechanism can also be applicable to other kinds of replication systems to recover the data in the event of disasters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Smith, D.M.: The cost of lost data. Journal of Contemporary Business Practice 6(3) (2003)
Chervenak, A., Vellanki, V., Kurmas, Z.: Protecting file systems: A survey of backup techniques. In: Proc. of Joint NASA and IEEE Mass Storage Conference, IEEE Computer Society Press, Los Alamitos (1998)
Yang, Z., Gong, Y., Sang, W., et al.: A Primary-Backup Lazy Replication System for Disaster Tolerance (in Chinese). Journal Of Computer Research And Development, 1104–1109 (2003)
Cougias, D., Heiberger, E., Koop, K.: The backupbook: disaster recovery from desktop to data center. Schaser-Vartan Books, Lecanto, FL (2003)
Marcus, E., Stern, H.: Blueprints for high availability. Wiley Publishing, Indianapolis, IN (2003)
Cegiela, R.: Selecting Technology for Disaster Recovery. In: DepCos-RELCOMEX 2006. Proc. of the International Conference on Dependability of Computer Systems (2006)
Keeton, K., Santos, C., Beyer, D., et al.: Designing for Disasters. In: FAST 2004. Proc. of the 3rd USENIX Conf on File and Storage Technologies, pp. 59–72 (2004)
Nayak, T.K., Sharma, U.: Automated Management of Disaster Recovery Systems Using Adaptive Scheduling. In: Proc. of the 10th IFIP/IEEE International Symposium on Integrated Network Management, pp. 542–545 (2007)
Keeton, K., Beyer, D., Brau, E., et al.: On the road to recovery: restoring data after disasters. In: EuroSys. Proc. of the 1st ACM European Systems Conference, pp. 235–248. ACM Press, New York (2006)
Azagury, A.C., Factor, M.E., Micka, W.F.: Advanced Functions for Storage Subsystems: Supporting Continuous Availability. IBM SYSTEM Journal 42 (2003)
Using EMC SnapView and MirrorView for Remote Backup. Engineering White Paper, EMC Corporation (2002)
Software Solutions Guide for Enterprise Storage. White Paper, Hitachi Data Systems Corporation (2000)
VERITAS Volume Replicator 3.5: Administrator’s Guide (Solaris). White Paper, Veritas Software Corp. (2002)
Ji, M., Veitch, A., Wilkes, J.: Seneca: remote mirroring done write. In: USENIX 2003. Proc. of the 2003 USENIX Technical Conference, pp. 253–268 (2003)
Patterson, H., Manley, S., Federwisch, M., Hitz, D., Kleiman, S., Owara, S.: SnapMirror: File-System-Based Asynchronous Mirroring for Disaster Recovery. In: Proc. of the First USENIX conference on File and Storage Technologies (2002)
Wang, Y., Li, Z., Lin, W.: RWAR: A Resilient Window-consistent Asynchronous Replication Protocol. In: ARES 2007. Proc. of the 2nd International Conference on Availability, Reliability and Security, pp. 499–505 (2007)
Hwang, K., Chow, E., Wang, C.-L., et al.: Fault-Tolerant Clusters of Workstations with Single System Image. In: Cluster computing (1998)
McKinty, S.: Combining Clusters for Business Continuity. In: Cluster 2006. Proc. of the 2006 IEEE International Conference on Cluster Computing (2006)
Redhat (2005), http://sources.redhat.com/lvm2/
IOzone (2006), http://www.iozone.org/
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, Y., Li, Z., Lin, W. (2007). A Fast Disaster Recovery Mechanism for Volume Replication Systems. In: Perrott, R., Chapman, B.M., Subhlok, J., de Mello, R.F., Yang, L.T. (eds) High Performance Computing and Communications. HPCC 2007. Lecture Notes in Computer Science, vol 4782. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75444-2_68
Download citation
DOI: https://doi.org/10.1007/978-3-540-75444-2_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75443-5
Online ISBN: 978-3-540-75444-2
eBook Packages: Computer ScienceComputer Science (R0)