Abstract
This paper presents the implementation and analysis of a service-based, reconfigurable high-availability infrastructure for cluster systems on the Grid. Built on the notion of services, the infrastructure targets mission-critical Grid applications. The application high-availability service is responsible for the availability of registered applications. The Service Manager monitors all services via heartbeat and automatically recovers them from failure, so that the services themselves remain highly available. To improve flexibility and dependability, a dynamic reconfiguration mechanism allows configurations to be changed without interrupting running applications. A dependability modeling and simulation tool is used in the reconfiguration procedure: dependability parameters are entered into the tool, which builds the corresponding global model from its library of objects and sub-models and produces statistical results for different recovery strategies. An application’s configuration can then be adjusted based on these results.
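The heartbeat monitoring, automatic recovery, and runtime reconfiguration summarized above might be sketched as follows. This is a minimal illustration, not the paper's implementation: the class name `ServiceManager`, its methods, the single `timeout` parameter, and the assumption that a recovery callback restarts the service are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class ServiceManager:
    """Hypothetical monitor: tracks the last heartbeat of each service."""
    timeout: float  # seconds of silence after which a service counts as failed
    last_seen: Dict[str, float] = field(default_factory=dict)
    recover: Dict[str, Callable[[], None]] = field(default_factory=dict)

    def register(self, name: str, recover: Callable[[], None], now: float) -> None:
        # Registration counts as the first heartbeat.
        self.last_seen[name] = now
        self.recover[name] = recover

    def heartbeat(self, name: str, now: float) -> None:
        self.last_seen[name] = now

    def reconfigure(self, timeout: float) -> None:
        # Change the detection timeout while services stay registered.
        self.timeout = timeout

    def check(self, now: float) -> List[str]:
        # A service has failed if its last heartbeat is older than the timeout.
        failed = [n for n, t in self.last_seen.items() if now - t > self.timeout]
        for name in failed:
            self.recover[name]()         # run the registered recovery action
            self.last_seen[name] = now   # assumption: recovery restarts the service
        return failed
```

Timestamps are passed in explicitly rather than read from a clock, which keeps the sketch deterministic; a real monitor would call `check` periodically with the current time.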
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gao, W., Liu, X., Wang, L., Nanya, T. (2004). A Reconfigurable High Availability Infrastructure in Cluster for Grid. In: Li, M., Sun, XH., Deng, Qn., Ni, J. (eds) Grid and Cooperative Computing. GCC 2003. Lecture Notes in Computer Science, vol 3032. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24679-4_102
DOI: https://doi.org/10.1007/978-3-540-24679-4_102
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21988-0
Online ISBN: 978-3-540-24679-4
eBook Packages: Springer Book Archive