Efficient transparent application recovery in client-server information systems

Published: 01 June 1998

Abstract

Database systems recover persistent data, providing high database availability. However, database applications, typically residing on client or “middle-tier” application-server machines, may lose work because of a server failure. This prevents the masking of server failures from the human user and substantially degrades application availability. This paper aims to enable high application availability with an integrated method for database server recovery and transparent application recovery in a client-server system. The approach, based on application message logging, is similar to earlier work on distributed system fault tolerance. However, we exploit advanced database logging and recovery techniques and request/reply messaging properties to significantly improve efficiency. Forced log I/Os, frequently required by other methods, are usually avoided. Restart time, for both failed server and failed client, is reduced by checkpointing and log truncation. Our method ensures that a server can recover independently of clients. A client may reduce logging overhead in return for dependency on server availability during client restart.
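As a rough illustration of three ideas named in the abstract (message logging, avoiding a forced log I/O per message, and checkpointing with log truncation), the following Python sketch models a log entirely in memory. It is not the paper's protocol: `MessageLog`, `replay`, and `run_demo` are hypothetical names, the "application state" is just a running total, and the points at which the log is forced are placeholder assumptions.

```python
from dataclasses import dataclass, field
from typing import Any, List, Optional, Tuple

Record = Tuple[int, str, Any]  # (log sequence number, kind, payload)


@dataclass
class MessageLog:
    """Toy in-memory model of a message log (hypothetical, for illustration)."""
    stable: List[Record] = field(default_factory=list)  # survives a crash
    buffer: List[Record] = field(default_factory=list)  # lost on a crash
    checkpoint_state: Optional[Any] = None
    forced_ios: int = 0
    next_lsn: int = 0

    def append(self, kind: str, payload: Any) -> int:
        """Buffer a record without forcing it to stable storage."""
        lsn = self.next_lsn
        self.next_lsn += 1
        self.buffer.append((lsn, kind, payload))
        return lsn

    def force(self) -> None:
        """Move all buffered records to stable storage in one simulated I/O."""
        if self.buffer:
            self.stable.extend(self.buffer)
            self.buffer.clear()
            self.forced_ios += 1

    def checkpoint(self, state: Any) -> None:
        """Save application state, then truncate the records it covers."""
        self.force()
        self.checkpoint_state = state
        self.stable.clear()

    def crash(self) -> None:
        """Simulate a failure: only unforced records are lost."""
        self.buffer.clear()

    def recover(self) -> Tuple[Optional[Any], List[Record]]:
        """Return the last checkpoint and the stable records written after it."""
        return self.checkpoint_state, list(self.stable)


def replay(state: Optional[int], records: List[Record]) -> int:
    """Rebuild the running total from a checkpoint plus logged replies."""
    total = state or 0
    for _lsn, kind, payload in records:
        if kind == "reply":
            total += payload
    return total


def run_demo() -> Tuple[int, int, int]:
    log = MessageLog()
    total = 0
    for value in (1, 2, 3):
        log.append("request", value)
        log.append("reply", value)
        total += value
    log.force()            # one forced I/O covers all six records
    log.checkpoint(total)  # restart no longer needs those six records
    log.append("request", 4)
    log.append("reply", 4)
    log.force()
    log.append("request", 5)  # never forced, so lost in the crash below
    log.append("reply", 5)
    log.crash()
    state, records = log.recover()
    return replay(state, records), log.forced_ios, len(records)
```

In this toy run, recovery replays only the two records written after the checkpoint, and ten appended records cost two simulated forced I/Os. The unforced exchange for the value 5 is simply lost here; how a real system handles such messages is exactly what the paper addresses and is outside the scope of this sketch.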

Published In

ACM SIGMOD Record, Volume 27, Issue 2
June 1998
595 pages
ISSN:0163-5808
DOI:10.1145/276305
  • SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data
    June 1998
    599 pages
    ISBN:0897919955
    DOI:10.1145/276304
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Article

Cited By

  • (2021) Enhancing Branch Office Network Availability Using Cloud EoIP Gateway. Procedia Computer Science 179 (574-581). DOI:10.1016/j.procs.2021.01.042. Online publication date: 2021
  • (2012) Automated verification and testing of user-interactive undo features in database applications. Software Testing, Verification & Reliability 22:4 (245-265). DOI:10.1002/stvr.439. Online publication date: 1-Jun-2012
  • (2011) Log-based middleware server recovery with transaction support. The VLDB Journal (The International Journal on Very Large Data Bases) 20:3 (347-370). DOI:10.1007/s00778-010-0199-1. Online publication date: 1-Jun-2011
  • (2009) Bibliography. Principles of Transaction Processing (365-369). DOI:10.1016/B978-1-55860-623-4.00019-6. Online publication date: 2009
  • (2007) Ensuring e-Transaction with Asynchronous and Uncoordinated Application Server Replicas. IEEE Transactions on Parallel and Distributed Systems 18:3 (364-378). DOI:10.1109/TPDS.2007.42. Online publication date: 1-Mar-2007
  • (2005) Design and Analysis of an e-Transaction Protocol Tailored for OCC. Proceedings of the 2005 Symposium on Applications and the Internet (125-131). DOI:10.1109/SAINT.2005.19. Online publication date: 31-Jan-2005
  • (2004) Recovery in the mobile wireless environment using mobile agents. IEEE Transactions on Mobile Computing 3:2 (180-191). DOI:10.1109/TMC.2004.130. Online publication date: 1-Apr-2004
  • (2002) e-Transactions. IEEE Transactions on Software Engineering 28:4 (378-395). DOI:10.1109/TSE.2002.995430. Online publication date: 1-Apr-2002
  • (2001) Implementing E-Transactions with Asynchronous Replication. IEEE Transactions on Parallel and Distributed Systems 12:2 (133-146). DOI:10.1109/71.910869. Online publication date: 1-Feb-2001
  • (2022) The Effects of Soft Errors and Mitigation Strategies for Virtualization Servers. IEEE Transactions on Cloud Computing 10:2 (1065-1081). DOI:10.1109/TCC.2020.2973146. Online publication date: 1-Apr-2022
