This paper presents ReVive, a novel general-purpose rollback recovery mechanism for shared-memory multiprocessors. ReVive carefully balances the conflicting requirements of availability, performance, and hardware cost. ReVive performs checkpointing, logging, and distributed parity protection, all memory-based. It enables recovery from a wide class of errors, including the permanent loss of an entire node. To maintain high performance, ReVive includes specialized hardware that performs frequent operations, such as log and parity updates, in the background. To keep the cost low, the more complex checkpointing and recovery functions are performed in software, while the hardware modifications are limited to the directory controllers of the machine. Our simulation results on a 16-processor system indicate that the average error-free execution time overhead of using ReVive is only 6.3%, while the achieved availability is better than 99.999% even when errors occur as often as once per day.
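The interplay of memory-based logging and distributed parity described above can be sketched in a few lines of Python. This is an illustrative toy only (all class and function names are ours, and the real mechanism is implemented in the directory-controller hardware): each write logs the pre-image of a location once per checkpoint interval and keeps an XOR parity word consistent across nodes, so a lost node's memory can be rebuilt from the survivors.

```python
class Node:
    """Toy node with memory-based undo logging and parity maintenance."""

    def __init__(self, size):
        self.mem = [0] * size
        self.log = {}                      # addr -> value at the last checkpoint

    def write(self, addr, value, parity):
        if addr not in self.log:           # log the pre-image on first overwrite
            self.log[addr] = self.mem[addr]
        parity[addr] ^= self.mem[addr] ^ value   # keep distributed parity consistent
        self.mem[addr] = value

    def checkpoint(self):
        self.log.clear()                   # memory itself becomes the checkpoint

    def rollback(self, parity):
        # Restore the pre-images and re-adjust parity to match.
        for addr, old in self.log.items():
            parity[addr] ^= self.mem[addr] ^ old
            self.mem[addr] = old
        self.log.clear()


def rebuild_lost_node(parity, survivors):
    """Recover a lost node's memory: XOR parity with all surviving memories."""
    rebuilt = list(parity)
    for node in survivors:
        for addr in range(len(parity)):
            rebuilt[addr] ^= node.mem[addr]
    return rebuilt
```

For example, after two nodes write to the same address, XOR-ing the parity with the surviving nodes' memories reproduces the lost node's contents, and a rollback restores both memory and parity to the checkpointed state.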
A method of generating rollback points that takes program structure into account is proposed. It is shown that the variables needed for program rollback are those that vary in a specific program implementation. A new method of placing rollback points is introduced and compared with traditional approaches.
Algorithms for recovering computations after hardware faults are considered. A modified linear recovery algorithm is proposed that finds the latest undamaged recovery point. The uniqueness of the recovery sequence is proven. The problem is solved without any restriction on the period between the appearance of a hardware fault and its manifestation.
The number of processors in high-performance computers is constantly growing, so their mean time to failure is decreasing significantly compared with the execution time of the message-passing parallel applications that run on them. When a compute node fails, the message-passing parallel application stops (fail-stop) and must be restarted, so all the information already processed is lost. Rollback recovery strategies are among the most widely used to protect the processed information. We propose the design and implementation of a fault-tolerance middleware that provides message-logging functionality transparently to the application and integrates with the RADIC architecture to work with message-passing parallel applications (MPI). The middleware is based on a hybrid message log: it logs both the messages sent and the messages received within each process and performs the necessary protection and recovery management, so that if a failure occurs, only the failed processes need to be re-executed, with their messages ready to be delivered to the application without involving the rest of the application's processes. The interaction of the MPI application with the implemented hybrid message log has been validated, and it has been verified that it works correctly and transparently in both protection and recovery.
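The core idea of a hybrid message log, where each process keeps local logs of both sent and received messages so a restarted process can replay its traffic without involving its peers, can be sketched as follows. This is a minimal illustration with invented names, not the RADIC middleware itself (which intercepts MPI calls transparently):

```python
class LoggedProcess:
    """Toy process that logs the messages it sends and receives."""

    def __init__(self, rank):
        self.rank = rank
        self.sent_log = []         # (dest_rank, payload), kept by the sender
        self.recv_log = []         # payloads in delivery order, kept locally
        self.inbox = []

    def send(self, dest, payload):
        self.sent_log.append((dest.rank, payload))   # log before sending
        dest.receive(payload)

    def receive(self, payload):
        self.recv_log.append(payload)                # log on delivery
        self.inbox.append(payload)

    def restart_from_log(self, saved_recv_log):
        # Re-execution after a fail-stop: logged messages are redelivered
        # in their original order, with no help from the other processes.
        self.recv_log, self.inbox = [], []
        for payload in saved_recv_log:
            self.receive(payload)
```

A failed process restarted via restart_from_log sees exactly the message sequence it saw before the failure, which is what allows recovery to proceed without rolling back or interrupting the other processes.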
Checkpointing with rollback recovery is a well-known method for achieving fault tolerance in distributed systems. In this work, we introduce algorithms for checkpointing and rollback recovery on asynchronous unidirectional and bidirectional ring networks. The proposed checkpointing algorithms can handle multiple concurrent initiations by different processes. While taking checkpoints, processes do not have to take any application message dependency into consideration; synchronization is achieved by passing control messages among the processes. Application messages are acknowledged, and each process maintains a list of unacknowledged messages. Here we use a logical checkpoint, which is a standard checkpoint (i.e., a snapshot of the process) plus a list of messages that have been sent by this process but are unacknowledged at the time of taking the checkpoint. The worst-case message complexity of the proposed checkpointing algorithm is O(kn) when k initiators initiate concurrently. The time complexity is O(n). For the recovery algorithm, the time and message complexities are both O(n).
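The notion of a logical checkpoint, a state snapshot plus the list of unacknowledged sent messages, can be illustrated with a short sketch (names and structure are ours, not the paper's actual data structures):

```python
import copy

class RingProcess:
    """Toy process that takes logical checkpoints."""

    def __init__(self, state):
        self.state = state
        self.unacked = {}          # msg_id -> payload, sent but not yet acked

    def send(self, msg_id, payload):
        self.unacked[msg_id] = payload   # tracked until acknowledged
        return payload                    # (transport layer omitted)

    def on_ack(self, msg_id):
        self.unacked.pop(msg_id, None)

    def logical_checkpoint(self):
        # Standard snapshot plus the unacknowledged-message list, so those
        # messages can be resent after a rollback instead of being lost.
        return (copy.deepcopy(self.state), dict(self.unacked))

    def rollback(self, checkpoint):
        saved_state, saved_unacked = checkpoint
        self.state = copy.deepcopy(saved_state)
        self.unacked = dict(saved_unacked)
```

After a rollback, the restored unacked list tells the process exactly which in-flight messages it must resend, which is why no application message dependencies need to be tracked at checkpoint time.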
This paper presents Sonora, a platform for mobile-cloud computing. Sonora is designed to support the development and execution of continuous mobile-cloud services. To this end, Sonora provides developers with stream-based programming interfaces that coherently integrate a broad range of existing techniques from mobile, database, and distributed systems, ranging from support for disconnected operation to relational and event-driven models. Sonora's execution engine is a fault-tolerant distributed runtime that supports user-facing continuous sensing and processing services in the cloud. Key features of this engine are its dynamic load-balancing mechanisms and a novel failure recovery protocol that performs checkpoint-based partial rollback recovery with selective re-execution. To illustrate the relevance and power of the stream abstraction in describing complex mobile-cloud services, we evaluate Sonora's design in the context of two services. We also validate Sonora's ...
A checkpoint is defined as a designated place in a program at which normal processing is interrupted specifically to preserve the status information necessary to allow resumption of processing at a later time. Checkpointing is the process of saving that status information. This paper surveys the algorithms that have been reported in the literature for checkpointing parallel/distributed systems.
If the variables used by a checkpointing algorithm have data faults, existing checkpointing and recovery algorithms may fail. In this paper, self-stabilizing data fault detection and correction, checkpointing, and recovery algorithms are proposed for a ring topology. The proposed data fault detection and correction algorithms can handle at most one data fault per process, but in any number of processes. The proposed checkpointing algorithm can deal with concurrent multiple initiations of checkpointing in the presence of data faults. A process can recover from a fault using the proposed recovery algorithm despite multiple data faults being present in the system. All the proposed algorithms converge in O(n) steps, where n is the number of processes, and can be extended to work for general topologies.
Recovery from transient failures is one of the prime issues in the context of distributed systems. These systems demand transparent yet efficient techniques to achieve it. A checkpoint is defined as a designated place in a program where normal processing of a system is interrupted to preserve the status information; checkpointing is the process of saving that status information. Mobile computing systems often suffer from high failure rates that are transient and independent in nature. To add reliability and high availability to such distributed systems, checkpoint-based rollback recovery is one of the most widely used techniques for applications such as scientific computing, databases, telecommunications, and mission-critical applications. This paper surveys the algorithms that have been reported in the literature for checkpointing in mobile computing systems.
A major concern in implementing a checkpoint-based recovery protocol for distributed systems is the performance degradation resulting from process rollbacks. In critical systems, it is highly desirable to contain the rollback distance as well as the number of processes involved in the rollback so that timely recovery is possible. One popular approach to accomplishing these goals is to control the communication of messages, which are the main cause of error propagation. In this paper, we show that watchdog-processor-based concurrent error detection can be merged with recovery so that quick recovery from errors is possible without restricting communication. The low cost and low latency of an m-out-of-n code-based error detection scheme are exploited to develop a novel message validation technique that helps curtail excessive rollback during recovery. A simulation analysis is conducted to demonstrate the benefits of combining detection and recovery, an approach ...
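An m-out-of-n code accepts a word as valid only if exactly m of its n bits are set, so any unidirectional error changes the bit count and is caught before the message is delivered. The sketch below illustrates the idea with 2-out-of-4 as an illustrative parameter choice (not necessarily the paper's); the validation gate shows how dropping an invalid message stops the error from propagating, so the receiver need not roll back:

```python
def is_valid_codeword(word, m=2, n=4):
    """A word is a valid m-out-of-n codeword iff exactly m of its n bits are set."""
    if word >> n:
        return False                      # stray bits outside the n-bit word
    return bin(word).count("1") == m


def deliver_if_valid(inbox, word):
    # Validate before delivery: an invalid (corrupted) message is dropped,
    # so the error never reaches the receiving process.
    if is_valid_codeword(word):
        inbox.append(word)
        return True
    return False
```

Because a single bit flip in either direction changes the number of set bits, every single-bit error is rejected by this check at the cost of one population count per message.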
Problems related to distributed systems fault tolerance are tackled by providing efficient and fault-tolerant procedures for checkpointing and rollback recovery in such systems. The authors propose checkpointing algorithms that can be initiated by any process in the system, or upon failure of one or more component processes as part of a backward recovery procedure. The algorithms return the most recent consistent checkpoints, require less stable storage, and do not interfere with the progress of the distributed application. Obtaining a consistent checkpoint is always guaranteed. Examples illustrating these algorithms are also provided.
In this paper we consider two software-based control-flow error (CFE) recovery methods with a rollback recovery mechanism for use in multithreaded architectures. Previous CFE recovery techniques disregard interactions between threads, which makes them unsuitable for multithreaded architectures. Furthermore, their high memory and performance overheads may be problematic for real-time embedded systems with tight memory and performance budgets. Therefore, given the importance of handling CFEs, the unsuitability of conventional techniques for modern processors, and the high overheads of previous CFE recovery techniques, two low-cost control-flow error recovery techniques, CFE Recovery using Data-flow graph Consideration (CRDC) and CFE Recovery using Macro block-level Checkpointing (CRMC), are presented in this paper. The proposed recovery techniques comprise two phases: control-flow error detection and control-flow error recovery. These phases are achieved by inserting additional instructions into the program at compile time according to a dependency graph. This graph models the control-flow and data dependencies among basic blocks and the interactions between the threads of a program. To evaluate the proposed techniques, five multithreaded benchmarks (Quick Sort, Matrix Multiplication, Bubble Sort, Linked List, and Fast Fourier Transform) were run on a multi-core processor, and a total of 5000 transient faults were injected at several executable points of each program. The fault injection experiments show that the proposed techniques achieve noticeable error recovery coverage with tolerable performance and memory overheads.
Checkpointing and rollback recovery is a very effective technique for tolerating transient faults and preventive shutdowns. In the past, most of the checkpointing schemes published in the literature were meant to be transparent to the application programmer and implemented at the operating-system level. In recent years, there has been some work on higher-level forms of checkpointing, in which the user is responsible for checkpoint placement and is required to specify the checkpoint contents. We compare the two approaches, system-level and user-defined checkpointing, discuss the pros and cons of each, and present an experimental study conducted on a commercial parallel machine.
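The user-defined approach can be made concrete with a small sketch (all names are ours, not from the paper's system): the programmer decides both where to checkpoint and which variables to save, so the checkpoint can be far smaller than the full memory image a system-level scheme would capture.

```python
import os
import pickle

def save_checkpoint(path, **variables):
    """User-defined checkpoint: persist only the named variables."""
    with open(path, "wb") as f:
        pickle.dump(variables, f)

def load_checkpoint(path):
    with open(path, "rb") as f:
        return pickle.load(f)

def run(path, steps=10, every=3):
    # Long-running loop: only `i` and `acc` are needed to resume, so only
    # they are checkpointed, at placement points the programmer chose.
    i, acc = 0, 0
    if os.path.exists(path):
        cp = load_checkpoint(path)        # resume after a failure
        i, acc = cp["i"], cp["acc"]
    while i < steps:
        acc += i
        i += 1
        if i % every == 0:
            save_checkpoint(path, i=i, acc=acc)
    return acc
```

Calling run a second time with the same path resumes from the last saved state and reaches the same result, which is the essence of the user-defined approach: small checkpoints at well-chosen program points, at the price of extra programmer effort.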
If the variables used for a checkpointing algorithm have data faults, the algorithm may fail. In this paper, a self-stabilizing checkpointing algorithm is proposed for handling data faults in a ring network. The proposed algorithm can deal with concurrent initiations of checkpointing and at most one data fault per process. However, several processes may be faulty.