Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

On the Execution of Large Batch Programs in Unreliable Computing Systems

Published: 01 July 1984 Publication History

Abstract

The execution of long-running batch programs imposes severe reliability constraints on a computing system since the occurrence of a failure during its execution is more likely and that once occurred, a failure would destroy all the processing perfonned thus far. This paper studies the execution delay and machine resources consumed in supporting the running of large batch programs in a computing environment interrupted by failures. The effect of checkpoints and their optimal insertion are also considered. The results are applicable to arbitrary law of failure.

Cited By

View all
  • (2013)Optimization of cloud task processing with checkpoint-restart mechanismProceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis10.1145/2503210.2503217(1-12)Online publication date: 17-Nov-2013
  • (2001)A Variational Calculus Approach to Optimal Checkpoint PlacementIEEE Transactions on Computers10.1109/12.93623650:7(699-708)Online publication date: 1-Jul-2001
  • (1997)Impact of Checkpoint Latency on Overhead Ratio of a Checkpointing SchemeIEEE Transactions on Computers10.1109/12.60928146:8(942-947)Online publication date: 1-Aug-1997
  • Show More Cited By

Index Terms

  1. On the Execution of Large Batch Programs in Unreliable Computing Systems
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image IEEE Transactions on Software Engineering
    IEEE Transactions on Software Engineering  Volume 10, Issue 4
    July 1984
    152 pages

    Publisher

    IEEE Press

    Publication History

    Published: 01 July 1984

    Author Tags

    1. Batch program
    2. checkpoint
    3. failure law

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2013)Optimization of cloud task processing with checkpoint-restart mechanismProceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis10.1145/2503210.2503217(1-12)Online publication date: 17-Nov-2013
    • (2001)A Variational Calculus Approach to Optimal Checkpoint PlacementIEEE Transactions on Computers10.1109/12.93623650:7(699-708)Online publication date: 1-Jul-2001
    • (1997)Impact of Checkpoint Latency on Overhead Ratio of a Checkpointing SchemeIEEE Transactions on Computers10.1109/12.60928146:8(942-947)Online publication date: 1-Aug-1997
    • (1996)Minimizing completion time of a program by checkpointing and rejuvenationProceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems10.1145/233013.233050(252-261)Online publication date: 15-May-1996
    • (1996)Minimizing completion time of a program by checkpointing and rejuvenationACM SIGMETRICS Performance Evaluation Review10.1145/233008.23305024:1(252-261)Online publication date: 15-May-1996
    • (1991)On the Optimal Total Processing Time Using CheckpointsIEEE Transactions on Software Engineering10.1109/32.9044617:5(436-442)Online publication date: 1-May-1991
    • (1987)Optimal checkpointing of real-time tasksIEEE Transactions on Computers10.1109/TC.1987.500947236:11(1328-1341)Online publication date: 1-Nov-1987

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media