research-article

On Scheduling Tasks with a Quick Recovery from Failure

Published: 01 May 1986

Abstract

Multiprocessors used in life-critical real-time systems must recover quickly from failure. Part of this recovery consists of switching to a new task schedule that ensures that hard deadlines for critical tasks continue to be met. We present a dynamic programming algorithm that allows backup, or contingency, schedules to be embedded efficiently within the original, "primary" schedule, so that hard deadlines continue to be met in the face of up to a given maximum number of processor failures. Several illustrative examples are included.

Published In

IEEE Transactions on Computers, Volume 35, Issue 5
May 1986
99 pages

Publisher

IEEE Computer Society

United States

Author Tags

  1. Fault-tolerant and real-time multiprocessors
  2. hard deadlines
  3. notification times
  4. primary and contingency schedules
  5. primary and ghost clones

Qualifiers

  • Research-article

Cited By

  • (2017) Classification of Resilience Techniques Against Functional Errors at Higher Abstraction Layers of Digital Systems. ACM Computing Surveys 50:4 (1-38). DOI: 10.1145/3092699. Online publication date: 4-Oct-2017
  • (2014) Fault-tolerant scheduling in homogeneous real-time systems. ACM Computing Surveys 46:4 (1-34). DOI: 10.1145/2534028. Online publication date: 1-Mar-2014
  • (2006) A probabilistic approach for fault tolerant multiprocessor real-time scheduling. Proceedings of the 20th international conference on Parallel and distributed processing (173-173). DOI: 10.5555/1898953.1899105. Online publication date: 25-Apr-2006
  • (2006) Dual-mode r-reliable task model for flexible scheduling in reliable real-time systems. Proceedings of the 2006 international conference on Embedded and Ubiquitous Computing (308-317). DOI: 10.1007/11802167_32. Online publication date: 1-Aug-2006
  • (2005) Scheduling fixed-priority hard real-time tasks in the presence of faults. Proceedings of the Second Latin-American conference on Dependable Computing (154-173). DOI: 10.1007/11572329_14. Online publication date: 25-Oct-2005
  • (2004) The Interplay of Power Management and Fault Recovery in Real-Time Systems. IEEE Transactions on Computers 53:2 (217-231). DOI: 10.1109/TC.2004.1261830. Online publication date: 1-Feb-2004
  • (2004) Efficient overloading techniques for primary-backup scheduling in real-time systems. Journal of Parallel and Distributed Computing 64:5 (629-648). DOI: 10.1016/j.jpdc.2004.03.015. Online publication date: 1-May-2004
  • (2004) Fault-tolerant scheduling for real-time embedded control systems. Journal of Computer Science and Technology 19:2 (191-202). DOI: 10.1007/BF02944797. Online publication date: 1-Mar-2004
  • (2003) A Nonpreemptive Real-Time Scheduler with Recovery from Transient Faults and Its Implementation. IEEE Transactions on Software Engineering 29:8 (752-767). DOI: 10.1109/TSE.2003.1223648. Online publication date: 1-Aug-2003
  • (2001) Analysis of Checkpointing for Real-Time Systems. Real-Time Systems 20:1 (83-102). DOI: 10.1023/A:1026589200419. Online publication date: 1-Jan-2001
