Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/29903.29921acmconferencesArticle/Chapter ViewAbstractPublication PagesmetricsConference Proceedingsconference-collections
Article
Free access

Performance analysis of a fault detection scheme in multiprocessor systems

Published: 01 May 1987 Publication History

Abstract

A technique is described for detecting and diagnosing faults at the processor level in a multiprocessor system. In this method, a process is assigned whenever possible to two processors: the processor that it would normally be assigned to (primary) and an additional processor which would otherwise be idle (secondary). Two strategies will be described and analyzed: one which is preemptive and another which is non-preemptive. It is shown that for moderately loaded systems, a sufficient percentage of processes can be performed redundantly using the system's spare capacity to provide a basis for fault detection and diagnosis with virtually no degradation of response time.

References

[1]
M. Malek, "A comparison connection assignment for diagnosis of multiprocessor systems," in Proc. of the 7th Syrup. on Comp. Arch., pp. 31-35, May 1980.
[2]
S.L. Hakimi and K.Y. Chwa, "Schemes for faulttolerant computing: a comparison of modularly redundant and t-diagnosable systems," Information and Control, vol. 49. pp. 212-238, June 1981.
[3]
A.T. Dahbura and G.M. Masson. "Greedy diagnosis as the basis of intermittent-fault/transientupset tolerant system design," IEEE Trans. Cornput., vol. C-32, no. 10, pp. 953-957, Oct. 1983.
[4]
A.T. Dahbura, K.K. Sabnani, and L.L. King. "The comparison approach to multiprocessor fault diagnosis," in Proc. 15th Int. Symp. on Fault-Toleran# Comput., IEEE Computer Society Publications, pp. 260-265, June 1985. Full version to appear, IEEE Trans. Comput.
[5]
L. Kleinrock (1975), Queuing Systems. Volume I: Theory. J. Wiley & Sons. New York.
[6]
W.H. Huggins and D.R. Entwistle (t968), Introductory Systems and Design. BLaisdell Publ. Co., Waltham, Mass.
[7]
M. Shooman (1968), Probabilistic Reliability: An Engineering Approach. McGraw Hill Book Co., New York.
[8]
F.S. Hiller and G.J. Lieberman (1974), Operations Research, Second Edition. I-Iolden-Day, Inc., San Francisco.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMETRICS '87: Proceedings of the 1987 ACM SIGMETRICS conference on Measurement and modeling of computer systems
August 1987
267 pages
ISBN:089791225X
DOI:10.1145/29903
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 May 1987

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

SIGMETRICS87
Sponsor:

Acceptance Rates

Overall Acceptance Rate 459 of 2,691 submissions, 17%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)11
Reflects downloads up to 22 Sep 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media