Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A Fault-Tolerant Systolic Sorter

Published: 01 May 1988 Publication History

Abstract

A fault-tolerant systolic sorter design is proposed. An algorithm-based fault tolerance is achieved by testing the invariants of a systolic sorter during normal operation. Transient and permanent computation errors can be detected by using error-checking code and some redundant cells. A block with a single faulty cell can be located. Small hardware overhead and negligible time overhead are shown to be the major advantages of the method. A hierarchical structure is suggested as an efficient architecture for realizing the method. An offline fault-testing method for permanent stuck-at faults is presented.

References

[1]
{1} C. E. Leiserson, "Systolic priority queues," in Proc. Caltech Conf. VLSI, Jan. 1979, pp. 199-224.
[2]
{2} H. Yasuura, N. Takagi, and S. Yajima, "The parallel enumeration sorting scheme for VLSI," IEEE Trans. Comput., vol. C-31, pp. 1192-1201, Dec. 1982.
[3]
{3} T. C. Chen, V. Y. Lum, and C. Tung, "The rebound sorter: An efficient sort engine for large files," in Proc. Fourth Int. Conf. Very Large Data Bases, 1978, pp. 312-318.
[4]
{4} N. Takagi and C. K. Wong, "A hardware sort-merge system," IBM J. Res. Develop., vol. 29, pp. 49-67, Jan. 1985.
[5]
{5} K. -H. Huang and J. A. Abraham, "Algorithm-based fault tolerance for matrix operations," IEEE Trans. Comput., vol. C-33, pp. 297- 311, June 1984.
[6]
{6} F. T. Leighton and C. E. Leiserson, "Wafer-scale integration of svstolic arrays," in Proc. 23rd Annu. Symp. Foundations Comput. Sci., 1982, pp. 297-311.
[7]
{7} J. F. Wakerly, Error Detecting Codes, Self-Checking Circuits and Application. New York: North-Halland, 1978.
[8]
{8} W. W. Peterson and E. J. Weldon Jr., Error-Correcting Codes, 2nd ed. Cambridge, MA: MIT Press, 1972.
[9]
{9} D. W. Andrew and G. D. Schultz, "A token-ring architecture for local-area networks: An update," in Proc. COMPCON, Fall 1982, pp. 615-624.
[10]
{10} J. J. Wolf, M. T. Liu, B. W. Weide, and D. P. Tsay, "Design of a distributed fault-tolerant loop network," Proc. Int. Symp. Fault Tolerant Comput., June 1979, pp. 17-24.

Cited By

View all
  • (1999)Safety and Reliability Driven Task Allocation in Distributed SystemsIEEE Transactions on Parallel and Distributed Systems10.1109/71.75582410:3(238-251)Online publication date: 1-Mar-1999
  • (1996)Algorithm-Based Fault Tolerant Synthesis for Linear OperationsIEEE Transactions on Computers10.1109/12.49410045:4(425-438)Online publication date: 1-Apr-1996
  • (1994)Design of Algorithm-Based Fault-Tolerant Multiprocessor Systems for Concurrent Error Detection and Fault DiagnosisIEEE Transactions on Parallel and Distributed Systems10.1109/71.3131255:10(1099-1106)Online publication date: 1-Oct-1994
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image IEEE Transactions on Computers
IEEE Transactions on Computers  Volume 37, Issue 5
May 1988
133 pages

Publisher

IEEE Computer Society

United States

Publication History

Published: 01 May 1988

Author Tags

  1. VLSI sorter
  2. algorithm-based fault tolerance
  3. automatic testing
  4. cellular arrays
  5. digital integrated circuits
  6. error detection codes
  7. error-checking code
  8. fault tolerant computing
  9. fault-tolerant systolic sorter
  10. hardware overhead
  11. integrated circuit testing
  12. invariants
  13. offline fault-testing
  14. permanent computation errors
  15. permanent stuck-at faults
  16. reconfiguration
  17. redundancy
  18. redundant cells
  19. single faulty cell
  20. sorting.
  21. testing
  22. time overhead

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 06 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (1999)Safety and Reliability Driven Task Allocation in Distributed SystemsIEEE Transactions on Parallel and Distributed Systems10.1109/71.75582410:3(238-251)Online publication date: 1-Mar-1999
  • (1996)Algorithm-Based Fault Tolerant Synthesis for Linear OperationsIEEE Transactions on Computers10.1109/12.49410045:4(425-438)Online publication date: 1-Apr-1996
  • (1994)Design of Algorithm-Based Fault-Tolerant Multiprocessor Systems for Concurrent Error Detection and Fault DiagnosisIEEE Transactions on Parallel and Distributed Systems10.1109/71.3131255:10(1099-1106)Online publication date: 1-Oct-1994
  • (1994)Partitioned Encoding Schemes for Algorithm-Based Fault Tolerance in Massively Parallel SystemsIEEE Transactions on Parallel and Distributed Systems10.1109/71.2856105:6(649-653)Online publication date: 1-Jun-1994
  • (1994)Almost Certain Fault Diagnosis Through Algorithm-Based Fault ToleranceIEEE Transactions on Parallel and Distributed Systems10.1109/71.2825635:5(532-539)Online publication date: 1-May-1994
  • (1994)Fault Tolerance in a Class of Sorting NetworksIEEE Transactions on Computers10.1109/12.29326143:7(827-837)Online publication date: 1-Jul-1994
  • (1994)Scheduling for Periodic Concurrent Error Detection in Processor ArraysJournal of Parallel and Distributed Computing10.1006/jpdc.1994.114223:3(306-313)Online publication date: 1-Dec-1994
  • (1993)Synthesis of Algorithm-Based Fault-Tolerant Systems from Dependence GraphsIEEE Transactions on Parallel and Distributed Systems10.1109/71.2386224:8(864-874)Online publication date: 1-Aug-1993
  • (1993)Diagnosability and Diagnosis of Algorithm-Based Fault-Tolerant SystemsIEEE Transactions on Computers10.1109/12.23848342:8(924-937)Online publication date: 1-Aug-1993

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media