Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A Taxonomy of Reconfiguration Techniques for Fault-Tolerant Processor Arrays

Published: 01 January 1990 Publication History
  • Get Citation Alerts
  • Abstract

    Focuses on the characterization and classification of reconfiguration techniques. The techniques are differentiated according to the type of redundancy (time or hardware), allocation of redundancy (local or global), replacement unit, (processor or a set of processors), switching domain (global or local), and switching implementation (switching element, bus, or network). Typical techniques from four major classes-set switching, processor switching, local redundancy, and time redundancy-are reviewed. The proposed taxonomy can be used as a guide for future research in design and analysis of reconfiguration schemes.

    References

    [1]
    1. Uhr, L., Computer Arrays and Networks: Algorithm-Structured Parallel Architectures , Academic Press, New York, 1982.
    [2]
    2. Preston, K., Jr., and M.J.B. Duff, Modern Cellular Automata Theory and Applications , Plenum, New York, 1984.
    [3]
    3. Computer (special issue on systolic arrays), Vol. 20, No.7, July 1987.
    [4]
    4. Mead, C., and L. Conway, Introduction to VLSI Systems, Section 8.3, "Algorithms for VLSI Processor Arrays," Addison-Wesley, Reading, Mass., 1980, pp. 271-292.
    [5]
    1. J.A. Abraham et al., "Fault Tolerance Techniques for Systolic Arrays," Computer, Vol. 20, No. 7, July 1987, pp. 65-75.
    [6]
    2. H.T. Kung and M. Lam, "Wafer-Scale Integration and Two-Level Pipelined Implementations of Systolic Arrays," J. Parallel and Distributed Computing, Vol. 1, Issue 1, 1984, pp. 32-63.
    [7]
    3. I. Koren, "A Reconfigurable and Fault-Tolerant VLSI Multiprocessor Array," Proc. 8th Symp. Computer Architecture, CS Press, Los Alamitos, Calif., Order No. 346 (microfiche), 1981, pp. 425-442.
    [8]
    4. S.-Y. Kuo and W.K. Fuchs, "Efficient Spare Allocation for Reconfigurable Arrays," IEEE Design & Test, Feb. 1987, pp. 24-31.
    [9]
    5. A.L. Rosenberg, "The Diogenes Approach to Testable Fault-Tolerant Arrays of Processors," IEEE Trans. Computers, Vol. C-32, Oct. 1983, pp. 902-910.
    [10]
    6. R. Negrini, M. Sami, and R. Stefanelli, "Fault-Tolerance Techniques for Array Structures Used in Supercomputing," Computer , Vol. 19, No. 2, Feb. 1986, pp. 78-87.
    [11]
    7. M. Chean and J.A.B. Fortes, "FUSS: A Reconfiguration Scheme for Fault-Tolerant Processor Arrays," Int'l Workshop Hardware Fault Tolerance in Multiprocessors , June 1989, pp. 30-32.
    [12]
    8. T. Leighton and C.E. Leiserson, "Wafer-Scale Integration of Systolic Arrays," IEEE Trans. Computers, Vol. C-34, May 1985, pp. 448-461.
    [13]
    9. A.D. Singh, "Interstitial Redundancy: An Area Efficient Fault-Tolerance Scheme for Large Area VLSI Processor Arrays," IEEE Trans. Computers, Vol. C-37, No. 11, Nov. 1988, pp. 1,398-1,410.
    [14]
    10. K.S. Hedlund and L. Snyder, "Wafer-Scale Integration of Configurable, Highly Parallel (CHiP) Architectures and Processors" (extended abstract), Proc. 1982 Int'l Conf. Parallel Processing, CS Press, Los Alamitos, Calif., Order No. 421 (microfiche), 1982, pp. 262-264.
    [15]
    11. Y.-X. Wang and J.A.B. Fortes, "On the Analysis and Design of Hierarchical Fault-Tolerant Processor Arrays," Int'l Workshop Defect and Fault Tolerance in VLSI Systems , Oct. 1988, pp. 347-355.
    [16]
    12. M.G. Sami and R. Stefanelli, "Fault Tolerance of VLSI Processing Arrays: The Time-Redundancy Approach," Real-Time Systems Symp., Dec. 1984, pp. 200-207.
    [17]
    13. J.A.B. Fortes and C.S. Raghavendra, "Gracefully Degradable Processor Arrays," IEEE Trans. Computers, Vol. C-34, Nov. 1985, pp. 1,033-1,044.
    [18]
    Batcher, K.E., "Design of a Massively Parallel Processor," IEEE Trans. Computers, Vol. C-29, Sept. 1980, pp. 836-840.
    [19]
    McCanny, J.V., and J.G. McWhirter, "Yield Enhancement of Bit-Level Systolic Array Chips Using Fault-Tolerant Techniques," Electronics Letters , Vol. 19, July 1983, pp. 525-527.
    [20]
    Ishikawa, T., et al., "Hierarchical Array Processor (HAP) Featuring High Reliability and High System Performance," 1986 Int'l Conf. Parallel Processing, CS Press, Los Alamitos, Calif., Order No. 724 (microfiche), Aug. 1986, pp. 293- 300.
    [21]
    Kim, J.H., and S.M. Reddy, "On Easily Testable and Reconfigurable Two-Dimensional Systolic Arrays," Int'l Conf. Parallel Processing, CS Press, Los Alamitos, Calif., Order No. 783, Aug. 1987, pp. 101-109.
    [22]
    Hedlund, K.S., "WASP -- A Wafer-Scale Systolic Processor," IEEE Int'l Conf. Computer Design , CS Press, Los Alamitos, Calif., Order No. 642 (microfiche), Oct. 1985, pp. 665-671.
    [23]
    Ciminiera, L., C. Demartini, and A. Valenzano, "Defect-Tolerant Array Structures for VLSI and WSI Architectures," Proc. 20th Hawaii Int'l Conf. System Science, 1987, pp. 21-30.
    [24]
    Jervis, L., F. Lombardi, and D. Sciuto, "Orthogonal Mapping: A Reconfiguration Strategy for Fault-Tolerant VLSI/WSI 2-D Arrays," Int'l Workshop Defect and Fault Tolerance in VLSI Systems , Oct. 1988, pp. 309-318.
    [25]
    Lombardi, F., R. Negrini, and R. Stefanelli, "Reconfiguration of VLSI Arrays: A Covering Approach," 17th Int'l Symp. Fault-Tolerant Computing Systems, CS Press, Los Alamitos, Calif., Order No. 778 (microfiche), 1987, pp. 251-256.
    [26]
    Distante, F., M.G. Sami, and R. Stefanelli, "A General Index Mapping Technique for Array Reconfiguration," Int'l Symp. Circuits and Systems , June 1988, pp. 559-563.
    [27]
    Youn, H.Y., and A.D. Singh, "A Highly Efficient Design for Reconfiguring the Processor Array in VLSI," Int't Conf. Parallel Processing, CS Press, Los Atamitos, Calif., Order No. 889 (microfiche), Aug. 1988, pp. 375-382.
    [28]
    Jesshope, C., and L. Bentley, "Techniques for Implementing Two-Dimensional Wafer-Scale Processor Arrays," IEE Proc., Vol. 134, Pt. E, Mar. 1987, pp. 87-92.
    [29]
    Reeves, A.P., "Fault Tolerance in Highly Parallel Mesh-Connected Processors," in Computing Structure for Image Processing, M.J.B. Duff, ed., Academic Press, London, 1983, pp. 77-94.
    [30]
    Hwang, J.H., and C.S. Raghavendra, "VLSI Implementation of Fault-Tolerant Systolic Arrays," IEEE Int'l Conf. Computer Design, CS Press, Los Alamitos, Calif., Order No. 735, 1986, pp. 110-113.
    [31]
    Kung, S.Y., S.N. Jean, and C.W. Chang, "Fault Tolerant Array Processors Using Single-Track Switches," IEEE Trans. Computers, Vol. 38, No. 4, Apr. 1989, pp. 501-514.
    [32]
    Lombardi, F., and D. Sciuto, "Reconfiguration in WSI Arrays Using Minimum Spanning Trees," CompEuro 87, CS Press, Los Alamitos, Calif., Order No. 773, May 1987, pp. 547-550.
    [33]
    Negrini, R., and R. Stefanelli, "Comparative Evaluation of Space- and Time-Redundancy Approaches for WSI Processing Arrays," in Wafer-Scale Integration, G. Saucier and J. Trilhe, eds., Elsevier Science Publishers B.V. (North Holland), 1986, pp. 207-222.
    [34]
    Fortes, J.A.B., "Algorithm Reconfiguration Techniques for Gracefully Degradable Processor Arrays," in Systolic Arrays, W. Moore, A. McCabe, and R. Urquhart, eds., Adam Hilger, 1986, pp. 259-268.
    [35]
    Siewiorek, D.P., and R.S. Swarz, The Theory and Practice of Reliable System Design, Digital Press, Bedford, Mass., 1982.
    [36]
    Johnson, B.W., Design and Analysis of Fault-Tolerant Digital Systems, Addison-Wesley, Reading, Mass., 1989.
    [37]
    Negrini, R., and R. Stefanelli, "Time Redundancy in WSI Arrays of Processing Elements," Proc. Int'l Conf. Supercomputing Systems, CS Press, Los Alamitos, Calif., Order No. 654 (microfiche), 1985, pp. 429-438.

    Cited By

    View all
    • (2017)Classification of Resilience Techniques Against Functional Errors at Higher Abstraction Layers of Digital SystemsACM Computing Surveys10.1145/309269950:4(1-38)Online publication date: 4-Oct-2017
    • (2015)Computing Swarms for Self-Adaptiveness and Self-Organization in Floating-Point Array ProcessingACM Transactions on Autonomous and Adaptive Systems10.1145/274634610:3(1-34)Online publication date: 1-Sep-2015
    • (2015)Auto-localized multimedia platform based on a modular Cyber Physical System aligned in a two-dimensional gridCluster Computing10.1007/s10586-015-0479-z18:4(1449-1464)Online publication date: 1-Dec-2015
    • Show More Cited By

    Recommendations

    Reviews

    Yi-Bing Lin

    Chean and Fortes introduce a taxonomy of reconfiguration techniques for fault-tolerant processor arrays and discuss their distinguishing characteristics. They do not exhaustively survey previously proposed reconfiguration schemes. Instead, they review typical techniques selected from some classes and illustrate the main characteristics of different classes of schemes. The authors show the classes of reconfiguration schemes hierarchically in two figures. Following these classification trees, one can easily read through the paper. The many types of schemes determined by the taxonomy are grouped into four major classes: the set-switching class, the processor-switching class, the local-redundancy class, and the time-redundancy class. A scheme in the set-switching class is characterized by its replacement unit. The authors give a general description of this class and focus on a specific scheme proposed by Kuo and Fuchs [1]. In processor-switching schemes, an available cell directly or indirectly replaces a faulty cell. This class comprises two approaches. In the first approach, functional cells are systematically collected to form a part of the target array. In the second, faulty cells are replaced by their neighbors in a chain fashion until spare cells are reached. In the local-redundancy class, an array is partitioned or systematically reduced to smaller subarrays, each of which can be reconfigured independently. In the time-redundancy approach, the operational cells are assigned the computations that faulty cells would perform if they were operational. This well-written survey can be a valuable starting point for studying the reconfiguration techniques for fault-tolerant processor arrays.

    Access critical reviews of Computing literature here

    Become a reviewer for Computing Reviews.

    Comments

    Information & Contributors

    Information

    Published In

    cover image Computer
    Computer  Volume 23, Issue 1
    January 1990
    103 pages
    ISSN:0018-9162
    Issue’s Table of Contents

    Publisher

    IEEE Computer Society Press

    Washington, DC, United States

    Publication History

    Published: 01 January 1990

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 29 Jul 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2017)Classification of Resilience Techniques Against Functional Errors at Higher Abstraction Layers of Digital SystemsACM Computing Surveys10.1145/309269950:4(1-38)Online publication date: 4-Oct-2017
    • (2015)Computing Swarms for Self-Adaptiveness and Self-Organization in Floating-Point Array ProcessingACM Transactions on Autonomous and Adaptive Systems10.1145/274634610:3(1-34)Online publication date: 1-Sep-2015
    • (2015)Auto-localized multimedia platform based on a modular Cyber Physical System aligned in a two-dimensional gridCluster Computing10.1007/s10586-015-0479-z18:4(1449-1464)Online publication date: 1-Dec-2015
    • (2013)The survivability of design-specific spare placement in FPGA architectures with high defect ratesACM Transactions on Design Automation of Electronic Systems10.1145/2442087.244210418:2(1-22)Online publication date: 11-Apr-2013
    • (2010)Self organization on a swarm computing fabricProceedings of the 7th ACM international conference on Computing frontiers10.1145/1787275.1787343(327-336)Online publication date: 17-May-2010
    • (2007)Sliding algorithm for reconfigurable arrays of processorsProceedings of the 7th international conference on Evolvable systems: from biology to hardware10.5555/1792161.1792184(198-209)Online publication date: 21-Sep-2007
    • (2007)An improved replacement algorithm in fault-tolerant meshesProceedings of the 2007 Summer Computer Simulation Conference10.5555/1357910.1357980(443-448)Online publication date: 16-Jul-2007
    • (2003)Enhanced Cluster k-Ary n-Cube, A Fault-Tolerant MultiprocessorIEEE Transactions on Computers10.1109/TC.2003.124494252:11(1443-1453)Online publication date: 1-Nov-2003
    • (2001)An Efficient Exact Algorithm for Constraint Bipartite Vertex CoverJournal of Algorithms10.1006/jagm.2000.114138:2(374-410)Online publication date: 1-Feb-2001
    • (2000)Fault-Tolerant Processor Arrays Using Additional Bypass Linking Allocated by Graph-Node ColoringIEEE Transactions on Computers10.1109/12.85953849:5(431-442)Online publication date: 1-May-2000
    • Show More Cited By

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media