Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article
Free access

OMP: a RISC-based multiprocessor using orthogonal-access memories and multiple spanning buses

Published: 01 June 1990 Publication History
  • Get Citation Alerts
  • Abstract

    This paper presents the architectural design and RISC based implementation of a prototype supercomputer, namely the Orthogonal MultiProcessor (OMP). The OMP system is constructed with 16 Intel 1860 RISC microprocessors and 256 parallel memory modules, which are 2-D interleaved and orthogonally accessed using custom-designed spanning buses. The architectural design has been validated by a CSIM-based multiprocessor simulator. The design choices are based on worst-case delay analysis and simulation validation. The current OMP prototype chooses a 2-dimensional memory architecture, mainly for image processing, computer vision, and neural network simulation applications. The 16-processor OMP prototype is targeted to achieve a peak performance of 400 RISC integer MIPS or a maximum of 640 Mflops. This paper presents the architectural design of the OMP prototype at system and PC board levels. We are presently entering the fabrication stage of all the PC boards. The system is expected to become operational in late 1991 and benchmarking results will be available in 1992. Only hardware design features are reported here. Software and simulation results are reported elsewhere.

    References

    [1]
    American National Standards Institute, ANSI X3.131. Small Computer System In~erface (SCSI). New York, 1986.
    [2]
    V. Balan. T~ojan-C Langua8e and Its P~ogramming Environment. Technical Report, Dept. of Electrical Engineering-Systems, Univ. of Southern California, Los Angeles, CA, Mar 1990.
    [3]
    Cadence Design Systems, Inc., Advanced CAE Division, Lowell, MA. Verilog-XL Reference Manua~ 1989.
    [4]
    C. M. Cheng. Programmer's Guide to the USC Orthogonal Multiprocessor Simulator. Technical Report, Dept. of Electrical Engineering-Systems, Univ. of Southern California, Los Angeles, CA, Max 1990.
    [5]
    N. Haddadi, K. Hwang, and R. Chellsppa. Viacorn: An Orthogonal Multiprocessor for Early Vision and Neural Computing. in 10ih Interna~ion~l Conference on Pa~tern Recognition, Atlantic City, New Jersey, June 17-21 1990.
    [6]
    D.T. Harper Ill and J.R. Jump. Vector Access Performance in Parallel Memories Using a Skewed Storage Scheme. IEEE Transaction on Computers, C-36:1440-1449~ 1987.
    [7]
    K. Hwang and D. DeGroot(eds.). Parallel Processing for Supercompu~ers and Artificial Intelligence. McGraw Hill, N. Y., Mar 1989.
    [8]
    K. Hwang and D. Kim. Generalization of Orthogonal Multiprocessor for Massively Parallel Computations. In Proceedings of ~he Conference on Frontier8 of Massively Parallel Computations, Fairfax, Virginia, October 10-12 1988.
    [9]
    K. Hwang, P.S. Tseng, and D. Kim. An Orthogonal M ultiprocesso~ for Parallel Scientific Computations. IEEE Transactions on Computers, C- 38(1):47-61, January 1989.
    [10]
    Intel Corporation. i860 Programmer's Reference Manual, April 1989.
    [11]
    Intel Corporation, Santa Clara, CA. The STAR 860:i860 Microprocessor Software Development System, 1989.
    [12]
    S. Mehrotra. Simulator Manual for the USC Orthogonal Multiprocessor. Technical Report, Dept. of Electrical Engineering-Systems, Univ. of Southern California, Los Angeles, CA, Feb 1990.
    [13]
    S. Mehrotra, C. M. Cheng, M. Dubois, K. Hwang, and D. K. Panda. Algorithm-Driven Simulation and Projected Performance of the USC Orthogoned Multiprocessor. In Proc. of Infernaiional Conference on Parallel Procesg.ing, S~. Charles, IL., Aug 1990.
    [14]
    C. Mundie. Defining a Standard Parallel Architecture. High Performance Systems, pages 42-48, December 1989.
    [15]
    W. Oed and O. Lange. On the Effective Bandwidth of Interleaved Memories in Vector Processor Systems. IEEE Transaction on Computers, C-34:949-957, 1985.
    [16]
    D. K. Panda and K. Hwang. Embeddings of Parallel Architectures onto Orthogonal Multiprocessor. Technical Report, Dept. of Electrical Engineering- Systems, Univ. of Southern California, Los Angeles, CA, April 1990.
    [17]
    D. K. Panda and K. Hwang. Reconfigurable Vector Register Windows for Fast Matrix Manipulation on the Orthogonal M ultiprocessor. In Proc. of International Conference on Application Specific Array Processors, Princeton, New Jersey, Sept 5- 7, 1990.
    [18]
    H.D. Schwetman. CSIM: A C-Based, Process- Oriented Simulation Language. In Proceedings of ~he 1986 Win~er Simulation Conference, pages 387-396, 1986.
    [19]
    Viewlogic Systems, Inc., Marlboro, MA. Workview Reference Manua~ 1989.
    [20]
    VMEbus International Trade Association. The VMEbus Specification Manual, 1990.

    Cited By

    View all
    • (2001)Low cost scaleable parallel image processing systemMicroprocessors and Microsystems10.1016/S0141-9331(01)00107-725:3(143-157)Online publication date: May-2001
    • (2000)Algorithms for the parallel alternating direction access machineTheoretical Computer Science10.1016/S0304-3975(99)00280-7245:2(151-173)Online publication date: 28-Aug-2000
    • (2022)SAR Image Change Detection Based on Joint Dictionary Learning With Iterative Adaptive Threshold OptimizationIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing10.1109/JSTARS.2022.318710815(5234-5249)Online publication date: 2022
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM SIGARCH Computer Architecture News
    ACM SIGARCH Computer Architecture News  Volume 18, Issue 3b
    Special Issue: Proceedings of the 4th international conference on Supercomputing
    Sept. 1990
    489 pages
    ISSN:0163-5964
    DOI:10.1145/255129
    Issue’s Table of Contents
    • cover image ACM Conferences
      ICS '90: Proceedings of the 4th international conference on Supercomputing
      June 1990
      492 pages
      ISBN:0897913698
      DOI:10.1145/77726
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 June 1990
    Published in SIGARCH Volume 18, Issue 3b

    Check for updates

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)42
    • Downloads (Last 6 weeks)4

    Other Metrics

    Citations

    Cited By

    View all
    • (2001)Low cost scaleable parallel image processing systemMicroprocessors and Microsystems10.1016/S0141-9331(01)00107-725:3(143-157)Online publication date: May-2001
    • (2000)Algorithms for the parallel alternating direction access machineTheoretical Computer Science10.1016/S0304-3975(99)00280-7245:2(151-173)Online publication date: 28-Aug-2000
    • (2022)SAR Image Change Detection Based on Joint Dictionary Learning With Iterative Adaptive Threshold OptimizationIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing10.1109/JSTARS.2022.318710815(5234-5249)Online publication date: 2022
    • (2005)Hardware support for collective communication operationsParallel Architectures and Their Efficient Use10.1007/3-540-56731-3_11(110-118)Online publication date: 28-May-2005
    • (1995)ProteusMachine Vision and Applications10.1007/BF012134748:2(85-100)Online publication date: 1-Feb-1995
    • (1993)Parallel implementation of prime-factor discrete cosine transform on the orthogonal multiprocessorIEEE Transactions on Circuits and Systems for Video Technology10.1109/76.2127173:2(107-115)Online publication date: 1-Apr-1993
    • (1992)Proteus: a reconfigurable computational network for computer visionProceedings., 11th IAPR International Conference on Pattern Recognition. Vol. IV. Conference D: Architectures for Vision and Pattern Recognition,10.1109/ICPR.1992.202128(43-54)Online publication date: 1992
    • (1992)A heterogeneous parallel accelerator for image analysis and radar signal processingProceedings of the Twenty-Fifth Hawaii International Conference on System Sciences10.1109/HICSS.1992.183155(129-138 vol.1)Online publication date: 1992
    • (1992)Parallel Architectures and Algorithms for Image Component LabelingIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/34.15990414:10(1014-1034)Online publication date: 1-Oct-1992
    • (1991)A conflict-free memory design for multiprocessorsProceedings of the 1991 ACM/IEEE conference on Supercomputing10.1145/125826.125868(46-55)Online publication date: 1-Aug-1991
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media