Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1646461.1646464acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

SCF: a device- and language-independent task coordination framework for reconfigurable, heterogeneous systems

Published: 15 November 2009 Publication History

Abstract

Heterogeneous computing systems comprised of accelerators such as FPGAs, GPUs, and Cell processors coupled with standard microprocessors are becoming an increasingly popular solution to building future computing systems. Although programming languages and tools have evolved to simplify device-level design, programming such systems is still difficult and time-consuming due to system-level challenges involving synchronization and communication between heterogeneous devices, which currently require ad-hoc solutions. To solve this problem, this paper presents the System-Level Coordination Framework (SCF), which enables transparent communication and synchronization between tasks running on heterogeneous processing devices in the system. By hiding low-level architectural details from the application designer, SCF can improve application development productivity, provide higher levels of application portability, and offer rapid design-space exploration of different task/device mappings. In addition, SCF enables custom communication synthesis, which can provide performance improvements over generic solutions employed previously.

References

[1]
Olukotun, K. and Hammond, L. 2005. The Future of Microprocessors. Queue. Vol. 3, Issue 7 (Sep 2005), pp. 26--29.
[2]
Bhat, P. B., Lim, Y. W., and Prasanna, V. K. 1995. Issues in using heterogeneous HPC systems for embedded real time signal processing applications. Proc. of Second International Workshop on Real-Time Computing Systems and Applications (25--27 Oct 1995), pp. 134--141.
[3]
Erbas, C. and Pimentel, A. D. 2003. Utilizing synthesis methods in accurate system-level exploration of heterogeneous embedded systems. IEEE Workshop on Signal Processing Systems (27--29 Aug 2003), SIPS 2003. pp. 310--315.
[4]
SRC Computers, Inc. 2009. MAPstation workstations. www.srccomp.com/products/mapstation.asp, (website accessed on July 12, 2009).
[5]
XtremeData, Inc. 2009. In-Socket Accelerators. http://www.xtremedatainc.com/index.php?option=com_content&view=article&id=109&Itemid=170, (website accessed on July 12, 2009).
[6]
El-Ghazawi, T., El-Araby, E., Huang, M., Gaj, K., Kindratenko, V., and Buell, D. 2008. The Promise of High-Performance Reconfigurable Computing. IEEE Computer, vol. 41, no. 2 (Feb 2008), pp. 69--76.
[7]
F- Chen, T., Raghavan, R., Dale, J. N., and Iwata, E. 2007. Cell broadband engine architecture and its first implementation: a performance view. IBM Journal of Research and Development 51, 5, 559--572.
[8]
Altera Corp. 2008. Stratix IV Device Handbook. Altera Corp.
[9]
Xilinx, Inc. 2008. Virtex-5 Family Overview. Xilinx, Inc.
[10]
Nvidia Corp. 2006. Nvidia GeForce 8800 GPU Architecture Overview. Nvidia Corp
[11]
Nvidia Corp. 2009. Nvidia tesla Tesla S1070 specifications. Nvidia Corp.
[12]
Ambric, Inc. 2008. Ambric technology backgrounder. http://www.ambric.com/technology/technology-overview.php
[13]
ClearSpeed Technology PLC. 2007. CSX600 Architecture. Whitepaper. ClearSpeed Technology PLC.
[14]
Shih, K., Balachandran, A., Nagarajan, K., Holland, B., Slatton, C., and George, A. 2008. Fast Real-time LIDAR Processing on FPGAs. Proc. of International Conference on Engineering of Reconfigurable Systems and Algorithms (July 14--17 2008), ERSA 2008. Las Vegas, NV.
[15]
Storaasli, O. 2008. Accelerating Genome Sequencing 100--1000X with FPGAs. Many-core and Reconfigurable Supercomputing Conference (April 2008), MRSC 2008.
[16]
Williams, J., George, A., Richardson, J., Gosrani, K., Massie, C., and Lam, H. 2009. Characterization of Fixed and Reconfigurable Multi-Core Devices for Application Acceleration. ACM Transactions on Reconfigurable Technology and Systems. Accepted to appear.
[17]
Sunderam, V. S. 1990. PVM: A Framework for Parallel Distributed Computing. Concurrency: Practice and Experience 2, 4, (Dec 1990), pp 315--339.
[18]
MPI website. http://www.mcs.anl.gov/research/projects/mpi/, (website accessed on July 12, 2009).
[19]
Lastovetsky, A. and Reddy, R. 2006. HeteroMPI: towards a message-passing library for heterogeneous networks of computers. J. Parallel Distrib. Comput. 66, 2 (Feb. 2006), 197--220.
[20]
Graham, R. L., Shipman, G. M., Barrett, B. W., Castain, R. H., Bosilca, G., and Lumsdaine, A. 2006. Open MPI: A High-Performance, Heterogeneous MPI. Proc. of IEEE International Conference on Cluster Computing (Sept 2006), pp. 1--9.
[21]
Massetto, F. I., Gomes A. M., and Sato L. M. 2006. HyMPI -- A MPI Implementation for Heterogeneous High Performance Systems. International Conference on Advances in Grid and Pervasive Computing (May 2006), Taichung, Taiwan.
[22]
Lee, E. A. 2003. Overview of the Ptolemy Project. Technical Memorandum No. UCB/ERL M03/25 (July 2, 2003). University of California, Berkeley, CA, USA.
[23]
Franklin, M., Tyson, E., Buckley, J., Crowley, P., and Maschmeyer, J. 2006. Auto-Pipe and the X Language: A Pipeline Design Tool and Description Language. In Proc. of the 20th International Parallel and Distributed Processing Symposium (April 2006). Rhodes Island, Greece.
[24]
Saldana, M., Patel, A., Madill, C., Nunes, D., Danyao Wang, Styles, H., Putnam, A., Wittig, R., and Chow, P. 2008. MPI as an abstraction for software-hardware interaction for HPRCs. Second International Workshop on High-Performance Reconfigurable Computing Technology and Applications (Nov 2008), HPRCTA 2008. pp. 1--10.
[25]
Luk, W., Coutinho, J., Todman, T. J., Lam, Y. M., Osborne, W. G., Susanto, K. W., and Wong, W. S. 2009. A High-Level Compilation Toolchain for Heterogeneous Systems. IEEE International SOC conference (Sept. 9--11, 2009). Belfast, Northern Ireland, U.K.
[26]
OpenFPGA GenAPI version 0.4 Draft For Comment. 2009 http://www.openfpga.org/pages/Standards.aspx, (website accessed July 12, 2009).
[27]
OpenCL 1.0 Specification. 2009 http://www.khronos.org/registry/cl/specs/opencl-1.0.43.pd, (website accessed July 12, 2009).
[28]
Nvidia corp. 2009. CUDA 2.3 QuickStart Guide. http://www.nvidia.com/object/cuda_develop.html (website accessed August 28, 2009).
[29]
Impulse accelerated technologies. 2009. Impulse C language. Impulse accelerated technologies.
[30]
VHDL standard (IEEE 1076). 2008. http://www.vhdl.org/vasg/ (website accessed July 12, 2009).
[31]
Chatha, K. S. and Vemuri, R. 2001. MAGELLAN: multiway hardware-software partitioning and scheduling for latency minimization of hierarchical control-dataflow task graphs. In Proceedings of the Ninth international Symposium on Hardware/Software Codesign (Copenhagen, Denmark). CODES '01. 42--47
[32]
Dave, B. P. 1999. CRUSADE: hardware/software co-synthesis of dynamically reconfigurable heterogeneous real-time distributed embedded systems. Proc. of Design, Automation and Test in Europe Conference and Exhibition (1999), pp. 97--104.
[33]
Hoare, C. A. 1978. Communicating sequential processes. Communication ACM 21, 8 (Aug. 1978), pp. 666--677.
[34]
RFC 4506 - XDR: External Data Representation Standard. May 2006. http://www.rfc-editor.org/rfc/rfc4506.txt, (website accessed July 27, 2009).
[35]
RFC 3072 - Structured Data Exchange Format (SDXF). Mar 2001. http://tools.ietf.org/html/rfc3072, (website accessed July 27, 2009).
[36]
Xtext Reference Documentation. 2009. http://www.openarchitectureware.org/pub/documentation/4.1/r80_xtextReference.pdf (website accessed July 27, 2009).
[37]
Eclipse. Eclipse SDK, Version: 3.4.1., www.eclipse.org (website accessed July 27, 2009).
[38]
Lee, C. R., and Z. Salcic. 1997. A Fully-hardware-type Maximum-parallel Architecture for Kalman Tracking Filter in FPGAs. International Conference on Communications and Signal Processing (1997), 1243--1247.

Cited By

View all
  • (2012)SCFACM Transactions on Reconfigurable Technology and Systems10.1145/2209285.22092865:2(1-23)Online publication date: 1-Jun-2012
  • (2012)Heterogeneous tasks and conduits framework for rapid application portability and deployment2012 Innovative Parallel Computing (InPar)10.1109/InPar.2012.6339588(1-9)Online publication date: May-2012
  • (2011)Platform-aware bottleneck detection for reconfigurable computing applicationsACM Transactions on Reconfigurable Technology and Systems10.1145/2000832.20008424:3(1-28)Online publication date: 22-Aug-2011
  • Show More Cited By

Index Terms

  1. SCF: a device- and language-independent task coordination framework for reconfigurable, heterogeneous systems

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    HPRCTA '09: Proceedings of the Third International Workshop on High-Performance Reconfigurable Computing Technology and Applications
    November 2009
    61 pages
    ISBN:9781605587219
    DOI:10.1145/1646461
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 November 2009

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. accelerators
    2. communication
    3. coordination
    4. heterogeneous computing
    5. portability
    6. productivity
    7. reconfigurable computing

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    SC '09
    Sponsor:

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2012)SCFACM Transactions on Reconfigurable Technology and Systems10.1145/2209285.22092865:2(1-23)Online publication date: 1-Jun-2012
    • (2012)Heterogeneous tasks and conduits framework for rapid application portability and deployment2012 Innovative Parallel Computing (InPar)10.1109/InPar.2012.6339588(1-9)Online publication date: May-2012
    • (2011)Platform-aware bottleneck detection for reconfigurable computing applicationsACM Transactions on Reconfigurable Technology and Systems10.1145/2000832.20008424:3(1-28)Online publication date: 22-Aug-2011
    • (2011)SHMEM+ACM Transactions on Reconfigurable Technology and Systems10.1145/2000832.20008384:3(1-24)Online publication date: 22-Aug-2011
    • (2011)Extending Java for heterogeneous embedded system description6th International Workshop on Reconfigurable Communication-Centric Systems-on-Chip (ReCoSoC)10.1109/ReCoSoC.2011.5981527(1-6)Online publication date: Jun-2011
    • (2011)An End-to-End Tool Flow for FPGA-Accelerated Scientific ComputingIEEE Design & Test10.1109/MDT.2011.4628:4(68-77)Online publication date: 1-Jul-2011
    • (2010)Integrating High-Level Synthesis into MPIProceedings of the 2010 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines10.1109/FCCM.2010.34(175-178)Online publication date: 2-May-2010

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media