Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Synthesis of networks of custom processing elements for real-time physical system emulation

Published: 11 April 2013 Publication History

Abstract

Emulating a physical system in real-time or faster has numerous applications in cyber-physical system design and deployment. For example, testing of a cyber-device's software (e.g., a medical ventilator) can be done via interaction with a real-time digital emulation of the target physical system (e.g., a human's respiratory system). Physical system emulation typically involves iteratively solving thousands of ordinary differential equations (ODEs) that model the physical system. We describe an approach that creates custom processing elements (PEs) specialized to the ODEs of a particular model while maintaining some programmability, targeting implementation on field-programmable gate arrays (FPGAs). We detail the PE micro-architecture and accompanying automated compilation and synthesis techniques. Furthermore, we describe our efforts to use a high-level synthesis approach that incorporates regularity extraction techniques as an alternative FPGA-based solution, and also describe an approach using graphics processing units (GPUs). We perform experiments with five models: a Weibel lung model, a Lutchen lung model, an atrial heart model, a neuron model, and a wave model; each model consists of several thousand ODEs and targets a Xilinx Virtex 6 FPGA. Results of the experiments show that the custom PE approach achieves 4X-9X speedups (average 6.7X) versus our previous general ODE-solver PE approach, and 7X-10X speedups (average 8.7X) versus high-level synthesis, while using approximately the same or fewer FPGA resources. Furthermore, the approach achieves speedups of 18X-32X (average 26X) versus an Nvidia GTX 460 GPU, and average speedups of more than 100X compared to a six-core TI DSP processor or a four-core ARM processor, and 24X versus an Intel I7 quad core processor running at 3.06 GHz. While an FPGA implementation costs about 3X-5X more than the non-FPGA approaches, a speedup/dollar analysis shows 10X improvement versus the next best approach, with the trend of decreasing FPGA costs improving speedup/dollar in the future.

References

[1]
ATI Graphics Cards. 2011. http://ati.amd.com/support/driver.html.
[2]
Atkinson, K. 1993. Elementary Numerical Analysis 2nd Ed. John Wiley & Sons, Inc. New York, New York.
[3]
AutoEsl. http://www.xilinx.com/tools/autoesl.htm.
[4]
Butcher, J. C. 2003. Numerical Methods for Ordinary Differential Equations. Wiley.
[5]
CellML. 2011. http://www.cellml.org.
[6]
Celoxica. 2011. http://www.celoxica.com/.
[7]
Gokhale, M. B., Stone, J. M., Arnold, J., and Lalinowski, M. 2000. Stream-oriented FPGA computing in the Streams-C high level language. In Proceedings of the IEEE Symposium on FPGAs for Custom Computing Machines.
[8]
Huang, C., Vahid, F., and Givargis, T. 2011. A custom FPGA processor for physical model ordinary differential equation solving. IEEE Embed. Sys. Lett. 3, 4, 113--116.
[9]
Huang, C., Vahid, F., and Givargis, T. 2012. Automatic synthesis of physical system differential equation models to a processing element network on FPGAs. Trans. Embed. Comput. Syst. To appear.
[10]
Hucka, M., Finney, A., et al. 2004. Evolving a lingua franca and associated software infrastructure for computational systems biology: The Systems Biology Markup Language (SBML) project. IEEE Syst. Biology, 41--53.
[11]
Impulse C. 2011. http://www.impulseaccelerated.com/.
[12]
Jsim. 2011. http://nsr.bioeng.washington.edu/jsim/.
[13]
Kum, K., Kang, J., and Sung, W. 2000. AUTOSCALER for C: an optimizing floating-point to integer C program converter for fixed-point digital signal processors. IEEE Trans. Analog Digital Signal Process. 47, 9, 840--848.
[14]
Lee, E. A. 2008. Cyber physical systems: Design challenges. Tech. rep. UCB/EECS-2008-8, EECS Department University of California.
[15]
Lionetti, F. 2010. http://cseweb.ucsd.edu/groups/-hpcl/scg/papers/2010/Europ10-src-src-GPU.pdf.
[16]
Lutchen, F. P. Primiano, J. R., and Saidel, G. M. 1982. A nonlinear model combining pulmonary mechanics and gas concentration dynamics. IEEE Trans. Bio-med. Electron. 29, 629--641.
[17]
Mathworks. 2011. Matlab and Simulink. http://www.mathworks.com/.
[18]
medGadget. 2008. Supercomputer creates most advanced heart model. Internet J. Emerg. Med. Tech.
[19]
Meyer, J. and Kocan, F. 2007. Sharing of SRAM Tables Among NPN-Equivalent LUTs in SRAM-Based FPGAs. IEEE Trans. VLSI Syst. 15, 2, 182--195.
[20]
Motuk, E., Woods, R., and Bilbao, S. 2005. Implementation of finite difference schemes for the wave equation on FPGA. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing.
[21]
National Instruments. 2011. LabView FPGA Module. http://www.ni.com/fpga/.
[22]
Nsr Physiome Project. 2011. Mathematical Markup Language. http://nsr.bioeng.washington.edu/jsim/docs/MML_Intro.html.
[23]
Nvidia Corporation. 2011. http://www.nvidia.com/object/gpu.html.
[24]
Osana, Y., Fukushima, T., and Amano, H. 2004. ReCSiP: a reconfigurable cell simulation platform: accelerating biological applications with FPGA. In Proceedings of the Asia and South Pacific Design Automation Conference.
[25]
Paulin, P. G., Knight, J. P., and Girczyc, E. F. 1986. HAL: a multi-paradigm approach to automatic data path synthesis. In Proceedings of the 23rd ACM/IEEE Design Automation Conference (DAC'86). IEEE, 263--270.
[26]
Pimentel, J. and Tirat-Gefen, Y. 2006. Hardware acceleration for real time simulation of physiological systems. In Proceedings of the 28th Annual International Conference of the Engineering in Medicine and Biology Society (EMBS'06). IEEE.
[27]
Rao, D. S. and Kurdahi, F. J. 1993. On clustering for maximal regularity extraction. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 12, 8, 1198--1208.
[28]
Reshadi, M., B. Gorjiara, B., and Gajski, D. 2005. Utilizing horizontal and vertical parallelism using a no-instruction-set compiler and custom datapaths, In Proceedings of the IEEE International Conference on Computer Design.
[29]
Spark Project. 2005. http://mesl.ucsd.edu/spark/.
[30]
SynphonyC. 2011. http://www.synopsys.com/Systems/BlockDesign/HLS/Pages/SynphonyC-Compiler.aspx.
[31]
Terman, D., Ahn, S., Wang, X., and Just, W. 2008. Reducing neuronal networks to discrete dynamics. Physica D 237, 3, 324--338.
[32]
Villarreal, J., Park, A., Najjar, W., and Halstead, R. Designing modular hardware accelerators in C with ROCCC 2.0. In Proceedings of the IEEE International Symposium on Field-Programmable Custom Computing Machines. 127--134.
[33]
Weibel, E. R. 1963. Morphometry of the Human Lung. Springer.
[34]
Xilinx Ise. 2011. http://www.xilinx.com/support/documentation/dt_ise12-4.htm.
[35]
Yoshimi, M., Osana, Y., Fukushima, T., and Amano, H. 2004. Stochastic Simulation for Biochemical Reactions on FPGA. In Field Programmable Logic and Application, Lecture Notes in Computer Science, vol. 3203, 105--114.
[36]
Zhang, H., Holden, A. V., and Boyett, M. R. 2001. Gradient model versus mosaic model of the sinoatrial node. Circulation 103, 4, 584--588.

Cited By

View all
  • (2023)Towards an Accelerator for Differential and Algebraic Equations Useful to ScientistsIEEE Computer Architecture Letters10.1109/LCA.2023.333231822:2(185-188)Online publication date: Jul-2023
  • (2019)Robust Design and Validation of Cyber-physical SystemsACM Transactions on Embedded Computing Systems10.1145/336209818:6(1-21)Online publication date: 15-Nov-2019
  • (2017)Review of Hardware Platforms for Real-Time Simulation of Electric MachinesIEEE Transactions on Transportation Electrification10.1109/TTE.2017.26561413:1(130-146)Online publication date: Mar-2017
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Design Automation of Electronic Systems
ACM Transactions on Design Automation of Electronic Systems  Volume 18, Issue 2
March 2013
429 pages
ISSN:1084-4309
EISSN:1557-7309
DOI:10.1145/2442087
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Journal Family

Publication History

Published: 11 April 2013
Accepted: 01 September 2012
Revised: 01 May 2012
Received: 01 January 2012
Published in TODAES Volume 18, Issue 2

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Custom processor
  2. field-programmable gate array (FPGA)
  3. high-level synthesis
  4. ordinary differential equation (ODE) solving
  5. physical models
  6. real-time emulation

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)1
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Towards an Accelerator for Differential and Algebraic Equations Useful to ScientistsIEEE Computer Architecture Letters10.1109/LCA.2023.333231822:2(185-188)Online publication date: Jul-2023
  • (2019)Robust Design and Validation of Cyber-physical SystemsACM Transactions on Embedded Computing Systems10.1145/336209818:6(1-21)Online publication date: 15-Nov-2019
  • (2017)Review of Hardware Platforms for Real-Time Simulation of Electric MachinesIEEE Transactions on Transportation Electrification10.1109/TTE.2017.26561413:1(130-146)Online publication date: Mar-2017
  • (2016)GeckoFTLProceedings of the 2016 International Conference on Management of Data10.1145/2882903.2915219(327-342)Online publication date: 26-Jun-2016
  • (2015)Rethinking I/O request management over eMMC-based solid-state drivesProceedings of the 2015 Conference on research in adaptive and convergent systems10.1145/2811411.2811536(451-456)Online publication date: 9-Oct-2015

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media