Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

HopliteBuf: Network Calculus-Based Design of FPGA NoCs with Provably Stall-Free FIFOs

Published: 13 February 2020 Publication History
  • Get Citation Alerts
  • Abstract

    HopliteBuf is a deflection-free, low-cost, and high-speed FPGA overlay Network-on-chip (NoC) with stall-free buffers. It is an FPGA-friendly 2D unidirectional torus topology built on top of HopliteRT overlay NoC. The stall-free buffers in HopliteBuf are supported by static analysis tools based on network calculus that help determine worst-case FIFO occupancy bounds for a prescribed workload. We implement these FIFOs using cheap LUT SRAMs (Xilinx SRL32s and Intel MLABs) to reduce cost. HopliteBuf is a hybrid microarchitecture that combines the performance benefits of conventional buffered NoCs by using stall-free buffers with the cost advantages of deflection-routed NoCs by retaining the lightweight unidirectional torus topology structure. We present two design variants of the HopliteBuf NoC: (1) single corner-turn FIFO (WS) and (2) dual corner-turn FIFO (WS+N). The single corner-turn (WS) design is simpler and only introduces a buffering requirement for packets changing dimension from the X ring to the downhill Y ring (or West to South). The dual corner-turn variant requires two FIFOs for turning packets going downhill (WS) as well as uphill (WN). The dual corner-turn design overcomes the mathematical analysis challenges associated with single corner-turn designs for communication workloads with cyclic dependencies between flow traversal paths at the expense of a small increase in resource cost. Our static analysis delivers bounds that are not only better (in latency) than HopliteRT but also tighter by 2−3×. Across 100 randomly generated flowsets mapped to a 5×5 system size, HopliteBuf is able to route a larger fraction of these flowsets with <128-deep FIFOs, boost worst-case routing latency by ≈ 2× for mutually feasible flowsets, and support a 10% higher injection rate than HopliteRT. At 20% injection rates, HopliteRT is only able to route 1--2% of the flowsets, while HopliteBuf can deliver 40--50% sustainability. When compared to the WSbkp backpressure-based router, we observe that our HopliteBuf solution offers 25--30% better feasibility at 30--40% lower LUT cost.

    References

    [1]
    Altera Corp. 2015. Arria 10 Core Fabric and General Purpose I/Os Handbook. Retrieved from https://www.altera.com/en_US/pdfs/literature/hb/arria-10/a10_handbook.pdf.
    [2]
    Ahmed Amari and Ahlem Mifdaoui. 2017. Worst-case timing analysis of ring networks with cyclic dependencies using network calculus. In Proceedings of the IEEE International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA’17). IEEE.
    [3]
    Ken Chapman. 2008. Saving costs with the SRL16E. White Paper WP271 (v1. 0), Xilinx Inc (2008).
    [4]
    Jan Gray. 2016. GRVI-Phalanx: A massively parallel RISC-V FPGA accelerator accelerator. In Proceedings of the 24th IEEE Symposium on Field-Programmable Custom Computing Machines. IEEE, 17--20.
    [5]
    Yutian Huan and A. DeHon. 2012. FPGA optimized packet-switched NoC using split and merge primitives. In Field-Programmable Technology. 47--52.
    [6]
    S. Jeon, J. Cho, Y. Jung, S. Park, and T. Han. 2011. Automotive hardware development according to ISO 26262. In 13th International Conference on Advanced Communication Technology (ICACT’11). 588--592.
    [7]
    N. Kapre and J. Gray. 2015. Hoplite: Building austere overlay NoCs for FPGAs. In Field Programmable Logic and Applications. 1--8.
    [8]
    H. Kashif and H. Patel. 2014. Bounding buffer space requirements for real-time priority-aware networks. In 2014 19th Asia and South Pacific Design Automation Conference (ASP-DAC’14). 113--118.
    [9]
    Hany Kashif and Hiren Patel. 2016. Buffer space allocation for real-time priority-aware networks. In Proceedings of the IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS’16). IEEE, 1--12.
    [10]
    John Kim. 2009. Low-cost router microarchitecture for on-chip networks. In 42st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-42’09), David H. Albonesi, Margaret Martonosi, David I. August, and José F. Martínez (Eds.). ACM, 255--266.
    [11]
    Jean-Yves Le Boudec and Patrick Thiran. 2001. Network Calculus: A Theory of Deterministic Queuing Systems for the Internet. Springer-Verlag.
    [12]
    Michael K. Papamichael and James C. Hoe. 2012. CONNECT: Re-examining conventional wisdom for designing NoCs in the context of FPGAs. In Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays. ACM, 37--46.
    [13]
    Ian Swarbrick, Dinesh Gaitonde, Sagheer Ahmad, Brian Gaide, and Ygal Arbel. 2019. Network-on-chip programmable platform in VersalTM ACAP architecture. In Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA’19). ACM, New York, NY, 212--221.
    [14]
    Saud Wasly, Rodolfo Pellizzoni, and Nachiket Kapre. 2017. HopliteRT: An efficient FPGA NoC for real-time applications. In 2017 International Conference on Field Programmable Technology (ICFPT’17). IEEE, 64--71.
    [15]
    Saud Wasly, Rodolfo Pellizzoni, and Nachiket Kapre. 2017. Worst Case Latency Analysis for Hoplite FPGA-based NoC. Retrieved from http://hdl.handle.net/10012/12600.
    [16]
    Xilinx Inc. 2015. 7 Series FPGAs Configurable Logic Block User Guide. Retrieved from http://www.xilinx.com/support/documentation/user_guides/ug474_7Series_CLB.pdf.

    Cited By

    View all
    • (2022)HopliteML: Evolving Application Customized FPGA NoCs with Adaptable Routers and RegulatorsACM Transactions on Reconfigurable Technology and Systems10.1145/350769915:4(1-33)Online publication date: 14-Feb-2022
    • (2021)User online consumption behaviour based on fractional differential equationApplied Mathematics and Nonlinear Sciences10.2478/amns.2021.2.000937:1(415-424)Online publication date: 22-Nov-2021
    • (2021)Programming and Synthesis for Software-defined FPGA Acceleration: Status and Future ProspectsACM Transactions on Reconfigurable Technology and Systems10.1145/346966014:4(1-39)Online publication date: 13-Sep-2021
    • Show More Cited By

    Index Terms

    1. HopliteBuf: Network Calculus-Based Design of FPGA NoCs with Provably Stall-Free FIFOs

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Transactions on Reconfigurable Technology and Systems
          ACM Transactions on Reconfigurable Technology and Systems  Volume 13, Issue 2
          June 2020
          185 pages
          ISSN:1936-7406
          EISSN:1936-7414
          DOI:10.1145/3383521
          • Editor:
          • Deming Chen
          Issue’s Table of Contents
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 13 February 2020
          Accepted: 01 December 2019
          Revised: 01 December 2019
          Received: 01 May 2019
          Published in TRETS Volume 13, Issue 2

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. FPGA overlay NoC
          2. Network calculus
          3. stall-free buffers

          Qualifiers

          • Research-article
          • Research
          • Refereed

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)20
          • Downloads (Last 6 weeks)1
          Reflects downloads up to 26 Jul 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2022)HopliteML: Evolving Application Customized FPGA NoCs with Adaptable Routers and RegulatorsACM Transactions on Reconfigurable Technology and Systems10.1145/350769915:4(1-33)Online publication date: 14-Feb-2022
          • (2021)User online consumption behaviour based on fractional differential equationApplied Mathematics and Nonlinear Sciences10.2478/amns.2021.2.000937:1(415-424)Online publication date: 22-Nov-2021
          • (2021)Programming and Synthesis for Software-defined FPGA Acceleration: Status and Future ProspectsACM Transactions on Reconfigurable Technology and Systems10.1145/346966014:4(1-39)Online publication date: 13-Sep-2021
          • (2020)Learn the Switches: Evolving FPGA NoCs with Stall-Free and Backpressure Based Routers2020 30th International Conference on Field-Programmable Logic and Applications (FPL)10.1109/FPL50879.2020.00015(18-25)Online publication date: Aug-2020

          View Options

          Get Access

          Login options

          Full Access

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media