Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1555754.1555782acmconferencesArticle/Chapter ViewAbstractPublication PagesiscaConference Proceedingsconference-collections
research-article

Application-aware deadlock-free oblivious routing

Published: 20 June 2009 Publication History

Abstract

Conventional oblivious routing algorithms are either not application-aware or assume that each flow has its own private channel to ensure deadlock avoidance. We present a framework for application-aware routing that assures deadlock-freedom under one or more channels by forcing routes to conform to an acyclic channel dependence graph. Arbitrary minimal routes can be made deadlock-free through appropriate static channel allocation when two or more channels are available. Given bandwidth estimates for flows, we present a mixed integer-linear programming (MILP) approach and a heuristic approach for producing deadlock-free routes that minimize maximum channel load. The heuristic algorithm is calibrated using the MILP algorithm and evaluated on a number of benchmarks through detailed network simulation. Our framework can be used to produce application-aware routes that target the minimization of latency, number of flows through a link, bandwidth, or any combination thereof.

References

[1]
Tobias Bjerregaard and Shankar Mahadevan. A survey of research and practices of network-on-chip. ACM Computing Surveys, 38(1), 2006.
[2]
Tobias Bjerregaard and Jens Sparsø. Virtual channel designs for guaranteeing bandwidth in asynchronous network-on-chip. In Proceedings of the IEEE Norchip Conference (NORCHIP 2004). IEEE, 2004.
[3]
Ge-Ming Chiu. The odd-even turn model for adaptive routing. IEEE Trans. Parallel Distrib. Syst., 11(7):729--738, 2000.
[4]
M. H. Cho, C-C. Cheng, M. Kinsy, G. E. Suh, and S. Devadas. Diastolic Arrays: Throughput-Driven Reconfigurable Computing. In Proceedings of the Int'l Conference on Computer-Aided Design, November 2008.
[5]
M. H. Cho, M. Lis, K. S. Shim, M. Kinsy, T. Wen, and S. Devadas. Oblivious routing in on-chip bandwidth-adaptive networks. Technical Report CSAIL--TR--2009--011 (http://hdl.handle.net/1721.1/44958), Massachusetts Institute of Technology, March 2009.
[6]
Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein. Introduction to Algorithms. MIT Press/McGraw-Hill, 2001.
[7]
William J. Dally, P. P. Carvey, and L. R. Dennison. The Avici terabit switch/router. In Proceedings of the Symposium on Hot Interconnects, pages 41--50, August 1998.
[8]
William J. Dally and Charles L. Seitz. Deadlock-Free Message Routing in Multiprocessor Interconnection Networks. IEEE Trans. Computers, 36(5):547--553, 1987.
[9]
William J. Dally and Brian Towles. Principles and Practices of Interconnection Networks. Morgan Kaufmann, 2003.
[10]
W.J. Dally. Virtual-channel flow control. IEEE Transactions on Paral lel and Distributed Systems, 03(2):194--205, 1992. 35
[11]
José Duato. A new theory of deadlock-free adaptive routing in wormhole networks. IEEE Trans. Paral lel Distrib. Syst., 4(12):1320--1331, 1993.
[12]
José Duato. A necessary and sufficient condition for deadlock-free adaptive routing in wormhole networks. IEEE Trans. Paral lel Distrib. Syst., 6(10):1055--1067, 1995.
[13]
Mike Galles. Scalable pipelined interconnect for distributed endpoint routing: The SGI SPIDER chip. In Proceedings of the Symposium on Hot Interconnects, pages 141--146, August 1996.
[14]
Roman Gindin, Israel Cidon, and Idit Keidar. NoC-Based FPGA: Architecture and Routing. In First International Symposium on Networks-on-Chips (NOCS 2007), pages 253--264, 2007.
[15]
Christopher J. Glass and Lionel M. Ni. The turn model for adaptive routing. J. ACM, 41(5):874--902, 1994.
[16]
Thomas Gross and David R. O'Hallaron. iWarp: anatomy of a parallel computing system. MIT Press, Cambridge, MA, USA, 1998.
[17]
Zvika Guz, Isask'har Walter, Evgeny Bolotin, Israel Cidon, Ran Ginosar, and Avinoam Kolodny. Efficient link capacity and qos design for network-on-chip. In DATE '06: Proceedings of the conference on Design, automation and test in Europe, pages 9--14, 2006.
[18]
J. Hu and R. Marculescu. Exploiting the Routing Flexibility for Energy/Performance Aware Mapping of Regular NoC Architectures. In Proc. Design, Automation and Test in Europe Conference, 2003.
[19]
Jingcao Hu and Radu Marculescu. DyAD: Smart Routing for Networks on Chip. In Design Automation Conference, June 2004.
[20]
Natalie Enright Jerger, Li-Shiuan Peh, and Mikko Lipasti. Virtual circuit tree multicasting: A case for on-chip hardware multicast support. In ISCA '08: Proceedings of the 35th annual international symposium on Computer architecture, 2008.
[21]
N. K. Kavaldjiev, G. J. M. Smit, and P. G. Jansen. A virtual channel router for on-chip networks. In IEEE Int. SOC Conf., Santa Clara, California, pages 289--293. IEEE Computer Society Press, September 2004.
[22]
Jon Michael Kleinberg. Approximation algorithms for disjoint paths problems. PhD thesis, Massachusetts Institute of Technology, 1996. Supervisor-Michel X. Goemans.
[23]
Amit Kumar, Li-Shiuan Peh, Partha Kundu, and Nira j K. Jha. Toward ideal on-chip communication using express virtual channels. IEEE Micro, 28(1):80--90, 2008. 36
[24]
Olav Lysne and José Duato. Fast dynamic reconfiguration in irregular networks. In ICPP '00: Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing, page 449, 2000.
[25]
Robert D. Mullins, Andrew F. West, and Simon W. Moore. Low-latency virtual-channel routers for on-chip networks. In Proc. of the 31st Annual Intl. Symp. on Computer Architecture (ISCA), pages 188--197, 2004.
[26]
Srinivasan Murali, David Atienz, Luca Benini, and Giovanni De Micheli. A Method for Routing Packets Across Multiple Paths in NoCs with In-Order Delivery and FaultTolerance Gaurantees. VLSI Design, 2007.
[27]
Srinivasan Murali and Giovanni De Micheli. Sunmap: a tool for automatic topology selection and generation for nocs. In DAC '04: Proceedings of the 41st annual conference on Design automation, pages 914--919, 2004.
[28]
Ted Nesson and S. Lennart Johnsson. ROMM routing on mesh and torus networks. In Proc. 7th Annual ACM Symposium on Paral lel Algorithms and Architectures SPAA'95, pages 275--287, 1995.
[29]
Lionel M. Ni and Philip K. McKinley. A survey of wormhole routing techniques in direct networks. Computer, 26(2):62--76, 1993.
[30]
Chrysostomos A. Nicopoulos, Dongkook Park, Jongman Kim, Narayanan Vijaykrishnan, Mazin S. Yousif, and Chita R. Das. ViChaR: A dynamic virtual channel regulator for network-on-chip routers. In Proc. of the 39th Annual Intl. Symp. on Microarchitecture (MICRO), 2006.
[31]
M. Palesi, R. Holsmark, S. Kumar, and V. Catania. A methodology for design of application specific deadlock-free routing algorithms for NoC systems. In Proc. Intl. Conf. on Hardware-Software Codesign and System Synthesis, Seoul, Korea, October 2006.
[32]
M. Palesi, G. Longo, S. Signorino, R. Holsmark, S. Kumar, and V. Catania. Design of bandwidth aware and congestion avoiding efficient routing algorithms for networks-on-chip platforms. Proc. of the ACM/IEEE Int. Symp. on Networks-on-Chip (NOCS), pages 97--106, 2008.
[33]
Li-Shiuan Peh and William J. Dally. Flit-reservation flow control. In In Proc. of the 6th Int. Symp. on High-Performance Computer Architecture (HPCA), pages 73--84, January 2000. 37
[34]
Li-Shiuan Peh and William J. Dally. A Delay Model and Speculative Architecture for Pipelined Routers. In Proc. International Symposium on High-Performance Computer Architecture (HPCA), pages 255--266, January 2001.
[35]
Loren Schwiebert. Deadlock-free oblivious wormhole routing with cyclic dependencies. In SPAA '97: Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures, pages 149--158, 1997.
[36]
Daeho Seo, Akif Ali, Won-Taek Lim, Nauman Rafique, and Mithuna Thottethodi. Near-optimal worst-case throughput routing for two--dimensional mesh networks. In Proceedings of the 32nd Annual International Symposium on Computer Architecture (ISCA 2005), pages 432--443, 2005.
[37]
K. S. Shim, M. H. Cho, M. Kinsy, T. Wen, M. Lis, G. E. Suh, and S. Devadas. Static Virtual Channel Allocation in Oblivious Routing. In Proceedings of the 3rd ACM/IEEE International Symposium on Networks-on-Chip, May 2009.
[38]
Craig B. Stunkel and Peter H. Hochschild. SP2 high-performance switch architecture. In Proceedings of the Symposium on Hot Interconnects, pages 115--121, August 1994.
[39]
Craig B. Stunkel, Dennis G. Shea, Don G. Grice, Peter H. Hochschild, and Michael Tsao. The SP1 high-performance switch. In Proceedings of the Scalable High Performance Computing Conference, pages 150--157, May 1994.
[40]
Brian Towles, William J. Dally, and Stephen Boyd. Throughput-centric routing algorithm design. In SPAA '03: Proceedings of the fifteenth annual ACM symposium on Paral lel algorithms and architectures, pages 200--209, 2003.
[41]
L. G. Valiant and G. J. Brebner. Universal schemes for parallel communication. In STOC '81: Proceedings of the thirteenth annual ACM symposium on Theory of computing, pages 263--277, 1981.
[42]
Krzysztof Walkowiak. New algorithms for the unsplittable flow problem. In ICCSA (2), volume 3981 of Lecture Notes in Computer Science, pages 1101--1110, 2006.
[43]
Xiaoxiong Zhong and Virginia Mary Lo. Application-specific deadlock free worwhole routing on multicomputers. In PARLE '92: Proceedings of the 4th International PARLE Conference on Paral lel Architectures and Languages Europe, pages 193--208, 1992.

Cited By

View all
  • (2024)NetSmith: An Optimization Framework for Machine-Discovered Network TopologiesProceedings of the 53rd International Conference on Parallel Processing10.1145/3673038.3673060(421-432)Online publication date: 12-Aug-2024
  • (2023)Graph Based Routing Algorithm for Torus Topology and Its Evaluation for the Angara InterconnectJournal of Parallel and Distributed Computing10.1016/j.jpdc.2023.104765(104765)Online publication date: Sep-2023
  • (2020)Accelerated Reply Injection for Removing NoC Bottleneck in GPGPUs2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS47924.2020.00013(22-31)Online publication date: May-2020
  • Show More Cited By

Index Terms

  1. Application-aware deadlock-free oblivious routing

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ISCA '09: Proceedings of the 36th annual international symposium on Computer architecture
    June 2009
    510 pages
    ISBN:9781605585260
    DOI:10.1145/1555754
    • cover image ACM SIGARCH Computer Architecture News
      ACM SIGARCH Computer Architecture News  Volume 37, Issue 3
      June 2009
      495 pages
      ISSN:0163-5964
      DOI:10.1145/1555815
      Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 June 2009

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. oblivious routing
    2. on-chip interconnection networks
    3. systems-on-chip

    Qualifiers

    • Research-article

    Conference

    ISCA '09
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 543 of 3,203 submissions, 17%

    Upcoming Conference

    ISCA '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)12
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 10 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)NetSmith: An Optimization Framework for Machine-Discovered Network TopologiesProceedings of the 53rd International Conference on Parallel Processing10.1145/3673038.3673060(421-432)Online publication date: 12-Aug-2024
    • (2023)Graph Based Routing Algorithm for Torus Topology and Its Evaluation for the Angara InterconnectJournal of Parallel and Distributed Computing10.1016/j.jpdc.2023.104765(104765)Online publication date: Sep-2023
    • (2020)Accelerated Reply Injection for Removing NoC Bottleneck in GPGPUs2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS47924.2020.00013(22-31)Online publication date: May-2020
    • (2019)LEF: An Effective Routing Algorithm for Two-Dimensional MeshesIEICE Transactions on Information and Systems10.1587/transinf.2019EDP7019E102.D:10(1925-1941)Online publication date: 1-Oct-2019
    • (2019)Scalable interconnects for reconfigurable spatial architecturesProceedings of the 46th International Symposium on Computer Architecture10.1145/3307650.3322249(615-628)Online publication date: 22-Jun-2019
    • (2018)High-Performance, Low-Complexity Deadlock Avoidance for Arbitrary Topologies/RoutingsProceedings of the 2018 International Conference on Supercomputing10.1145/3205289.3205307(129-138)Online publication date: 12-Jun-2018
    • (2018)Application Specific Networks-on-Chip Synthesis: An Energy Efficient Approach2018 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)10.1109/ISVLSI.2018.00020(52-57)Online publication date: Jul-2018
    • (2017)On-Chip Networks, Second EditionSynthesis Lectures on Computer Architecture10.2200/S00772ED1V01Y201704CAC04012:3(1-210)Online publication date: 17-Jun-2017
    • (2017)Adaptive Manycore Architectures for Big Data ComputingProceedings of the Eleventh IEEE/ACM International Symposium on Networks-on-Chip10.1145/3130218.3130236(1-8)Online publication date: 19-Oct-2017
    • (2017)Static Bubble: A Framework for Deadlock-Free Irregular On-chip Topologies2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)10.1109/HPCA.2017.44(253-264)Online publication date: Feb-2017
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media