DOI: 10.5555/2616606.2616683
research-article

A fault detection mechanism in a data-flow scheduled multithreaded processor

Published: 24 March 2014

Abstract

This paper designs and implements Redundant Multi-Threading (RMT) in a Data-flow scheduled Multi-Threaded (DMT) multicore processor; the resulting design is called Data-flow scheduled Redundant Multi-Threading (DRMT). It also presents Asynchronous Output Comparison (AOC) for RMT techniques, which avoids fault-detection-related inter-core communication and alleviates the performance and hardware overheads induced by output comparison. Results show that the performance overhead of DRMT is less than 60% even when the number of threads is four times the number of processing elements, and that the performance and hardware overheads of AOC are insignificant.
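
As a rough software analogy only (the paper describes a hardware mechanism, and this is not its design), the general idea of redundant execution with output comparison taken off the critical path can be sketched as follows. Two copies of the same computation each publish their outputs to their own buffer without waiting for the other, and a separate comparator checks the buffered pairs. The names `run_redundant` and `inject_fault_at` are hypothetical and exist only for this illustration; the fault injection assumes integer outputs.

```python
import queue
import threading


def run_redundant(fn, inputs, inject_fault_at=None):
    """Run fn over inputs in two redundant threads and compare their outputs.

    Returns the indices at which the two copies disagreed.
    inject_fault_at is a hypothetical test hook: the index at which the
    trailing copy's (integer) output is corrupted to mimic a transient fault.
    """
    leading_out = queue.Queue()
    trailing_out = queue.Queue()
    mismatches = []

    def worker(out, faulty):
        for i, x in enumerate(inputs):
            y = fn(x)
            if faulty and i == inject_fault_at:
                y = y ^ 1  # flip the lowest bit to simulate a soft error
            out.put((i, y))  # publish the output without waiting for the peer
        out.put(None)  # sentinel: no more outputs

    def comparator():
        # Consumes buffered outputs pairwise, decoupled from both workers.
        while True:
            a = leading_out.get()
            b = trailing_out.get()
            if a is None or b is None:
                break
            if a != b:
                mismatches.append(a[0])

    threads = [
        threading.Thread(target=worker, args=(leading_out, False)),
        threading.Thread(target=worker, args=(trailing_out, True)),
        threading.Thread(target=comparator),
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return mismatches
```

Calling `run_redundant(lambda x: x * x, [1, 2, 3, 4], inject_fault_at=2)` reports a mismatch at index 2, while the same call without an injected fault reports none. In this sketch neither worker blocks on the comparison, which is the property the word "asynchronous" is meant to convey here.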

Cited By

  • (2019) A Survey on Multithreading Alternatives for Soft Error Fault Tolerance. ACM Computing Surveys 52:2 (1-38). DOI: 10.1145/3302255. Online publication date: 27-Mar-2019

Published In

ACM Other conferences
DATE '14: Proceedings of the conference on Design, Automation & Test in Europe
March 2014
1959 pages
ISBN: 9783981537024

Sponsors

  • EDAA: European Design Automation Association
  • ECSI
  • EDAC: Electronic Design Automation Consortium
  • IEEE Council on Electronic Design Automation (CEDA)
  • The Russian Academy of Sciences

Publisher

European Design and Automation Association

Leuven, Belgium

Qualifiers

  • Research-article

Conference

DATE '14
Sponsor:
  • EDAA
  • EDAC
  • The Russian Academy of Sciences
DATE '14: Design, Automation and Test in Europe
March 24 - 28, 2014
Dresden, Germany

Acceptance Rates

Overall Acceptance Rate 518 of 1,794 submissions, 29%
