Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2765491.2765500acmconferencesArticle/Chapter ViewAbstractPublication PagesnanoarchConference Proceedingsconference-collections
research-article

Introducing OVP awareness to achieve an efficient permanent defect locating

Published: 04 July 2012 Publication History

Abstract

Fault-tolerance now plays an important role in covering the increasing soft/hard error rates in electronic devices that accompany the advances in process technologies. Research shows that wear-out faults have a gradual onset, starting with a timing fault and then eventually leading to a permanent fault. Error detection is thus a required function to maintain execution correctness. Currently, however, many highly dependable methods to cover permanent faults are commonly over-designed for very frequent checking, due to the lack of the awareness of the potential fault rate in the near future. In this research, we propose a method to lower the cost of locating the permanent defects by introducing an operation vulnerable probability (OVP). Specifically, this method segments a long data-path by means of check instructions which tag the segment with their check results and thus allow the later permanent error locating to be carried out inside the segmented zone. We find that our approach saves 64% energy per 10M executions by setting the OVP threshold at 20%.

References

[1]
F. Wang and V. Agrawal, "Single Event Upset: An Embedded Tutorial," in Proceedings of the 21st International Conference on VLSI Design, 2008, pp. 429--434, jan. 2008.
[2]
H. Wang, S. Rodriguez, C. Dirik, A. Gole, V. Chan, and B. Jacob, "TERPS: the embedded reliable processing system," in Proceedings of the Asia and South Pacific Design Automation Conference, 2005, vol. 2 of ASPDAC'05, pp. D/1--D/2, jan. 2005.
[3]
J. C. Pickel and J. T. Blandford, "Cosmic Ray Induced in MOS Memory Cells," IEEE Transactions on Nuclear Science, vol. 25, pp. 1166--1171, dec. 1978.
[4]
T. Karnik and P. Hazucha, "Characterization of soft errors caused by single event upsets in CMOS processes," IEEE Transactions on Dependable and Secure Computing, vol. 1, pp. 128--143, april--june 2004.
[5]
J. M. Rabaey, A. Chandrakasan, and B. Nikolic, Digital integrated circuits- A design perspective. Prentice Hall, 2ed ed., 2004.
[6]
J. H. Stathis, "Reliability limits for the gate insulator in CMOS technology," IBM Journal of Research and Development, vol. 46, pp. 265--286, march 2002.
[7]
J. Srinivasan, S. Adve, P. Bose, and J. Rivers, "The impact of technology scaling on lifetime reliability," in Proceedings of the 2004 International Conference on Dependable Systems and Networks, DSN'04, pp. 177--186, june-1 july 2004.
[8]
J. W. McPherson, "Reliability challenges for 45nm and beyond," in Proceedings of the 43rd annual Design Automation Conference, DAC '06, pp. 176--181, 2006.
[9]
J. Srinivasan, S. Adve, P. Bose, and J. Rivers, "Lifetime reliability: toward an architectural solution," Micro, IEEE, vol. 25, pp. 70--80, may--june 2005.
[10]
A. A. Hwang, I. A. Stefanovici, and B. Schroeder, "Cosmic rays don't strike twice: understanding the nature of DRAM errors and the implications for system design," in Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS '12, pp. 111--122, ACM, 2012.
[11]
S. Shyam, K. Constantinides, S. Phadke, V. Bertacco, and T. Austin, "Ultra low-cost defect protection for microprocessor pipelines," SIGARCH Comput. Archit. News, vol. 34, pp. 73--82, Oct. 2006.
[12]
S. Arslan and G. Shah, "A Flexible in-Field Test Controller," in Proceedings of the 2011 IEEE 14th International Multitopic Conference (INMIC'11), pp. 71--75, dec. 2011.
[13]
G. Shah and S. Arslan, "Design of an in-field Embedded Test Controller," Master's thesis, Linköping University, Sweden, 2011.
[14]
S.-B. Park, T. Hong, and S. Mitra, "Post-Silicon Bug Localization in Processors Using Instruction Footprint Recording and Analysis (IFRA)," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 28, pp. 1545--1558, oct. 2009.
[15]
N. Devisetti, T. Iwakami, K. Yoshimura, T. Nakada, J. Yao, and Y. Nakashima, "LAPP: A Low Power Array Accelerator with Binary Compatibility," in Proceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD Forum, IPDPSW '11, pp. 854--862, IEEE Computer Society, 2011.
[16]
K. Yoshimura, T. Iwakami, T. Nakada, J. Yao, H. Shimada, and Y. Nakashima, "An Instruction Mapping Scheme for FU Array Accelerator," IEICE Transaction on Information and Systems, vol. E94D, Feb 2011.
[17]
N. Oh, P. Shirvani, and E. McCluskey, "Error detection by duplicated instructions in super-scalar processors," IEEE Transactions on Reliability, vol. 51, pp. 63--75, mar 2002.
[18]
Y.-Y. Chen and K.-L. Leu, "Reliable data path design of VLIW processor cores with comprehensive error-coverage assessment," Microprocessors and Microsystems, vol. 34, no. 1, pp. 49--61, 2010.
[19]
J. Gaisler, "A portable and fault-tolerant microprocessor based on the SPARC v8 architecture," in Proceedings of the 2002 International Conference on Dependable Systems and Networks., DSN'02, pp. 409--415, 2002.
[20]
Y.-Y. Chen, S.-J. Horng, and H.-C. Lai, "An integrated fault-tolerant design framework for VLIW processors," in Proceedings of the 18th IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems, 2003, pp. 555--562, nov. 2003.
[21]
S. Hauck and A. DeHon, Reconfigurable Computing: The Theory and Practice of FPGA-Based Computation. Morgan Kaufmann, Amsterdam, nov 2007.
[22]
S. Vassiliadis, S. Wong, G. Gaydadjiev, K. Bertels, G. Kuzmanov, and E. Panainte, "The MOLEN polymorphic processor," IEEE Transactions on Computers, vol. 53, pp. 1363--1375, nov. 2004.
[23]
R. Hartenstein, "A decade of reconfigurable computing: a visionary retrospective," in Proceedings of the 2001 conference on Design, automation and test in Europe, DATE '01, pp. 642--649, 2001.
[24]
M. Azeem, S. Piestrak, O. Sentieys, and S. Pillement, "Error recovery technique for coarse-grained reconfigurable architectures," in Proceedings of the 2011 IEEE 14th International Symposium on Design and Diagnostics of Electronic Circuits Systems (DDECS), pp. 441--446, april 2011.
[25]
S. Mukherjee, C. Weaver, J. Emer, S. Reinhardt, and T. Austin, "A systematic methodology to compute the architectural vulnerability factors for a high-performance microprocessor," in Proceedings of the 36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003. MICRO-36., pp. 29--40, dec. 2003.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
NANOARCH '12: Proceedings of the 2012 IEEE/ACM International Symposium on Nanoscale Architectures
July 2012
243 pages
ISBN:9781450316712
DOI:10.1145/2765491
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 July 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. FU array
  2. fault-tolerant computing
  3. low power
  4. operation vulnerable probability

Qualifiers

  • Research-article

Conference

NANOARCH '12
Sponsor:

Acceptance Rates

Overall Acceptance Rate 55 of 87 submissions, 63%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 15
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 30 Aug 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media