Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Compiler-Assisted Refresh Minimization for Volatile STT-RAM Cache

Published: 01 August 2015 Publication History

Abstract

Spin-transfer torque RAM (STT-RAM) has been proposed to build on-chip caches because of its attractive features such as high storage density and ultra low leakage power. However, long write latency and high write energy are the two challenges for STT-RAM. Recently, researchers propose to improve the write performance of STT-RAM by relaxing its non-volatility property. To avoid data losses resulting from volatility, refresh schemes have been proposed. However, refresh operations consume additional overhead. In this paper, we propose to significantly reduce the number of refresh operations through re-arranging program data layout at compilation time. An N-refresh scheme is also proposed to further reduce the number of refreshes. Experimental results show that, on average, the proposed methods can reduce the number of refresh operations by 84.2 percent, and reduce the dynamic energy consumption by 38.0 percent for volatile STT-RAM caches while incurring only 4.1 percent performance degradation.

References

[1]
X. Dong, X. Wu, G. Sun, Y. Xie, H. Li, and Y. Chen, “ Circuit and microarchitecture evaluation of 3D stacking magnetic RAM (MRAM) as a universal memory replacement,” in Proc. 45th Annu. Des. Autom. Conf., 2008, pp. 554 –559.
[2]
P. Zhou, B. Zhao, J. Yang, and Y. Zhang, “Energy reduction for STT-RAM using early write termination,” in Proc. IEEE/ACM Int. Conf. Comput.-Aided Des., Nov. 2009, pp. 264–268.
[3]
W. Xu, H. Sun, X. Wang, Y. Chen, and T. Zhang, “Design of last-level on-chip cache using spin-torque transfer RAM (STT RAM),” IEEE Trans. Very Large Scale Integr. Syst., vol. 19, no. 3, pp. 483–493, Mar. 2011.
[4]
G. Sun, X. Dong, Y. Xie, J. Li, and Y. Chen, “A novel architecture of the 3D stacked MRAM L2 cache for CMPs,” in Proc. IEEE 15th Int. Symp. High Perform. Comput. Archit., Feb. 2009, pp. 239 –249.
[5]
C. Smullen, V. Mohan, A. Nigam, S. Gurumurthi, and M. Stan, “Relaxing non-volatility for fast and energy-efficient STT-RAM caches,” in Proc. IEEE 17th Int. Symp. High Perform. Comput. Archit., Feb. 2011, pp. 50–61.
[6]
Z. Sun, X. Bi, H. H. Li, W.-F. Wong, Z.-L. Ong, X. Zhu, and W. Wu, “Multi retention level STT-RAM cache designs with a dynamic refresh scheme,” in Proc. 44th Annu. IEEE/ACM Int. Symp. Microarchit., 2011, pp. 329–338.
[7]
A. Jog, A. K. Mishra, C. Xu, Y. Xie, V. Narayanan, R. Iyer, and C. R. Das, “Cache revive: Architecting volatile STT-RAM caches for enhanced performance in CMPs,” in Proc. 49th Annu. Des. Autom. Conf., 2012, pp. 243–252.
[8]
M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness. San Francisco, CA, USA: Freeman, 1990.
[9]
C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. J. Reddi, and K. Hazelwood, “Pin: Building customized program analysis tools with dynamic instrumentation,” in Proc. ACM SIGPLAN Conf. Program. Lang. Des. Implementation, 2005, pp. 190–200.
[10]
TI. (2011). Tms320dm368 digital media system-on-chip (DMSoC) [Online]. Available: http://www.ti.com/lit/ds/symlink/tms320dm368.pdf
[11]
J. Scott, L. H. Lee, J. Arends, and B. Moyer, “Designing the low-power m*core architecture,” in Proc. Workshop Power Driven Microarchit., 1998, pp. 145–150.
[12]
C. Lattner and V. Adve, “LLVM: A compilation framework for lifelong program analysis & transformation,” in Proc. Int. Symp. Code Gen. Optim., 2004, pp. 75– 86.
[13]
LINGO. (2013). Lingo-optimization modeling software for linear, nonlinear, and integer programming [Online]. Available: http://www.lindo.com
[14]
X. Wu, J. Li, L. Zhang, E. Speight, R. Rajamony, and Y. Xie, “ Hybrid cache architecture with disparate memory technologies,” in Proc. 36th Annu. Int. Symp. Comput. Archit., 2009, pp. 34–45.
[15]
A. Jadidi, M. Arjomand, and H. Sarbazi-Azad, “ High-endurance and performance-efficient design of hybrid cache architectures through adaptive line replacement,” in Proc. 17th IEEE/ACM Int. Symp. Low-Power Electron. Des., 2011, pp. 79–84.
[16]
J. Li, C. J. Xue, and Y. Xu, “STT-RAM based energy-efficiency hybrid cache for CMPs,” in Proc. 19th IFIP/IEEE Int. Conf. Very Large Scale Integr., 2011, pp. 31–36.
[17]
Y.-T. Chen, J. Cong, H. Huang, C. Liu, R. Prabhakar, and G. Reinman, “Static and dynamic co-optimizations for blocks mapping in hybrid caches,” in Proc. ACM/IEEE Int. Symp. Low Power Electron. Des., 2012, pp. 237 –242.
[18]
Q. Li, J. Li, L. Shi, M. Zhao, C. J. Xue, and Y. He, “ Compiler-assisted STT-RAM-based hybrid cache for energy efficient embedded systems,”IEEE Trans. Very Large Scale Integr. Syst., vol. 22, no. 8, pp. 1829 –1840, Aug. 2014.
[19]
K. Qiu, M. Zhao, Q. Li, C. Fu, and C. J. Xue, “Migration-aware loop retiming for STT-RAM-based hybrid cache in embedded systems,”IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst., vol. 33, no. 3, pp. 329–342, Mar. 2014.
[20]
J. Li, L. Shi, Q. Li, C. J. Xue, Y. Chen, Y. Xu, and W. Wang, “Low-energy volatile STT-RAM cache design using cache-coherence-enabled adaptive refresh,”ACM Trans. Des. Autom. Electron. Syst. , vol. 19, no. 5, pp. 5:1–5:13, 2013.
[21]
S. P. Song, “Method and system for selective dram refresh to reduce power consumption,” Patent US6 094 705 A, Jul. 25, 2000.
[22]
M. Ghosh, and H.-H. S. Lee, “Smart refresh: An enhanced memory controller design for reducing energy in conventional and 3D die-stacked drams,” in Proc. 40th Annu. IEEE/ACM Int. Symp. Microarchit., 2007, pp. 134–145.
[23]
T. Ohsawa, K. Kai, and K. Murakami, “ Optimizing the DRAM refresh count for merged DRAM/logic LSIs,” in Proc. Int. Symp. Low Power Electron. Des., 1998, pp. 82–87.
[24]
J. Kim and M. Papaefthymiou, “ Block-based multiperiod dynamic memory design for low data-retention power,” IEEE Trans. Very Large Scale Integr. Syst., vol. 11, no. 6, pp. 1006–1018, Dec. 2003.
[25]
P. G. Emma, W. R. Reohr, and M. Meterelliyoz, “ Rethinking refresh: Increasing availability and reducing power in dram for cache applications,” IEEE Micro, vol. 28, no. 6, pp. 47–56, Nov./Dec. 2008.
[26]
C. Wilkerson, A. R. Alameldeen, Z. Chishti, W. Wu, D. Somasekhar, and S.-L. Lu, “Reducing cache power with low-cost, multi-bit error-correcting codes,” in Proc. 37th Annu. Int. Symp. Comput. Archit., 2010, pp. 83–93.

Cited By

View all

Index Terms

  1. Compiler-Assisted Refresh Minimization for Volatile STT-RAM Cache
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image IEEE Transactions on Computers
      IEEE Transactions on Computers  Volume 64, Issue 8
      Aug. 2015
      318 pages

      Publisher

      IEEE Computer Society

      United States

      Publication History

      Published: 01 August 2015

      Author Tags

      1. refresh
      2. Compilation
      3. volatile STT-RAM

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 12 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2019)MH CacheACM Transactions on Architecture and Code Optimization10.1145/332852016:3(1-26)Online publication date: 18-Jul-2019
      • (2019)Energy minimization in the STT-RAM-based high-capacity last-level cachesThe Journal of Supercomputing10.1007/s11227-019-02918-275:10(6831-6854)Online publication date: 1-Oct-2019
      • (2017)Cross-Layer Optimization for Multilevel Cell STT-RAM CachesIEEE Transactions on Very Large Scale Integration (VLSI) Systems10.1109/TVLSI.2017.266554325:6(1807-1820)Online publication date: 1-Jun-2017
      • (2017)Energy-Aware Adaptive Restore Schemes for MLC STT-RAM CacheIEEE Transactions on Computers10.1109/TC.2016.262524566:5(786-798)Online publication date: 1-May-2017

      View Options

      View options

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media