Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1654059.1654102acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Future scaling of processor-memory interfaces

Published: 14 November 2009 Publication History

Abstract

Continuous evolution in process technology brings energy-efficiency and reliability challenges, which are harder for memory system designs since chip multiprocessors demand high bandwidth and capacity, global wires improve slowly, and more cells are susceptible to hard and soft errors. Recently, there are proposals aiming at better main-memory energy efficiency by dividing a memory rank into subsets.
We holistically assess the effectiveness of rank subsetting in the context of system-wide performance, energy-efficiency, and reliability perspectives. We identify the impact of rank subsetting on memory power and processor performance analytically, then verify the analyses by simulating a chipmultiprocessor system using multithreaded and consolidated workloads. We extend the design of Multicore DIMM, one proposal embodying rank subsetting, for high-reliability systems and show that compared with conventional chipkill approaches, it can lead to much higher system-level energy efficiency and performance at the cost of additional DRAM devices.

References

[1]
"Calculating Memory System Power for DDR3," Micron, Tech. Rep. TN-41-01, 2007.
[2]
J. Ahn, J. Leverich, R. S. Schreiber, and N. P. Jouppi, "Multicore DIMM: an Energy Efficient Memory Module with Independently Controlled DRAMs," Computer Architecture Letters, vol. 7, no. 1, 2008.
[3]
AMD, BIOS and Kernel Developer's Guide for AMD NPT Family 0Fh Processors, Jul 2007, http://www.amd.com/us-en/assets/content_type/white_papers_and_tech_docs/32559.pdf.
[4]
L. A. Barroso, "The price of performance," Queue, vol. 3, no. 7, pp. 48--53, 2005.
[5]
C. Bienia, S. Kumar, J. P. Singh, and K. Li, "The PARSEC Benchmark Suite: Characterization and Architectural Implications," in PACT, Oct 2008.
[6]
T. J. Dell, "A White Paper on the Benefits of Chipkill-Correct ECC for PC Server Main Memory," IBM Microelectronics Division, Nov. 1997.
[7]
J. M. Frailong, W. Jalby, and J. Lenfant, "XOR-Schemes: A Flexible Data Organization in Parallel Memories," in ICPP, Aug 1985.
[8]
J. Gee, M. D. Hill, D. N. Pnevmatikatos, and A. J. Smith, "Cache Performance of the SPEC92 Benchmark Suite," IEEE Micro, vol. 13, 1993.
[9]
M. Ghosh and H.-H. S. Lee, "Smart Refresh: An Enhanced Memory Controller Design for Reducing Energy in Conventional and 3D Die-Stacked DRAMs," in MICRO, Dec 2007.
[10]
J. L. Henning, "Performance counters and development of SPEC CPU2006," Computer Architecture News, vol. 35, no. 1, 2007.
[11]
R. Ho, K. Mai, and M. A. Horowitz, "The Future of Wires," Proceedings of the IEEE, vol. 89, no. 4, 2001.
[12]
H. Huang, P. Pillai, and K. G. Shin, "Design and Implementation of Power-Aware Virtual Memory," in USENIX, Jun 2003.
[13]
I. Hur and C. Lin, "A Comprehensive Approach to DRAM Power Management," in HPCA, Feb 2008.
[14]
Intel, "Intel® Core#8482; i7 Processor, http://www.intel.com/products/processor/corei7/."
[15]
B. Jacob, S. W. Ng, and D. T. Wang, Memory Systems: Cache, DRAM, Disk. Morgan Kaufmann, 2007.
[16]
P. Kongetira, K. Aingaran, and K. Olukotun, "Niagara: A 32-Way Multithreaded Sparc Processor," IEEE Micro, vol. 25, no. 2, 2005.
[17]
A. R. Lebeck, X. Fan, H. Zeng, and C. Ellis, "Power Aware Page Allocation," in ASPLOS, Nov 2000.
[18]
A. Leon, B. Langley, and J. L. Shin, "The UltraSPARC T1 Processor: CMT Reliability," CICC, Sep 2006.
[19]
C.-K. Luk, et al., "Pin: Building Customized Program Analysis Tools with Dynamic Instrumentation," in PLDI, Jun 2005.
[20]
M. M. K. Martin, et al., "Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset," SIGARCH Computer Architecture News, vol. 33, no. 4, 2005.
[21]
B. K. Mathew, S. A. Mckee, J. B. Carter, and A. Davis, "Design of a Parallel Vector Access Unit for SDRAM Memory Systems," in HPCA, Jan 2000.
[22]
H. McGhan, "SPEC CPU2006 Benchmark Suite," in Microprocessor Report, Oct 2006.
[23]
Micron Technology Inc., DDR2 SDRAM Datasheet, 2008, http://www.micron.com/products/dram/ddr2/.
[24]
Micron Technology Inc., DDR3 SDRAM Datasheet, 2008, http://www.micron.com/products/dram/ddr3/.
[25]
Micron Technology Inc., RLDRAM Datasheet, 2008, http://www.micron.com/products/dram/rldram/.
[26]
O. Mutlu and T. Moscibroda, "Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors," in MICRO, Dec 2007.
[27]
K. J. Nesbit, N. Aggarwal, J. Laudon, and J. E. Smith, "Fair Queuing Memory Systems," in MICRO, Dec 2006.
[28]
H. Pan, K. Asanović, R. Cohn, and C.-K. Luk, "Controlling Program Execution through Binary Instrumentation," SIGARCH Computer Architecture News, vol. 33, no. 5, 2005.
[29]
W. W. Peterson and J. E. J. Weldon, Error-Correctin Codes, 2nd ed. MIT Press, 1972.
[30]
Rambus, "RDRAM, http://www.rambus.com," 1999.
[31]
S. H. Reiger, "Codes for the Correction of "Clustered" Errors," in IRE Transactions on Information Theory, 1960.
[32]
S. Rixner, W. J. Dally, U. J. Kapasi, P. R. Mattson, and J. D. Owens, "Memory Access Scheduling," in ISCA, Jun 2000.
[33]
T. Sherwood, E. Perelman, G. Hamerly, and B. Calder, "Automatically Characterizing Large Scale Program Behavior," in ASPLOS, Oct 2002.
[34]
S. Thoziyoor, J. Ahn, M. Monchiero, J. B. Brockman, and N. P. Jouppi, "A Comprehensive Memory Modeling Tool and its Application to the Design and Analysis of Future Memory Hierarchies," in ISCA, Jun 2008.
[35]
F. A. Ware and C. Hampel, "Improving Power and Data Efficiency with Threaded Memory Modules," in ICCD, Oct 2006.
[36]
S. C. Woo, M. Ohara, E. Torrie, J. P. Singh, and A. Gupta, "The SPLASH-2 Programs: Characterization and Methodological Considerations," in ISCA, Jun 1995.
[37]
H. Zheng, J. Lin, Z. Zhang, E. Gorbatov, H. David, and Z. Zhu, "Mini-Rank: Adaptive DRAM Architecture for Improving Memory Power Efficiency," in Micro, Nov 2008.

Cited By

View all
  • (2023)How to Kill the Second Bird with One ECC: The Pursuit of Row Hammer Resilient DRAMProceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/3613424.3623777(986-1001)Online publication date: 28-Oct-2023
  • (2020)Dynamic Colocation Policies with Reinforcement LearningACM Transactions on Architecture and Code Optimization10.1145/337571417:1(1-25)Online publication date: 4-Mar-2020
  • (2019)Innovations in the Memory SystemSynthesis Lectures on Computer Architecture10.2200/S00933ED1V01Y201906CAC04814:2(1-151)Online publication date: 10-Sep-2019
  • Show More Cited By

Index Terms

  1. Future scaling of processor-memory interfaces

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
        November 2009
        778 pages
        ISBN:9781605587448
        DOI:10.1145/1654059
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 14 November 2009

        Permissions

        Request permissions for this article.

        Check for updates

        Qualifiers

        • Research-article

        Conference

        SC '09
        Sponsor:

        Acceptance Rates

        SC '09 Paper Acceptance Rate 59 of 261 submissions, 23%;
        Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

        Upcoming Conference

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)32
        • Downloads (Last 6 weeks)7
        Reflects downloads up to 22 Feb 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)How to Kill the Second Bird with One ECC: The Pursuit of Row Hammer Resilient DRAMProceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/3613424.3623777(986-1001)Online publication date: 28-Oct-2023
        • (2020)Dynamic Colocation Policies with Reinforcement LearningACM Transactions on Architecture and Code Optimization10.1145/337571417:1(1-25)Online publication date: 4-Mar-2020
        • (2019)Innovations in the Memory SystemSynthesis Lectures on Computer Architecture10.2200/S00933ED1V01Y201906CAC04814:2(1-151)Online publication date: 10-Sep-2019
        • (2019)Demystifying Complex Workload-DRAM InteractionsProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/33667083:3(1-50)Online publication date: 17-Dec-2019
        • (2019)Exploiting Correcting Codes: On the Effectiveness of ECC Memory Against Rowhammer Attacks2019 IEEE Symposium on Security and Privacy (SP)10.1109/SP.2019.00089(55-71)Online publication date: May-2019
        • (2019)A Vulnerability Factor for ECC-protected Memory2019 IEEE 25th International Symposium on On-Line Testing and Robust System Design (IOLTS)10.1109/IOLTS.2019.8854397(176-181)Online publication date: Jul-2019
        • (2019)Enabling Transparent Memory-Compression for Commodity Memory Systems2019 IEEE International Symposium on High Performance Computer Architecture (HPCA)10.1109/HPCA.2019.00010(570-581)Online publication date: Feb-2019
        • (2019)Quantifying the Impact of Memory Errors in Deep Learning2019 IEEE International Conference on Cluster Computing (CLUSTER)10.1109/CLUSTER.2019.8890989(1-12)Online publication date: Sep-2019
        • (2018)Dynamic fine-grained sparse memory accessesProceedings of the International Symposium on Memory Systems10.1145/3240302.3240416(85-97)Online publication date: 1-Oct-2018
        • (2018)An MLP-aware leakage-free memory controllerProceedings of the 7th International Workshop on Hardware and Architectural Support for Security and Privacy10.1145/3214292.3214296(1-7)Online publication date: 2-Jun-2018
        • Show More Cited By

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media