Article

Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors

Authors:

Onur Mutlu,

Thomas Moscibroda

MICRO 40: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture

Pages 146 - 160

https://doi.org/10.1109/MICRO.2007.40

Published: 01 December 2007

Abstract

DRAM memory is a major resource shared among cores in a chip multiprocessor (CMP) system. Memory requests from different threads can interfere with each other. Existing memory access scheduling techniques try to optimize the overall data throughput obtained from the DRAM and thus do not take into account inter-thread interference. Therefore, different threads running together on the same chip can experience extremely different memory system performance: one thread can experience a severe slowdown or starvation while another is unfairly prioritized by the memory scheduler. This paper proposes a new memory access scheduler, called the Stall-Time Fair Memory scheduler (STFM), that provides quality of service to different threads sharing the DRAM memory system. The goal of the proposed scheduler is to "equalize" the DRAM-related slowdown experienced by each thread due to interference from other threads, without hurting overall system performance. As such, STFM takes into account inherent memory characteristics of each thread and does not unfairly penalize threads that use the DRAM system without interfering with other threads. We show that STFM significantly reduces the unfairness in the DRAM system while also improving system throughput (i.e., weighted speedup of threads) on a wide variety of workloads and systems. For example, averaged over 32 different workloads running on an 8-core CMP, the ratio between the highest DRAM-related slowdown and the lowest DRAM-related slowdown reduces from 5.26X to 1.4X, while the average system throughput improves by 7.6%. We qualitatively and quantitatively compare STFM to one new and three previously proposed memory access scheduling algorithms, including network fair queueing. Our results show that STFM provides the best fairness, system throughput, and scalability.
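The two quantities the abstract reports can be illustrated with a short Python sketch. It is not the authors' implementation. The abstract states only that unfairness is the ratio between the highest and the lowest DRAM-related slowdown and that throughput is measured as weighted speedup. The sketch assumes that a thread's slowdown is its memory stall time when sharing the DRAM divided by its stall time when running alone, and it uses the common definition of weighted speedup as the sum of each thread's shared IPC over its alone IPC. All function and parameter names are hypothetical.

```python
from typing import List, Sequence


def slowdowns(stall_shared: Sequence[float],
              stall_alone: Sequence[float]) -> List[float]:
    """Per-thread DRAM-related slowdown.

    Assumption (not spelled out in the abstract): slowdown is the memory
    stall time when sharing DRAM divided by the stall time when alone.
    """
    if len(stall_shared) != len(stall_alone):
        raise ValueError("need one shared and one alone value per thread")
    return [shared / alone for shared, alone in zip(stall_shared, stall_alone)]


def unfairness(slowdown_values: Sequence[float]) -> float:
    """Ratio between the highest and the lowest slowdown (1.0 is ideal)."""
    return max(slowdown_values) / min(slowdown_values)


def weighted_speedup(ipc_shared: Sequence[float],
                     ipc_alone: Sequence[float]) -> float:
    """Sum over threads of IPC when sharing divided by IPC when alone."""
    if len(ipc_shared) != len(ipc_alone):
        raise ValueError("need one shared and one alone value per thread")
    return sum(shared / alone for shared, alone in zip(ipc_shared, ipc_alone))


if __name__ == "__main__":
    # Two hypothetical threads: one stalls 2x longer when sharing, one 3x.
    s = slowdowns([200.0, 300.0], [100.0, 100.0])
    print(s, unfairness(s))
    print(weighted_speedup([0.5, 1.0], [1.0, 2.0]))
```

Under these assumptions, a scheduler that perfectly equalizes slowdowns drives the unfairness value toward 1.0, which is the direction of the reported change from 5.26X to 1.4X.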

Index Terms

Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors
1. Hardware
  1. Hardware validation
  2. Integrated circuits
    1. Semiconductor memory
      1. Dynamic memory
2. Theory of computation
  1. Design and analysis of algorithms
    1. Approximation algorithms analysis
      1. Scheduling algorithms
    2. Online algorithms
      1. Online learning algorithms
        1. Scheduling algorithms
  2. Theory and algorithms for application domains
    1. Machine learning theory
      1. Reinforcement learning
        1. Sequential decision making

Published In

MICRO 40: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture

December 2007

435 pages

ISBN:0769530478

Publisher

IEEE Computer Society

United States

Qualifiers

Article

Conference

Micro-40

Sponsor:

SIGMICRO

Micro-40: The 40th Annual IEEE/ACM International Symposium on Microarchitecture

December 1 - 5, 2007

Acceptance Rates

MICRO 40 Paper Acceptance Rate 35 of 166 submissions, 21%;

Overall Acceptance Rate 484 of 2,242 submissions, 22%

Bibliometrics

Article Metrics

Total Citations: 204
Total Downloads: 1,285
Downloads (Last 12 months): 3
Downloads (Last 6 weeks): 0

Reflects downloads up to 30 Aug 2024
