Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1621960.1621966acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmedeaConference Proceedingsconference-collections
research-article

Performance balancing: software-based on-chip memory management for effective CMP executions

Published: 13 September 2009 Publication History

Abstract

This paper proposes the concept of performance balancing, and reports its performance impact on a Chip multiprocessor (CMP). Integrating multiple processor cores into a single chip, or CMPs, can achieve higher peak performance by means of exploiting thread level parallelism. However, the off-chip memory bandwidth which does not scale with the number of cores tends to limit the potential of CMPs. To solve this issue, the technique proposed in this paper attempts to make a good balance between computation and memorization. Unlike conventional parallel executions, this approach exploits some cores to improve the memory performance. These cores devote the on-chip memory hardware resources to the remaining cores executing the parallelized threads. In our evaluation, it is observed that our approach can achieve 31% of performance improvement compared to a conventional parallel execution model in the specified program.

References

[1]
O. Avissar, R. Barua, and D. Stewart, An Optimal Memory Allocation Scheme for Scratchpad-based Embedded Systems. ACM Transactions on Embedded Computing Systems, pp. 6--26, 2002.
[2]
N. L. Binkert, E. G. Hallnor and S. K. Reinhardt. Network Oriented Full-System Simulation using M5. 6th Workshop on Computer Architecture Evaluation using Commercial Workloads, 2003.
[3]
J. F. Cantin, M. H. Lipasti, and J. E. Smith. Stealth prefetching. Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 274--282, 2006.
[4]
J. Chang and G. S. Sohi. Cooperative Caching for Chip Multiprocessors. The 33rd Intl. Symposium on Computer Architecture, pp 264--276, 2006.
[5]
T. Chen, R. Raghavan, J. N. Dale, and E. Iwata. Cell Broadband Engine Architecture and its first implementation performance view. IBM Journal of Research and Development, 51(5): pp. 559--572, 2007.
[6]
M. Curtis-Maury, K. Singh, S. A. McKee, F. Blagojevic, D. S. Nikolopoulos, B. R. de Supinski, M. Schulz. Identifying Energy-Efficient Concurrency Levels using Machine Learning. The Intl. Workshop on Green Computing, pp. 488--495, 2007.
[7]
I. Ganusov and M. Burtscher. Efficient Emulation of Hardware Prefetchers via Event-Driven Helper Threading. The 15th Intl. Conference on Parallel Architectures and Compilation Techniques, pp. 144--153, 2006.
[8]
M. R. Guthaus, J. S. Ringenberg, D. Ernst, T. M. Austin, T. Mudge and R. B. Brown. MiBench: A Free, Commercially Representative Embedded Benchmark suite. The IEEE 4th Annual Workshop on Workload Characterization, pp. 3--14, 2001.
[9]
J. A. Kahle, M. N. Day, H. P. Hofstee, C. R. Johns, T. R. Maeurer and D. Shippy. Introduction to the Cell multiprocessor. IBM journal of Research and Development, 49(4--5), 2005.
[10]
M. Kistler, M. Perrone, F. Petrini. Cell Multiprocessor Communication Network: Built for Speed. Micro, IEEE, 26(3), pp. 10--23, 2006.
[11]
S. Steinke, L. Wehmeyer, B. S. Lee, and P. Marwedel. Assigning Program and Data Objects to Scratchpad for Energy Reduction. Proceedings of the conference on Design, automation and test in Europe, pp. 409--415, 2002
[12]
M. A. Suleman, M. K. Qureshi and Y. N. Patt. Feedback-Driven Threading: Power-Efficient and High-Performance Execution of Multi-threaded Workloads on CMPs. 13th Intl. Conference on Architectural Support for Programming Languages and Operating Systems, pp. 277--286, 2008
[13]
S. Udayakumaran, A. Dominguez, and R. Barua, Dynamic allocation for scratchpad memory using compile-time decisions. ACM Transactions on Embedded Computing Systems, pp. 472--511, 2006.
[14]
S. C. Woo, M. Ohara, E. Torrie, J. P. Singh, and A. Gupta. The SPLASH-2 programs: Characterization and methodological considerations. The Intl. Symposium on Computer Architecture, pp. 24--36, 1995.
[15]
Himeno Benchmark: http://accc.riken.jp/HPC/HimenoBMT/index_e.html

Cited By

View all
  • (2011)Scratchpad memory based power efficient optimization for MPSoC2011 International Conference on Electronics, Communications and Control (ICECC)10.1109/ICECC.2011.6067865(455-458)Online publication date: Sep-2011

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
MEDEA '09: Proceedings of the 10th workshop on MEmory performance: DEaling with Applications, systems and architecture
September 2009
48 pages
ISBN:9781605588308
DOI:10.1145/1621960
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 September 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. chip multiprocessors
  2. memory-wall problem
  3. parallel execution
  4. scratchpad memory

Qualifiers

  • Research-article

Funding Sources

Conference

MEDEA '09

Acceptance Rates

Overall Acceptance Rate 6 of 9 submissions, 67%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 17 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2011)Scratchpad memory based power efficient optimization for MPSoC2011 International Conference on Electronics, Communications and Control (ICECC)10.1109/ICECC.2011.6067865(455-458)Online publication date: Sep-2011

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media