research-article

Open access

Chameleon: Virtualizing idle acceleration cores of a heterogeneous multicore processor for caching and prefetching

Authors: Dong Hyuk Woo, Joshua B. Fryman, Allan D. Knies, Hsien-Hsin S. Lee

ACM Transactions on Architecture and Code Optimization (TACO), Volume 7, Issue 1

Article No.: 3, Pages 1 - 35

https://doi.org/10.1145/1736065.1736068

Published: 07 May 2010


Abstract

Heterogeneous multicore processors have emerged as an energy- and area-efficient architectural solution for improving the performance of domain-specific applications, such as those with abundant data-level parallelism. These processors typically contain a large number of small, compute-centric cores for acceleration while keeping one or two high-performance ILP cores on the die to guarantee single-thread performance. Although the acceleration cores occupy a major portion of the transistors, these resources sit idle when running unparallelized legacy code or the sequential part of an application. To address this underutilization, in this article we introduce Chameleon, a flexible heterogeneous multicore architecture that virtualizes these resources to enhance memory performance when running sequential programs. The Chameleon architecture can dynamically virtualize the idle acceleration cores into a last-level cache, a data prefetcher, or a hybrid of the two. In addition, Chameleon can operate in an adaptive mode that dynamically switches the acceleration cores between the hybrid mode and the prefetch-only mode by monitoring the effectiveness of the Chameleon cache mode. In our evaluation with the SPEC2006 benchmark suite, the performance improvement varied by mode and by application. In the adaptive mode, Chameleon improves the performance of SPECint06 and SPECfp06 by 31% and 15% on average, respectively. When considering only memory-intensive applications, Chameleon improves system performance by 50% and 26% for SPECint06 and SPECfp06, respectively.
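The adaptive mode described in the abstract can be pictured as a small per-epoch decision: measure how useful the virtualized cache has been, then choose between the hybrid mode and the prefetch-only mode. The sketch below is only an illustration of that idea. The abstract does not say which metric, threshold, or sampling interval Chameleon uses, so the hit-rate metric, the `EpochStats` counters, the `next_mode` function, and the `min_hit_rate` value are all assumptions made for this example.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    HYBRID = "hybrid"                # idle cores serve as cache and prefetcher
    PREFETCH_ONLY = "prefetch-only"  # idle cores serve as prefetcher only


@dataclass
class EpochStats:
    """Hypothetical counters sampled over one monitoring epoch."""
    virtual_cache_hits: int
    virtual_cache_accesses: int


def next_mode(stats: EpochStats, min_hit_rate: float = 0.10) -> Mode:
    """Choose the mode for the next epoch from cache-mode effectiveness.

    min_hit_rate is an illustrative threshold, not a value from the article.
    """
    if stats.virtual_cache_accesses == 0:
        # Assumption: with no evidence that caching helps, fall back to
        # prefetch-only.
        return Mode.PREFETCH_ONLY
    hit_rate = stats.virtual_cache_hits / stats.virtual_cache_accesses
    # Keep the cache portion only while it is judged effective.
    return Mode.HYBRID if hit_rate >= min_hit_rate else Mode.PREFETCH_ONLY
```

Under these assumptions, a call such as `next_mode(EpochStats(virtual_cache_hits=50, virtual_cache_accesses=100))` selects the hybrid mode, while an epoch with very few hits selects the prefetch-only mode. A real design would likely add hysteresis so the cores do not flip modes on every epoch.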

