Dynamic Capacity-Speed Tradeoffs in SMT Processor Caches

López, Sonia; Dropsho, Steve; Albonesi, David H.; Garnica, Oscar; Lanchares, Juan

doi:10.1007/978-3-540-69338-3_10

Sonia López¹,
Steve Dropsho²,
David H. Albonesi³,
Oscar Garnica¹ &
…
Juan Lanchares¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4367))

Included in the following conference series:

International Conference on High-Performance Embedded Architectures and Compilers

411 Accesses
6 Citations

Abstract

Caches are designed to provide the best tradeoff between access speed and capacity for a set of target applications. Unfortunately, different applications, and even different phases within the same application, may require a different capacity-speed tradeoff. This problem is exacerbated in a Simultaneous Multi-Threaded (SMT) processor where the optimal cache design may vary drastically with the number of running threads and their characteristics.

We propose to make this capacity-speed cache tradeoff dynamic within an SMT core. We extend a previously proposed globally asynchronous, locally synchronous (GALS) processor core with multi-threaded support, and implement dynamically resizable instruction and data caches. As the number of threads and their characteristics change, these adaptive caches automatically adjust from small sizes with fast access times to higher capacity configurations. While the former is more performance-optimal when the core runs a single thread, or a dual-thread workload with modest cache requirements, higher capacity caches work best with most multiple thread workloads. The use of a GALS microarchitecture permits the rest of the processor, namely the execution core, to run at full speed irrespective of the cache speeds. This approach yields an overall performance improvement of 24.7% over the best fixed-size caches for dual-thread workloads, and 19.2% for single-threaded applications.

This research was supported in part by Spanish Government Grant TIN2005-05619, National Science Foundation Grant CCF-0304574, an IBM Faculty Partnership Award, a grant from the Intel Research Council, and by equipment grants from Intel and IBM.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Adaptive Cache Structures

Performance Modelling and Dynamic Scheduling on Heterogeneous-ISA Multi-core Architectures

Addressing isolation challenges of non-blocking caches for multicore real-time systems

Article 25 May 2017

Author information

Authors and Affiliations

Departamento de Arquitectura de Computadores y Automatica, U. Complutense de Madrid, Spain
Sonia López, Oscar Garnica & Juan Lanchares
School of Computer and Communication Science, EPFL, Switzerland
Steve Dropsho
Computer Systems Laboratory, Cornell University, USA
David H. Albonesi

Authors

Sonia López
View author publications
You can also search for this author in PubMed Google Scholar
Steve Dropsho
View author publications
You can also search for this author in PubMed Google Scholar
David H. Albonesi
View author publications
You can also search for this author in PubMed Google Scholar
Oscar Garnica
View author publications
You can also search for this author in PubMed Google Scholar
Juan Lanchares
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Koen De Bosschere David Kaeli Per Stenström David Whalley Theo Ungerer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

López, S., Dropsho, S., Albonesi, D.H., Garnica, O., Lanchares, J. (2007). Dynamic Capacity-Speed Tradeoffs in SMT Processor Caches. In: De Bosschere, K., Kaeli, D., Stenström, P., Whalley, D., Ungerer, T. (eds) High Performance Embedded Architectures and Compilers. HiPEAC 2007. Lecture Notes in Computer Science, vol 4367. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69338-3_10

Download citation

DOI: https://doi.org/10.1007/978-3-540-69338-3_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69337-6
Online ISBN: 978-3-540-69338-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Dynamic Capacity-Speed Tradeoffs in SMT Processor Caches

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Adaptive Cache Structures

Performance Modelling and Dynamic Scheduling on Heterogeneous-ISA Multi-core Architectures

Addressing isolation challenges of non-blocking caches for multicore real-time systems

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Dynamic Capacity-Speed Tradeoffs in SMT Processor Caches

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Adaptive Cache Structures

Performance Modelling and Dynamic Scheduling on Heterogeneous-ISA Multi-core Architectures

Addressing isolation challenges of non-blocking caches for multicore real-time systems

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation