Abstract
To narrow the ever-widening processor-memory performance gap in high-end parallel computers, on-chip compressed caches have been developed that reduce the cache miss count and off-chip memory traffic by storing and transferring cache lines in compressed form. However, we observe that their performance gain is often limited because they manage compressed cache lines at a coarse granularity, which leaves internally fragmented space. In this paper, we present a fine-grained compressed cache line management scheme that addresses the fragmentation problem without increasing the size of metadata such as the tag field and the VM page table. Using the SimpleScalar simulator with the SPEC benchmark suite, we show that, compared with an existing compressed cache system, the proposed cache organization reduces memory traffic by 15%, because it delivers compressed cache lines in a fine-grained way, and reduces the cache miss count by 23%, because it stores up to three compressed cache lines in one physical cache line.
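The internal fragmentation mentioned above can be illustrated with a small calculation. The following is only a sketch under assumed numbers: the 32-byte line size, the 16-byte and 4-byte allocation granules, and the compressed sizes are all hypothetical and are not taken from the paper, and the helper names are invented for illustration. It shows how rounding each compressed line up to a coarser allocation unit wastes more space than rounding up to a finer one.

```python
def allocated_bytes(compressed_size: int, granule: int) -> int:
    """Round a compressed line up to a whole number of allocation granules."""
    # Integer ceiling division avoids floating-point rounding.
    return -(-compressed_size // granule) * granule


def internal_fragmentation(sizes, granule: int) -> int:
    """Total bytes that are allocated but hold no compressed data."""
    return sum(allocated_bytes(s, granule) - s for s in sizes)


# All values below are hypothetical, chosen only to illustrate the effect.
LINE_SIZE = 32                        # assumed uncompressed line size in bytes
compressed_sizes = [9, 14, 20, 27]    # assumed compressed sizes in bytes

coarse_waste = internal_fragmentation(compressed_sizes, LINE_SIZE // 2)  # 16-byte granules
fine_waste = internal_fragmentation(compressed_sizes, LINE_SIZE // 8)    # 4-byte granules

print(f"coarse-grained waste: {coarse_waste} bytes")
print(f"fine-grained waste:   {fine_waste} bytes")
```

With these assumed inputs, the coarse-grained layout wastes 26 bytes and the fine-grained layout wastes 6 bytes. The actual granularity and savings of the proposed organization are reported in the paper itself, not derived from this sketch.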
This research was supported by the University IT Research Center project in Korea.
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yim, K.S., Lee, JS., Kim, J., Kim, SD., Koh, K. (2004). A Space-Efficient On-Chip Compressed Cache Organization for High Performance Computing. In: Cao, J., Yang, L.T., Guo, M., Lau, F. (eds) Parallel and Distributed Processing and Applications. ISPA 2004. Lecture Notes in Computer Science, vol 3358. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30566-8_109
DOI: https://doi.org/10.1007/978-3-540-30566-8_109
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24128-7
Online ISBN: 978-3-540-30566-8
eBook Packages: Computer Science, Computer Science (R0)