Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
opinion

Preserving Virtual Memory by Mitigating the Address Translation Wall

Published: 01 September 2017 Publication History

Abstract

The concept of virtual memory is one of the classic computer science abstractions and has long enabled systems that are easy to program and that operate quickly. Today, however, virtual memory faces challenges that pose a threat to the continued progress of computing. This article discusses these threats and presents some promising ways to counter them.

References

[1]
A. Basu et al., “Efficient Virtual Memory for Big Memory Servers,” in Proc. 40th Ann. Int'l Symp. Computer Architecture (ISCA 13), 2013, pp. 237–248.
[2]
B. Pham et al., “CoLT: Coalesced Large-Reach TLBs,” in Proc. 45th Ann. IEEE/ACM Int'l Symp. Microarchitecture (MICRO), 2012. https://doi.org/10.1109/MICRO.2012.32.
[3]
J.H. Ryoo et al., “Rethinking TLB Designs in Virtualized Environments: A Very Large Part-of-Memory TLB,” in Proc. 44th Ann. Int'l Symp. Computer Architecture (ISCA 17), 2017, pp. 469–480.
[4]
M.-M. Papadopoulou et al., “Prediction-Based Superpage-Friendly TLB Designs,” Proc. IEEE 21st Int'l Symp. High Performance Computer Architecture (HPCA), 2015. https://doi.org/10.1109/HPCA.2015.7056034.
[5]
A. Basu, M. Hill, and M. Swift, “Reducing Memory Reference Energy with Opportunistic Virtual Caching,” in Proc. 39th Ann. Int'l Symp. Computer Architecture (ISCA 12), 2012, pp. 297–308.
[6]
V. Karakostas et al., “Energy-Efficient Address Translation,” in Proc. Int'l Symp. High Performance Computer Architecture (HPCA), 2016. https://doi.org/10.1109/HPCA.2016.7446100.
[7]
Z. Yan et al., “Hardware Translation Coherence for Virtualized Systems,” in Proc. 44th Ann. Int'l Symp. Computer Architecture (ISCA 17), 2017, pp. 430–443.
[8]
M. Oskin and G.H. Loh, “A Software-Managed Approach to Die-Stacked DRAM,” Proc. Int'l Conf. Parallel Architecture and Compilation (PACT), 2015, pp. 188–200.
[9]
N. Amit, “Optimizing the TLB Shootdown Algorithm with Page Access Tracking,” in Proc. USENIX Ann. Conf., 2017, pp. 27–39.
[10]
A. Bhattacharjee, “Large-Reach Memory Management Unit Caches,” in Proc. 46th Ann. IEEE/ACM Int'l Symp. Microarchitecture (MICRO), 2013, pp. 383–394.
[11]
T.W. Barr, A.L. Cox, and S. Rixner, “Translation Caching: Skip, Don't Walk (the Page Table),” in Proc. 37th Ann. Int'l Symp. Computer Architecture, 2010, pp. 48–59.
[12]
R. Bhargava et al., “Accelerating Two-Dimensional Page Walks for Virtualized Systems,” in Proc. 13th Int'l Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2008, pp. 26–35.
[13]
B. Pichai, L. Hsu, and A. Bhattacharjee, “Architectural Support for Address Translation on GPUs,” in Proc. 19th Int'l Conf. Architectural Support for Programming Languages and Operating Systems, 2014, pp. 743–758.
[14]
J. Power, M. Hill, and D. Wood, “Supporting x86-64 Address Translation for 100s of GPU Lanes,” in Proc. IEEE 20th Int'l Symp. High Performance Computer Architecture (HPCA), 2014; https://doi.org/10.1109/HPCA.2014.6835965.
[15]
J. Vesely et al., “Observations and Opportunities in Architecting Shared Virtual Memory for Heterogeneous Systems,” in Proc. IEEE Int'l Symp. Performance Analysis of Systems and Software (ISPASS), 2016. https://doi.org/10.1109/ISPASS.2016.7482091.
[16]
ETH Mining: Lower VRAM GPUs to be Rendered Unprofitable in Time,” Tech Power Up, blog, 19 June 2017; www.techpowerup.com/234482/eth-mining-lower-vram-gpus-to-be-rendered-unprofitable-in-time.
[17]
Ethereum Hashrate Drop for Radeon RX400/RX500 GPUs Is Incoming,” Crypto Mining Blog, 18 June 2017; http://cryptomining-blog.com/8822-ethereum-hashrate-drop-for-radeon-rx400rx500-gpus-is-incoming.
[18]
I. King, “Chipmakers Nvidia, AMD Ride Cryptocurrency Wave—for Now,” SlashDot, blog, 17 July 2017; www.bloomberg.com/news/articles/2017-07-17/chipmakers-nvidia-amd-ride-cryptocurrency-wave-for-now.
[19]
Y. Kwon et al., “Coordinated and Efficient Huge Page Management with Ingens,” in Proc. 12th USENIX Conf. Operating Systems Design and Implementation (OSDI 16), 2016, pp. 705–721.
[20]
Y. Du et al., “SupportingSuperpages in Non-Contiguous Physical Memory,” in Proc. IEEE 21st Int'l Symp. High Performance Computer Architecture (HPCA), 2015. https://doi.org/10.1109/HPCA.2015.7056035.
[21]
V. Karakostas et al., “Redundant Memory Mappings for Fast Access to Large Memories,” Proc. 42nd Ann. Int'l Symp. Computer Architecture (ISCA 15), 2015, pp. 66–78.
[22]
H. Alam et al., “Do It Yourself Virtual Memory Translation,” in Proc. 44th Ann. Int'l Symp. Computer Architecture (ISCA 17), 2017, pp. 457–468.
[23]
J. Woodruff et al., “The CHERI Capability Model: Revisiting RISC in an Age of Risk,” in Proc. 41st Ann. Int'l Symp. Computer Architecture (ISCA 14), 2014, pp. 457–468.
[24]
B. Pham et al., “Increasing TLB Reach by Exploiting Clustering in Page Translations,” in Proc. IEEE 21st Int'l Symp. High Performance Computer Architecture (HPCA), 2014. https://doi.org/10.1109/HPCA.2014.6835964.
[25]
G. Cox and A. Bhattacharjee, “Efficient Address Translation for Architectures with Multiple Page Sizes,” in Proc. 22nd Int'l Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2017, pp. 435–448.
[26]
C.H. Park et al., “Hybrid TLB Coalescing: Improving TLB Translation Coverage Under Diverse Fragmented Memory Allocation,” in Proc. 44th Ann. Int'l Symp. Computer Architecture (ISCA 17), 2017, pp. 444–456.
[27]
B. Pham et al., “Large Pages and Lightweight Memory Management in Virtualized Environments: Can You Have It Both Ways?,” Proc. 48th Ann. IEEE/ACM Int'l Symp. Microarchitecture (MICRO), 2015. https://doi.org/10.1145/2830772.2830773.
[28]
T.W. Barr, A.L. Cox, and S. Rixner, “SpecTLB: A Mechanism for Speculative Address Translation,” in Proc. 38th Ann. Int'l Symp. Computer Architecture (ISCA 11), 2011, pp. 307–318.
[29]
A. Bhattacharjee, “Translation-Triggered Prefetching,” in Proc. 22nd Int'l Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2017, pp. 63–76.
[30]
B.F. Romanescu et al., “Unified Instruction/Translation/Data (UNITD) Coherence: One Protocol to Rule Them All,” in Proc. IEEE 21st Int'l Symp. High Performance Computer Architecture (HPCA), 2010. https://doi.org/10.1109/HPCA.2010.5416643.
[31]
A. Awad et al., “Avoiding TLB Shootdown with Self-Invalidating TLB Entries,” to be published in Proc. Int'l Conf. Parallel Architectures and Compilation Techniques (PACT), 2017.
[32]
C. Villavieja et al., “DiDi: Mitigating the Performance Impact of TLB Shootdowns Using a Shared TLB Directory,” in Proc. Int'l Conf. Parallel Architectures and Compilation Techniques (PACT), 2011. https://doi.org/10.1109/PACT.2011.65.
[33]
D. Lustig et al., “COATCheck: Verifying Memory Ordering at the Hardware-OS Interface,” in Proc. 21st Int'l Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2016, pp. 233–247.
[34]
B.F. Romanescu, A.R. Lebeck, and D.J. Sorin, “Specifying and Dynamically Verifying Address Translation-Aware Memory Consistency,” in Proc. 15th Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2010, pp. 323–334.
[35]
J. Picorel, D. Jevdjic, and B. Falsafi, “Near-Memory Address Translation,” to be published in Proc. 26th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT), 2017.

Cited By

View all
  • (2023)Towards High Performance and Efficient Memory Deduplication via Mixed PagesIEEE Transactions on Computers10.1109/TC.2022.319174272:4(926-940)Online publication date: 1-Apr-2023
  • (2022)CARAT CAKE: replacing paging via compiler/kernel cooperationProceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems10.1145/3503222.3507771(98-114)Online publication date: 28-Feb-2022
  • (2021)Paging and the Address-Translation ProblemProceedings of the 33rd ACM Symposium on Parallelism in Algorithms and Architectures10.1145/3409964.3461814(105-117)Online publication date: 6-Jul-2021
  • Show More Cited By

Index Terms

  1. Preserving Virtual Memory by Mitigating the Address Translation Wall
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image IEEE Micro
        IEEE Micro  Volume 37, Issue 5
        September/October 2017
        68 pages

        Publisher

        IEEE Computer Society Press

        Washington, DC, United States

        Publication History

        Published: 01 September 2017

        Qualifiers

        • Opinion

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 28 Jan 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)Towards High Performance and Efficient Memory Deduplication via Mixed PagesIEEE Transactions on Computers10.1109/TC.2022.319174272:4(926-940)Online publication date: 1-Apr-2023
        • (2022)CARAT CAKE: replacing paging via compiler/kernel cooperationProceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems10.1145/3503222.3507771(98-114)Online publication date: 28-Feb-2022
        • (2021)Paging and the Address-Translation ProblemProceedings of the 33rd ACM Symposium on Parallelism in Algorithms and Architectures10.1145/3409964.3461814(105-117)Online publication date: 6-Jul-2021
        • (2020)ValkyrieProceedings of the ACM International Conference on Parallel Architectures and Compilation Techniques10.1145/3410463.3414639(455-466)Online publication date: 30-Sep-2020
        • (2020)CARAT: a case for virtual memory through compiler- and runtime-based address translationProceedings of the 41st ACM SIGPLAN Conference on Programming Language Design and Implementation10.1145/3385412.3385987(329-345)Online publication date: 11-Jun-2020
        • (2019)Translation rangerProceedings of the 46th International Symposium on Computer Architecture10.1145/3307650.3322223(698-710)Online publication date: 22-Jun-2019

        View Options

        View options

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media