Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3503646.3524297acmconferencesArticle/Chapter ViewAbstractPublication PageseurosysConference Proceedingsconference-collections
research-article

SLRL: a simple least remaining lifetime file evicition policy for HPC multi-tier storage systems

Published: 05 April 2022 Publication History

Abstract

HPC systems are composed of multiple tiers of storage, from the top high performance tier (high speed SSDs) to the bottom capacitive one (tapes). File placement in such architecture is managed through prefetchers (bottom-up) and eviction policies (top-down). Most state-of-the-art work focus on the former while using algorithm flavors of LRU, LFU and FIFO for the latter. LRU was for long considered the best choice. However, recent studies has shown that the simplicity of FIFO could make it more scalable than LRU because of metadata management, and thus more adequate in several cases. In this paper, we propose a new eviction policy based on predicted files lifetimes. It is comparable to FIFO in terms of metadata overhead and simplicity (thus scalability), while giving a hit ratio comparable to LRU (or even 10% better for some tested traces). We also propose a naive multi-tier heterogeneous storage simulator implementation to evaluate such policies.

References

[1]
Aaron Blankstein, Siddhartha Sen, and Michael J. Freedman. 2017. Hyperbolic Caching: Flexible Caching for Web Applications. In Proceedings of the 2017 USENIX Conference on Usenix Annual Technical Conference (Santa Clara, CA, USA) (USENIX ATC '17). USENIX Association, USA, 499--511.
[2]
Allan Borodin, Prabhakar Raghavan, Sandy Irani, and Baruch Schieber. 1991. Competitive Paging with Locality of Reference. In Proceedings of the Twenty-Third Annual ACM Symposium on Theory of Computing (New Orleans, Louisiana, USA) (STOC '91). Association for Computing Machinery, New York, NY, USA, 249--259.
[3]
Djillali Boukhelef, Jalil Boukhobza, Kamel Boukhalfa, Hamza Ouarnoughi, and Laurent Lemarchand. 2019. Optimizing the cost of DBaaS object placement in hybrid storage systems. Future Generation Computer Systems 93 (apr 2019), 176--187.
[4]
Jalil Boukhobza and Pierre Olivier. 2017. Flash Memory Integration (1st ed.). ISTE Press - Elsevier.
[5]
Jalil Boukhobza, Stéphane Rubini, Renhai Chen, and Zili Shao. 2017. Emerging NVM: A Survey on Architectural Integration and Research Challenges. ACM Trans. Des. Autom. Electron. Syst. 23, 2, Article 14 (nov 2017), 32 pages.
[6]
Joan Boyar, Sushmita Gupta, and Kim S. Larsen. 2012. Access Graphs Results for LRU versus FIFO under Relative Worst Order Analysis. arXiv:1204.4047 [cs.DS]
[7]
John S Bucy, Gregory R Ganger, et al. 2003. The DiskSim simulation environment version 3.0 reference manual. School of Computer Science, Carnegie Mellon University.
[8]
Zhiguang Chen, Nong Xiao, Yutong Lu, and Fang Liu. 2016. Me-CLOCK:A Memory-Efficient Framework to Implement Replacement Policies for Large Caches. IEEE Trans. Comput. 65, 8 (2016), 2665--2671.
[9]
Marek Chrobak and John Noga. 1999. LRU is better than FIFO. Algorithmica 23 (02 1999), 180--185.
[10]
Sean Cochrane, Ken Kutzer, and L McIntosh. 2009. Solving the HPC I/O bottleneck: Sun Lustre storage system. Sun BluePrint Online 820 (2009). http://nz11-agh1.ifj.edu.pl/public_users/b14olsze/Lustre.pdf
[11]
Asit Dan and Don Towsley. 1990. An Approximate Analysis of the LRU and FIFO Buffer Replacement Schemes. SIGMETRICS Perform. Eval Rev. 18, 1 (apr 1990), 143--152.
[12]
Gil Einziger and Roy Friedman. 2014. TinyLFU: A Highly Efficient Cache Admission Policy. In 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing. 146--153.
[13]
Ohad Eytan, Danny Harnik, Effi Ofer, Roy Friedman, and Ronen Kat. 2020. It's Time to Revisit LRU vs. FIFO. In 12th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage 20). USENIX Association. https://www.usenix.org/conference/hotstorage20/presentation/eytan
[14]
Ting Gong, Tyler Lee, Cory Stephenson, Venkata Renduchintala, Suchismita Padhy, Anthony Ndirango, Gokce Keskin, and Oguz Elibol. 2019. A Comparison of Loss Weighting Strategies for Multi task Learning in Deep Neural Networks. IEEE Access 7 (2019), 141627--141632.
[15]
Sebastien Gougeaud, Soraya Zertal, Jacques-Charles Lafoucriere, and Philippe Deniel. 2015. OGSSim: Open Generic data Storage systems Simulation tool. In Eighth EAI International Conference on Simulation Tools and Techniques. ACM, Athens, Greece.
[16]
Takahiro Hirofuchi and Ryousei Takano. 2020. A Prompt Report on the Performance of Intel Optane DC Persistent Memory Module. IEICE Transactions on Information and Systems E103.D, 5 (May 2020), 1168--1172. arXiv: 2002.06018.
[17]
Bruce Jacob, Spencer Ng, and David Wang. 2007. Memory Systems: Cache, DRAM, Disk. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA.
[18]
Song Jiang and Xiaodong Zhang. 2002. LIRS: An Efficient Low Inter-Reference Recency Set Replacement Policy to Improve Buffer Cache Performance. In Proceedings of the 2002 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (Marina Del Rey, California) (SIGMETRICS '02). Association for Computing Machinery, New York, NY, USA, 31--42.
[19]
Youngjae Kim, Brendan Tauras, Aayush Gupta, and Bhuvan Urgaonkar. 2009. FlashSim: A Simulator for NAND Flash-Based Solid-State Drives. In 2009 First International Conference on Advances in System Simulation. 125--131.
[20]
Kathy Kincade. 2019. UniviStor: Next-Generation Data Storage for Heterogeneous HPC. Retrieved 2021-02-23 from https://cs.lbl.gov/news-media/news/2019/univistor-a-next-generation-data-storage-tool-for-heterogeneous-hpc-storage/
[21]
S Klasky, Hasan Abbasi, M Ainsworth, Jong Youl Choi, Matthew Curry, T Kurc, Q Liu, Jay Lofstead, Carlos Maltzahn, Manish Parashar, Norbert Podhorszki, Eric Suchyta, F Wang, M Wolf, C.S. Chang, R. Churchill, and Stéphane Ethier. 2016. Exascale Storage Systems the SIRIUS Way. Journal of Physics: Conference Series 759 (Oct. 2016), 012095.
[22]
Thomas Leibovici. 2015. Taking back control of HPC file systems with Robinhood Policy Engine. International Workshop on the Lustre Ecosystem: Challenges and Opportunities (2015). arXiv:1505.01448 http://arxiv.org/abs/1505.01448
[23]
Yonggang Liu, Renato Figueiredo, Yiqi Xu, and Ming Zhao. 2013. On the design and implementation of a simulator for parallel file system research. In 2013 IEEE 29th Symposium on Mass Storage Systems and Technologies (MSST). IEEE, Long Beach, CA, USA, 1--5.
[24]
Glenn K. Lockwood, Wucherl Yoo, Suren Byna, Nicholas J. Wright, Shane Snyder, Kevin Harms, Zachary Nault, and Philip Carns. 2017. UMAMI: A recipe for generating meaningful metrics through holistic I/O performance analysis. In Proceedings of PDSW-DISCS 2017 - 2nd Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems - Held in conjunction with SC 2017: The International Conference for High Performance Computing, Networking, Storage a. 55--60.
[25]
Jakob Lüttgau, Michael Kuhn, Kira Duwe, Yevhen Alforov, Eugen Betke, Julian Kunkel, and Thomas Ludwig. 2018. Survey of Storage Systems for High-Performance Computing. Supercomputing Frontiers and Innovations 5, 1 (April 2018), 31-58-58. Number: 1.
[26]
Nimrod Megiddo and Dharmendra S. Modha. 2003. ARC: A Self-Tuning, Low Overhead Replacement Cache (FAST '03). USENIX Association, USA, 115--130.
[27]
Florent Monjalet and Thomas Leibovici. 2019. Predicting File Lifetimes with Machine Learning. In High Performance Computing, Vol. 11887 LNCS. Springer, 288--299.
[28]
Onur Mutlu. 2013. Memory scaling: A systems architecture perspective. 2013 5th IEEE International Memory Workshop, IMW 2013, 21--25.
[29]
Mohammed Islam Naas, François Trahay, Alexis Colin, Pierre Olivier, Stéphane Rubini, Frank Singhoff, and Jalil Boukhobza. 2021. EZIO-Tracer: Unifying Kernel and User Space I/O Tracing for Data-Intensive Applications. In Proceedings of the Workshop on Challenges and Opportunities of Efficient and Performant Storage Systems (Online Event, United Kingdom) (CHEOPS '21). Association for Computing Machinery, New York, NY, USA, Article 4, 11 pages.
[30]
John K Ousterhout. 1990. Why Aren't Operating Systems Getting Faster As Fast as Hardware? 1990 Summer USENIX Annual Technical Conference (1990), 247--256.
[31]
Sejin Park and Chanik Park. 2017. FRD: A filtering based buffer cache algorithm that considers both frequency and reuse distance. In Proc. of the 33rd IEEE International Conference on Massive Storage Systems and Technology (MSST).
[32]
César San-Lucas and Cristina L. Abad. 2016. Towards a fast multi-tier storage system simulator. In 2016 IEEE Ecuador Technical Chapters Meeting (ETCM). 1--5.
[33]
Woong Shin, Christopher Brumgard, Bing Xie, Sudharshan Vazhkudai, Devarshi Ghoshal, Sarp Oral, and Lavanya Ramakrishnan. 2019. Data Jockey: Automatic Data Management for HPC Multi-tiered Storage Systems. In 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS). 511--522. ISSN: 1530-2075.
[34]
Luis Thomas, Sebastien Gougeaud, Stéphane Rubini, Philippe Deniel, and Jalil Boukhobza. 2021. Predicting File Lifetimes for Data Placement in Multi-Tiered Storage Systems for HPC. In Proceedings of the Workshop on Challenges and Opportunities of Efficient and Performant Storage Systems (Online Event, United Kingdom) (CHEOPS '21). Association for Computing Machinery, New York, NY, USA, Article 2, 9 pages.
[35]
Cristian Ungureanu, Biplob Debnath, Stephen Rago, and Akshat Aranya. 2013. TBF: A memory-efficient replacement policy for flash-based caches. In 2013 IEEE 29th International Conference on Data Engineering (ICDE). 1117--1128.
[36]
J. van den Berg and A. Gandolfi. 1992. LRU is better than FIFO under the independent reference model. Journal of Applied Probability 29, 1 (1992), 239--243.
[37]
Bharti Wadhwa, Surendra Byna, and Ali Butt. 2018. Toward Transparent Data Management in Multi-Layer Storage Hierarchy of HPC Systems. In 2018 IEEE International Conference on Cloud Engineering (IC2E). 211--217.
[38]
Lipeng Wan, Zheng Lu, Qing Cao, Feiyi Wang, Sarp Oral, and Bradley Settlemyer. 2014. SSD-optimized workload placement with adaptive learning and classification in HPC environments. In 2014 30th Symposium on Mass Storage Systems and Technologies (MSST). 1--6.
[39]
Wenguang Wang. 2004. Storage Management for Large Scale Systems. Ph.D. Dissertation. CAN.
[40]
Orcun Yildiz, Amelie Zhou, and Shadi Ibrahim. 2017. Eley: On the Effectiveness of Burst Buffers for Big Data Processing in HPC Systems. In 2017 IEEE International Conference on Cluster Computing (CLUSTER). 87--91.

Cited By

View all
  • (2024)Optimizing Distributed Tiered Data Storage Systems with DITISProceedings of the VLDB Endowment10.14778/3685800.368588317:12(4393-4396)Online publication date: 8-Nov-2024
  • (2022)The Lannion report on Big Data and Security Monitoring Research2022 IEEE International Conference on Big Data (Big Data)10.1109/BigData55660.2022.10020852(2960-2969)Online publication date: 17-Dec-2022

Index Terms

  1. SLRL: a simple least remaining lifetime file evicition policy for HPC multi-tier storage systems

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CHEOPS '22: Proceedings of the Workshop on Challenges and Opportunities of Efficient and Performant Storage Systems
      April 2022
      44 pages
      ISBN:9781450392099
      DOI:10.1145/3503646
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 05 April 2022

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. data placement
      2. file lifetime
      3. high performance computing
      4. multi-tier storage
      5. simulation

      Qualifiers

      • Research-article

      Conference

      EuroSys '22
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 6 of 8 submissions, 75%

      Upcoming Conference

      EuroSys '25
      Twentieth European Conference on Computer Systems
      March 30 - April 3, 2025
      Rotterdam , Netherlands

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)30
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 11 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Optimizing Distributed Tiered Data Storage Systems with DITISProceedings of the VLDB Endowment10.14778/3685800.368588317:12(4393-4396)Online publication date: 8-Nov-2024
      • (2022)The Lannion report on Big Data and Security Monitoring Research2022 IEEE International Conference on Big Data (Big Data)10.1109/BigData55660.2022.10020852(2960-2969)Online publication date: 17-Dec-2022

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media