On Resource Pooling and Separation for LRU Caching

Published: 03 April 2018

Abstract

Caching systems using the Least Recently Used (LRU) principle have become ubiquitous. A fundamental question for these systems is whether the cache space should be pooled together or divided to serve multiple flows of data item requests in order to minimize the miss probabilities. In this paper, we show that there is no straight yes or no answer to this question: the answer depends on complex combinations of critical factors, including request rates, overlapped data items across different request flows, data item popularities, and data item sizes. To this end, we characterize the performance of multiple flows of data item requests under resource pooling and separation for LRU caching when the cache size is large.
Analytically, we show that it is asymptotically optimal to jointly serve multiple flows if their data item sizes and popularity distributions are similar and their arrival rates do not differ significantly; the self-organizing property of LRU caching automatically optimizes the resource allocation among them asymptotically. Otherwise, separating these flows could be better, e.g., when data sizes vary significantly. We also quantify critical points beyond which resource pooling is better than separation for each of the flows when the overlapped data items exceed certain levels. Technically, for a broad class of heavy-tailed distributions, we derive the asymptotic miss probabilities of multiple flows of requests with varying data item sizes in a shared LRU cache space. This derivation also validates the characteristic time approximation under certain conditions. These results provide new insights on improving the performance of caching systems.
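The pooling-versus-separation question can be explored empirically with a small simulation. The sketch below is illustrative only and is not the paper's analysis: it assumes two flows with unit-size items, disjoint item sets, identical Zipf-type popularity, and a static equal split of the cache in the separated case. The names `LRUCache` and `compare_pooling` and all parameter values are hypothetical choices made for this example.

```python
import random
from collections import OrderedDict
from itertools import accumulate


class LRUCache:
    """LRU cache for unit-size items that counts requests and misses."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.items = OrderedDict()  # least recently used entry comes first
        self.requests = 0
        self.misses = 0

    def request(self, key):
        """Serve one request; return True on a hit, False on a miss."""
        self.requests += 1
        if key in self.items:
            self.items.move_to_end(key)  # mark as most recently used
            return True
        self.misses += 1
        self.items[key] = None
        if len(self.items) > self.capacity:
            self.items.popitem(last=False)  # evict the least recently used
        return False

    def miss_ratio(self):
        return self.misses / self.requests if self.requests else 0.0


def compare_pooling(n_requests=50_000, n_items=2_000, alpha=0.8,
                    total_capacity=200, rate_share=0.5, seed=0):
    """Per-flow miss ratios for one pooled cache vs. two separate caches.

    Assumptions (not from the paper): unit-size items, disjoint item sets,
    the same Zipf(alpha) popularity for both flows, and an equal static
    split of the space when the flows are separated. `rate_share` is the
    fraction of requests that belong to flow 0.
    """
    rng = random.Random(seed)
    cum = list(accumulate(1.0 / rank ** alpha
                          for rank in range(1, n_items + 1)))
    ranks = rng.choices(range(n_items), cum_weights=cum, k=n_requests)

    half = total_capacity // 2
    pooled = LRUCache(total_capacity)
    separate = [LRUCache(half), LRUCache(total_capacity - half)]
    flow_requests = [0, 0]
    pooled_misses = [0, 0]

    for rank in ranks:
        flow = 0 if rng.random() < rate_share else 1
        key = (flow, rank)  # tagging by flow keeps the item sets disjoint
        flow_requests[flow] += 1
        if not pooled.request(key):
            pooled_misses[flow] += 1
        separate[flow].request(key)

    return {
        "pooled": [m / r if r else 0.0
                   for m, r in zip(pooled_misses, flow_requests)],
        "separate": [cache.miss_ratio() for cache in separate],
    }


if __name__ == "__main__":
    print(compare_pooling())
```

Varying `rate_share` away from 0.5, or giving the two flows different popularity exponents, is one way to probe the regimes the abstract describes. Varying item sizes and overlapped items, which the paper treats, are outside the scope of this sketch.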

Published In

Proceedings of the ACM on Measurement and Analysis of Computing Systems, Volume 2, Issue 1
March 2018
603 pages
EISSN: 2476-1249
DOI: 10.1145/3203302
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 April 2018
Published in POMACS Volume 2, Issue 1

Author Tags

  1. caching algorithm
  2. lru
  3. memcached
  4. miss probability

Qualifiers

  • Research-article

Cited By

  • (2024) Edge-assisted Real-time Dynamic 3D Point Cloud Rendering for Multi-party Mobile Virtual Reality. Proceedings of the 32nd ACM International Conference on Multimedia, 2824-2832. DOI: 10.1145/3664647.3681650. Online publication date: 28-Oct-2024
  • (2024) M2HO: Mitigating the Adverse Effects of 5G Handovers on TCP. Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 1089-1103. DOI: 10.1145/3636534.3690680. Online publication date: 4-Dec-2024
  • (2024) Hybrid-Coding Based Content Access Control for Information-Centric Networking. IEEE Transactions on Wireless Communications 23:7, 6765-6777. DOI: 10.1109/TWC.2023.3332930. Online publication date: 1-Jul-2024
  • (2024) Management of Caching Policies and Redundancy Over Unreliable Channels. IEEE Transactions on Network and Service Management 21:2, 1912-1928. DOI: 10.1109/TNSM.2023.3334559. Online publication date: Apr-2024
  • (2023) A Lightweight and Adaptive Cache Allocation Scheme for Content Delivery Networks. 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE), 1-6. DOI: 10.23919/DATE56975.2023.10136922. Online publication date: Apr-2023
  • (2023) Handling Data Handoff of AI-Based Applications in Edge Computing Systems. IEEE Transactions on Network and Service Management 20:4, 4435-4447. DOI: 10.1109/TNSM.2023.3267942. Online publication date: 1-Dec-2023
  • (2023) 5G-Advanced Toward 6G: Past, Present, and Future. IEEE Journal on Selected Areas in Communications 41:6, 1592-1619. DOI: 10.1109/JSAC.2023.3274037. Online publication date: 1-Jun-2023
  • (2023) Offline and Online Algorithms for Cache Allocation with Monte Carlo Tree Search and a Learned Model. 2023 IEEE 41st International Conference on Computer Design (ICCD), 126-133. DOI: 10.1109/ICCD58817.2023.00028. Online publication date: 6-Nov-2023
  • (2022) Tutti. Proceedings of the 28th Annual International Conference on Mobile Computing And Networking, 729-742. DOI: 10.1145/3495243.3560538. Online publication date: 14-Oct-2022
  • (2022) Balancing Revenue and Cache Partition Fairness for CDN Providers. 2022 IEEE 24th Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys), 1562-1569. DOI: 10.1109/HPCC-DSS-SmartCity-DependSys57074.2022.00239. Online publication date: Dec-2022