Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3485447.3512154acmconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
research-article

Commutativity-guaranteed Docker Image Reconstruction towards Effective Layer Sharing

Published: 25 April 2022 Publication History
  • Get Citation Alerts
  • Abstract

    Owing to the benefit of light weight, containers have become a promising enabler for cloud native computing. Container images composed of applications and dependencies support flexible service deployment and migration. Rapid adoption and integration of containers generate millions of images to be stored. Additionally, non-local images have to be frequently downloaded from the registry, resulting in huge amounts of traffic. Content Addressable Storage (CAS) has been adopted for saving storage and networking by enabling identical layers sharing across images. However, according to our measurements, the implication of CAS is significantly limited as layers are rarely fully identical in practice. In this paper, we propose to reconstruct the docker images to raise the number of identical layers and thereby reduce storage and network consumption. We explore the layered structure of images and define the commutativity of files to assure image validity. The image reconstruction is formulated as an integer nonlinear programming problem. Inspired by the observed similarity of layers, we design a similarity-aware online image reconstruction algorithm. Extensive evaluations are conducted to verify the performance of the proposed approach.

    References

    [1]
    2021. Docker. https://www.docker.com/.
    [2]
    2021. Docker Hub. https://hub.docker.com/.
    [3]
    2021. How the overlay2 driver works. https://docs.docker.com/storage/storagedriver/overlayfs-driver/##how-the-overlay2-driver-works.
    [4]
    Alexandru Agache, Marc Brooker, Alexandra Iordache, Anthony Liguori, Rolf Neugebauer, Phil Piwonka, and Diana-Maria Popa. 2020. Firecracker: Lightweight virtualization for serverless applications. In 17th usenix symposium on networked systems design and implementation (nsdi 20). 419–434.
    [5]
    Istemi Ekin Akkus, Ruichuan Chen, Ivica Rimac, Manuel Stein, Klaus Satzke, Andre Beck, Paarijaat Aditya, and Volker Hilt. 2018. SAND: Towards High-Performance Serverless Computing. In 2018 Usenix Annual Technical Conference (USENIX ATC 18). 923–935.
    [6]
    Ali Anwar, Mohamed Mohamed, Vasily Tarasov, Michael Littley, Lukas Rupprecht, Yue Cheng, Nannan Zhao, Dimitrios Skourtis, Amit S Warke, Heiko Ludwig, 2018. Improving docker registry design based on production workload analysis. In 16th USENIX Conference on File and Storage Technologies (FAST 18). 265–278.
    [7]
    Ali Anwar, Lukas Rupprecht, Dimitris Skourtis, and Vasily Tarasov. 2019. Challenges in Storing Docker Images. login Usenix Mag. 44, 3 (2019).
    [8]
    Eric A Brewer. 2015. Kubernetes and the path to cloud native. In Proceedings of the sixth ACM symposium on cloud computing. 167–167.
    [9]
    Jad Darrous, Thomas Lambert, and Shadi Ibrahim. 2019. On the importance of container image placement for service provisioning in the edge. In 2019 28th International Conference on Computer Communication and Networks (ICCCN). IEEE, 1–9.
    [10]
    Dong Du, Tianyi Yu, Yubin Xia, Binyu Zang, Guanglu Yan, Chenggang Qin, Qixuan Wu, and Haibo Chen. 2020. Catalyzer: Sub-millisecond startup for serverless computing with initialization-less booting. In Proceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Systems. 467–481.
    [11]
    Mike Dutch. 2008. Understanding data deduplication ratios. In SNIA Data Management Forum, Vol. 7.
    [12]
    Wes Felter, Alexandre Ferreira, Ram Rajamony, and Juan Rubio. 2015. An updated performance comparison of virtual machines and linux containers. In 2015 IEEE international symposium on performance analysis of systems and software (ISPASS). 171–172.
    [13]
    Lin Gu, Deze Zeng, Jie Hu, Hai Jin, Song Guo, and Albert Y Zomaya. 2021. Exploring Layered Container Structure for Cost Efficient Microservice Deployment. IEEE INFOCOM 2021-IEEE Conference on Computer Communications (2021), 1–9.
    [14]
    Lin Gu, Deze Zeng, Jie Hu, Bo Li, and Hai Jin. 2021. Layer Aware Microservice Placement and Request Scheduling at the Edge. IEEE INFOCOM 2021-IEEE Conference on Computer Communications (2021), 1–9.
    [15]
    Lieve Hamers 1989. Similarity measures in scientometric research: The Jaccard index versus Salton’s cosine formula.Information Processing and Management 25, 3 (1989), 315–18.
    [16]
    Tyler Harter, Brandon Salmon, Rose Liu, Andrea C Arpaci-Dusseau, and Remzi H Arpaci-Dusseau. 2016. Slacker: Fast distribution with lazy docker containers. In 14th USENIX Conference on File and Storage Technologies (FAST 16). 181–195.
    [17]
    Devki Nandan Jha, Michael Nee, Zhenyu Wen, Albert Zomaya, and Rajiv Ranjan. 2019. SmartDBO: smart docker benchmarking orchestrator for web-application. In The World Wide Web Conference. 3555–3559.
    [18]
    Yan Li, Bo An, Junming Ma, and Donggang Cao. 2019. Comparison between Chunk-Based and Layer-Based Container Image Storage Approaches: an Empirical Study. In 2019 IEEE International Conference on Service-Oriented System Engineering (SOSE). IEEE, 197–1975.
    [19]
    Paul B Menage. 2007. Adding generic process containers to the linux kernel. In Proceedings of the Linux symposium, Vol. 2. 45–57.
    [20]
    Edward Oakes, Leon Yang, Dennis Zhou, Kevin Houck, Tyler Harter, Andrea Arpaci-Dusseau, and Remzi Arpaci-Dusseau. 2018. SOCK: Rapid task provisioning with serverless-optimized containers. In 2018 USENIX Annual Technical Conference (USENIX ATC 18). 57–70.
    [21]
    Dian Rachmawati, JT Tarigan, and ABC Ginting. 2018. A comparative study of Message Digest 5 (MD5) and SHA256 algorithm. In Journal of Physics: Conference Series, Vol. 978. IOP Publishing, 012116.
    [22]
    Ronald Rivest and S Dusse. 1992. The MD5 message-digest algorithm.
    [23]
    Mendel Rosenblum and Tal Garfinkel. 2005. Virtual machine monitors: Current technology and future trends. Computer 38, 5 (2005), 39–47.
    [24]
    Mohammad Shahrad, Rodrigo Fonseca, Íñigo Goiri, Gohar Chaudhry, Paul Batum, Jason Cooke, Eduardo Laureano, Colby Tresness, Mark Russinovich, and Ricardo Bianchini. 2020. Serverless in the wild: Characterizing and optimizing the serverless workload at a large cloud provider. In 2020 USENIX Annual Technical Conference (USENIX ATC 20). 205–218.
    [25]
    Dimitris Skourtis, Lukas Rupprecht, Vasily Tarasov, and Nimrod Megiddo. 2019. Carving perfect layers out of docker images. In 11th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 19).
    [26]
    Piet Smet, Bart Dhoedt, and Pieter Simoens. 2018. Docker layer placement for on-demand provisioning of services on edge clouds. IEEE Transactions on Network and Service Management 15, 3(2018), 1161–1174.
    [27]
    Gaetano Somma, Constantine Ayimba, Paolo Casari, Simon Pietro Romano, and Vincenzo Mancuso. 2020. When Less is More: Core-Restricted Container Provisioning for Serverless Computing. In IEEE INFOCOM 2020-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS). IEEE, 1153–1159.
    [28]
    Vasily Tarasov, Lukas Rupprecht, Dimitris Skourtis, Amit Warke, Dean Hildebrand, Mohamed Mohamed, Nagapramod Mandagere, Wenji Li, Raju Rangaswami, and Ming Zhao. 2017. In search of the ideal storage configuration for Docker containers. In 2017 IEEE 2nd International Workshops on Foundations and Applications of Self* Systems (FAS* W). IEEE, 199–206.
    [29]
    Niraj Tolia, Michael Kozuch, Mahadev Satyanarayanan, Brad Karp, Thomas C Bressoud, and Adrian Perrig. 2003. Opportunistic Use of Content Addressable Storage for Distributed File Systems. In USENIX Annual Technical Conference, General Track, Vol. 3. 127–140.
    [30]
    Kai-Ting Amy Wang, Rayson Ho, and Peng Wu. 2019. Replayable execution optimized for page sharing for a managed runtime environment. In Proceedings of the Fourteenth EuroSys Conference 2019. 1–16.
    [31]
    Frank Zhao, Kevin Xu, and Randy Shain. 2016. Improving copy-on-write performance in container storage drivers. In Storage Developers Conference.
    [32]
    Nannan Zhao, Hadeel Albahar, Subil Abraham, Keren Chen, Vasily Tarasov, Dimitrios Skourtis, Lukas Rupprecht, Ali Anwar, and Ali R Butt. 2020. Duphunter: Flexible high-performance deduplication for docker registries. In 2020 USENIX Annual Technical Conference (USENIX ATC 20). 769–783.
    [33]
    Nannan Zhao, Vasily Tarasov, Hadeel Albahar, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Arnab K Paul, Keren Chen, and Ali R Butt. 2020. Large-Scale Analysis of Docker Images and Performance Implications for Container Storage Systems. IEEE Transactions on Parallel and Distributed Systems 32, 4 (2020), 918–930.
    [34]
    Nannan Zhao, Vasily Tarasov, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Amit Warke, Mohamed Mohamed, and Ali Butt. 2019. Slimmer: Weight loss secrets for docker registries. In 2019 IEEE 12th International Conference on Cloud Computing (CLOUD). IEEE, 517–519.

    Cited By

    View all
    • (2023)LOPO: An Out-of-order Layer Pulling Orchestration Strategy for Fast Microservice StartupIEEE INFOCOM 2023 - IEEE Conference on Computer Communications10.1109/INFOCOM53939.2023.10229072(1-9)Online publication date: 17-May-2023
    • (2023)Container Image Similarity-Aware Resource Provisioning for Serverless Edge Computing2023 IEEE International Conference on Web Services (ICWS)10.1109/ICWS60048.2023.00047(278-288)Online publication date: Jul-2023
    • (2022)Improving Performance of Docker Instance via Image ReconstructionBig Data Intelligence and Computing10.1007/978-981-99-2233-8_37(511-522)Online publication date: 8-Dec-2022

    Index Terms

    1. Commutativity-guaranteed Docker Image Reconstruction towards Effective Layer Sharing
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          WWW '22: Proceedings of the ACM Web Conference 2022
          April 2022
          3764 pages
          ISBN:9781450390965
          DOI:10.1145/3485447
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 25 April 2022

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. Container images
          2. Containers
          3. Docker
          4. Docker hub

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Conference

          WWW '22
          Sponsor:
          WWW '22: The ACM Web Conference 2022
          April 25 - 29, 2022
          Virtual Event, Lyon, France

          Acceptance Rates

          Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)74
          • Downloads (Last 6 weeks)4
          Reflects downloads up to 27 Jul 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2023)LOPO: An Out-of-order Layer Pulling Orchestration Strategy for Fast Microservice StartupIEEE INFOCOM 2023 - IEEE Conference on Computer Communications10.1109/INFOCOM53939.2023.10229072(1-9)Online publication date: 17-May-2023
          • (2023)Container Image Similarity-Aware Resource Provisioning for Serverless Edge Computing2023 IEEE International Conference on Web Services (ICWS)10.1109/ICWS60048.2023.00047(278-288)Online publication date: Jul-2023
          • (2022)Improving Performance of Docker Instance via Image ReconstructionBig Data Intelligence and Computing10.1007/978-981-99-2233-8_37(511-522)Online publication date: 8-Dec-2022

          View Options

          Get Access

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media