research-article

Extremely-Compressed SSDs with I/O Behavior Prediction

Authors:

Jason XueAuthors Info & Claims

ACM Transactions on Storage, Volume 20, Issue 4

Article No.: 26, Pages 1 - 38

https://doi.org/10.1145/3677044

Published: 06 August 2024 Publication History

Abstract

As the data volume continues to grow exponentially, there is an increasing demand for large storage system capacity. Data compression techniques effectively reduce the volume of written data, enhancing space efficiency. As a result, many modern SSDs have already incorporated data compression capabilities. However, data compression introduces additional processing overhead in critical I/O paths, potentially affecting system performance. Currently, most compression solutions in flash-based storage systems employ fixed compression algorithms for all incoming data without leveraging differences among various data access patterns. This leads to sub-optimal compression efficiency.

This article proposes a data-type-aware Flash Translation Layer (DAFTL) scheme to maximize space efficiency without compromising system performance. First, we propose an I/O behavior prediction method to forecast future access on specific data. Then, DAFTL matches data types with distinct I/O behaviors to compression algorithms of varying intensities, achieving an optimal balance between performance and space efficiency. Specifically, it employs higher-intensity compression algorithms for less frequently accessed data to maximize space efficiency. For frequently accessed data, it utilizes lower-intensity but faster compression algorithms to maintain system performance. Finally, an improved compact compression method is proposed to effectively eliminate page fragmentation and further enhance space efficiency. Extensive evaluations using a variety of real-world workloads, as well as the workloads with real data we collected on our platforms, demonstrate that DAFTL achieves more data reductions than other approaches. When compared to the state-of-the-art compression schemes, DAFTL reduces the total number of pages written to the SSD by an average of 8%, 21.3%, and 25.6% for data with high, medium, and low compressibility, respectively. In the case of workloads with real data, DAFTL achieves an average reduction of 10.4% in the total number of pages written to SSD. Furthermore, DAFTL exhibits comparable or even improved read and write performance compared to other solutions.

References

[1]

Przemyslaw Skibinski, Jinfei Han, Dmitry Atamanov, Andrea Bocci, and Chip Turner. 2015. Lzbench. Retrieved 30 June 2024 from https://github.com/inikep/lzbench

[2]

Milosz Krajewski. 2015. Silesia Compression Corpus. Retrieved 30 June 2024 from https://github.com/MiloszKrajewski/SilesiaCorpus

[3]

Facebook. 2016. ZSTD. Retrieved 30 June 2024 from https://github.com/facebook/zstd

[4]

Chao Shi and Qiuping Wang. 2018. Alibaba Block Traces. Retrieved 30 June 2024 from https://github.com/alibaba/block-traces

[5]

Ohad Rodeh, Josef Bacik, and Chris Mason. 2013. BTRFS: The linux B-tree filesystem. ACM Transactions on Storage (TOS) 9, 3 (2013), 1–32.

[6]

ScaleFlux. 2023. CSD3000 Solid State Drive. Retrieved 30 June 2024 from https://scaleflux.com/products/csd-3000

[7]

OpenZFS. 2023. OpenZFS on Linux and FreeBSD. Retrieved 30 June 2024 from https://github.com/openzfs/zfs

[8]

Abutalib Aghayev, Theodore Ts’o, Garth Gibson, and Peter Desnoyers. 2017. Evolving ext4 for shingled disks. In Proceedings of the 15th USENIX Conference on File and Storage Technologies (FAST’17). 105–120.

Digital Library

[9]

Tobias Ahrens, Mohammed Rajab, and Jürgen Freudenberger. 2016. Compression of short data blocks to improve the reliability of non-volatile flash memories. In Proceedings of the 2016 International Conference on Information and Digital Technologies (IDT’16). IEEE, 1–4.

[10]

Matěj Bartík, Sven Ubik, and Pavel Kubalik. 2015. LZ4 compression algorithm on FPGA. In Proceedings of the 2015 IEEE International Conference on Electronics, Circuits, and Systems (ICECS’15). IEEE, 179–182.

[11]

Chandranil Chakraborttii and Heiner Litz. 2021. Reducing write amplification in flash by death-time prediction of logical block addresses. In Proceedings of the 14th ACM International Conference on Systems and Storage. 1–12.

Digital Library

[12]

Chin-Hsing Chen, Chun-Ta Chen, and Wen-Tzeng Huang. 2008. The real-time compression layer for flash memory in mobile multimedia devices. Mobile Networks and Applications 13, 6 (2008), 547–554.

[13]

Feng Chen, David A. Koufaty, and Xiaodong Zhang. 2009. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. ACM SIGMETRICS Performance Evaluation Review 37, 1 (2009), 181–192.

Digital Library

[14]

Xubin Chen, Yin Li, Jingpeng Hao, Hyunsuk Shin, Michael Suh, and Tong Zhang. 2019. Simultaneously reducing cost and improving performance of NVM-based block devices via transparent data compression. In Proceedings of the International Symposium on Memory Systems. 331–341.

Digital Library

[15]

Xubin Chen, Yin Li, and Tong Zhang. 2018. Reducing flash memory write traffic by exploiting a few MBs of capacitor-powered write buffer inside solid-state drives (SSDs). IEEE Transactions on Computers 68, 3 (2018), 426–439.

Digital Library

[16]

Jinhua Cui, Youtao Zhang, Liang Shi, Chun Jason Xue, Weiguo Wu, and Jun Yang. 2018. ApproxFTL: On the performance and lifetime improvement of 3-D NAND flash-based SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 37, 10 (2018), 1957–1970. DOI:

Digital Library

[17]

Carlos Escuin, Asif Ali Khan, Pablo Ibánez, Teresa Monreal, Jeronimo Castrillon, and Víctor Viñals. 2023. Compression-aware and performance-efficient insertion policies for long-lasting hybrid LLCs. In Proceedings of the 2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA’23). IEEE, 179–192.

[18]

Raúl Gracia-Tinedo, Danny Harnik, Dalit Naor, Dmitry Sotnikov, Sivan Toledo, and Aviad Zuck. 2015. \(\lbrace\)SDGen\(\rbrace\): Mimicking datasets for content generation in storage benchmarks. In Proceedings of the 13th USENIX Conference on File and Storage Technologies (FAST’15). 317–330.

[19]

Jen-Wei Hsieh, Tei-Wei Kuo, and Li-Pin Chang. 2006. Efficient identification of hot data for flash memory storage systems. ACM Transactions on Storage (TOS) 2, 1 (2006), 22–40.

Digital Library

[20]

Xiaokang Hu, Fuzong Wang, Weigang Li, Jian Li, and Haibing Guan. 2019. \(\lbrace\)QZFS\(\rbrace\):\(\lbrace\)QAT\(\rbrace\) Accelerated compression in file system for application agnostic and cost efficient data storage. In Proceedings of the 2019 USENIX Annual Technical Conference (ATC’19). 163–176.

[21]

Yang Hu, Hong Jiang, Dan Feng, Lei Tian, Hao Luo, and Shuping Zhang. 2011. Performance impact and interplay of SSD parallelism through advanced commands, allocation strategy and data granularity. In Proceedings of the International Conference on Supercomputing. 96–107.

Digital Library

[22]

Cheng Ji, Li-Pin Chang, Riwei Pan, Chao Wu, Congming Gao, Liang Shi, Tei-Wei Kuo, and Chun Jason Xue. 2021. \(\lbrace\)Pattern-Guided\(\rbrace\) file compression with \(\lbrace\)User-Experience\(\rbrace\) enhancement for \(\lbrace\)Log-Structured\(\rbrace\) file system on mobile devices. In Proceedings of the 19th USENIX Conference on File and Storage Technologies (FAST’21). 127–140.

[23]

Cheng Ji, Li-Pin Chang, Liang Shi, Congming Gao, Chao Wu, Yuangang Wang, and Chun Jason Xue. 2017. Lightweight data compression for mobile flash storage. ACM Transactions on Embedded Computing Systems (TECS) 16, 5s (2017), 1–18.

Digital Library

[24]

Jeong-Uk Kang, Jeeseok Hyun, Hyunjoo Maeng, and Sangyeun Cho. 2014. The multi-streamed \(\lbrace\)Solid-State\(\rbrace\) drive. In Proceedings of the 6th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’14).

[25]

Mincheol Kang, Wonyoung Lee, Jinkwon Kim, and Soontae Kim. 2022. PR-SSD: Maximizing partial read potential by exploiting compression and channel-level parallelism. IEEE Transactions on Computers 72, 3 (2022), 772–785.

[26]

Mincheol Kang, Wonyoung Lee, and Soontae Kim. 2018. Subpage-aware solid state drive for improving lifetime and performance. IEEE Transactions on Computers 67, 10 (2018), 1492–1505.

[27]

Dohyun Kim, Kwangwon Min, Joontaek Oh, and Youjip Won. 2022. \(\lbrace\)ScaleXFS\(\rbrace\): Getting scalability of \(\lbrace\)XFS\(\rbrace\) back on the ring. In Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST’22). 329–344.

[28]

Jungrae Kim, Michael Sullivan, Seong-Lyong Gong, and Mattan Erez. 2015. Frugal ECC: Efficient and versatile memory error protection through fine-grained compression. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. 1–12.

Digital Library

[29]

Changman Lee, Dongho Sim, Jooyoung Hwang, and Sangyeun Cho. 2015. \(\lbrace\)F2FS\(\rbrace\): A new file system for flash storage. In Proceedings of the 13th USENIX Conference on File and Storage Technologies (FAST’15). 273–286.

[30]

Jongsung Lee and Jin-Soo Kim. 2013. An empirical study of hot/cold data separation policies in solid state drives (SSDs). In Proceedings of the 6th International Systems and Storage Conference. 1–6.

Digital Library

[31]

Sungjin Lee, Jihoon Park, Kermin Fleming, Arvind Arvind, and Jihong Kim. 2011. Improving performance and lifetime of solid-state drives using hardware-accelerated compression. In IEEE Transactions on Consumer Electronics 57, 4 (2011), 1732–1739.

[32]

Jinhong Li, Qiuping Wang, Patrick P. C. Lee, and Chao Shi. 2023. An in-depth comparative analysis of cloud block storage workloads: Findings and implications. ACM Transactions on Storage 19, 2 (2023), 1–32.

Digital Library

[33]

Jiangpeng Li, Kai Zhao, Xuebin Zhang, Jun Ma, Ming Zhao, and Tong Zhang. 2015. How much can data compressibility help to improve \(\lbrace\)NAND\(\rbrace\) flash memory lifetime?. In Proceedings of the 13th USENIX Conference on File and Storage Technologies (FAST’15). 227–240.

Digital Library

[34]

Qiao Li, Liang Shi, Riwei Pan, Cheng Ji, Xiaoqiang Li, and Chun Jason Xue. 2018. Selective compression scheme for read performance improvement on flash devices. In Proceedings of the 2018 IEEE 36th International Conference on Computer Design (ICCD’18). IEEE, 43–50.

[35]

Qiao Li, Liang Shi, Chun Jason Xue, Kaijie Wu, Cheng Ji, Qingfeng Zhuge, and Edwin H-M Sha. 2016. Access characteristic guided read and write cost regulation for performance improvement on flash memory. In Proceedings of the14th USENIX Conference on File and Storage Technologies (FAST’16). 125–132.

[36]

Qiang Li, Qiao Xiang, Yuxin Wang, Haohao Song, Ridi Wen, Wenhui Yao, Yuanyuan Dong, Shuqi Zhao, Shuo Huang, Zhaosheng Zhu, Huayong Wang, Shanyang Liu, Lulu Chen, Zhiwu Wu, Haonan Qiu, Derui Liu, Gexiao Tian, Chao Han, Shaozong Liu, Yaohui Wu, Zicheng Luo, Yuchao Shao, Junping Wu, Zheng Cao, Zhongjie Wu, Jiaji Zhu, Jinbo Wu, Jiwu Shu, and Jiesheng Wu. 2023. More than capacity: Performance-oriented evolution of Pangu in Alibaba. In Proceedings of the 21st USENIX Conference on File and Storage Technologies (FAST’23). 331–346.

[37]

Weiqiang Liu, Faqiang Mei, Chenghua Wang, Maire O’Neill, and Earl E. Swartzlander. 2018. Data compression device based on modified LZ4 algorithm. IEEE Transactions on Consumer Electronics 64, 1 (2018), 110–117.

[38]

Thanos Makatos, Yannis Klonatos, Manolis Marazakis, Michail D. Flouris, and Angelos Bilas. 2010. Using transparent compression to improve SSD-based I/O caches. In Proceedings of the 5th European Conference on Computer Systems. 1–14.

Digital Library

[39]

Bo Mao, Suzhen Wu, Hong Jiang, Yaodong Yang, and Zaifa Xi. 2018. EDC: Improving the performance and space efficiency of flash-based storage systems with elastic data compression. IEEE Transactions on Parallel and Distributed Systems 29, 6 (2018), 1261–1274.

[40]

Nimrod Megiddo and Dharmendra S. Modha. 2004. Outperforming LRU with an adaptive replacement cache algorithm. Computer 37, 4 (2004), 58–65.

Digital Library

[41]

Changwoo Min, Sanidhya Kashyap, Steffen Maass, and Taesoo Kim. 2016. Understanding manycore scalability of file systems. In Proceedings of the 2016 USENIX Annual Technical Conference (ATC’16). 71–85.

[42]

Dushyanth Narayanan, Austin Donnelly, and Antony Rowstron. 2008. Write off-loading: Practical power management for enterprise storage. ACM Transactions on Storage (TOS) 4, 3 (2008), 1–23.

Digital Library

[43]

Kazuichi Oe, Kazutaka Ogihara, and Takeo Honda. 2018. Analysis of commercial cloud workload and study on how to apply cache methods. IEICE Technical Report; IEICE Tech. Rep. 118, 165 (2018), 7–12.

[44]

Elizabeth J. O’neil, Patrick E. O’neil, and Gerhard Weikum. 1993. The LRU-K page replacement algorithm for database disk buffering. ACM SIGMOD Record 22, 2 (1993), 297–306.

Digital Library

[45]

Yubiao Pan, Yongkun Li, Huizhen Zhang, and Yinlong Xu. 2019. Lifetime-aware FTL to improve the lifetime and performance of solid-state drives. Future Generation Computer Systems 93 (2019), 58–67.

Digital Library

[46]

Lu Pang, Anis Alazzawe, Krishna Kant, and Jeremy Swift. 2019. Data heat prediction in storage systems using behavior specific prediction models. In Proceedings of the 2019 IEEE 38th International Performance Computing and Communications Conference (IPCCC’19). IEEE, 1–8.

[47]

Jisung Park, Jeonggyun Kim, Yeseong Kim, Sungjin Lee, and Onur Mutlu. 2022. \(\lbrace\)DeepSketch\(\rbrace\): A new machine \(\lbrace\)Learning-Based\(\rbrace\) reference search technique for \(\lbrace\)Post-Deduplication\(\rbrace\) delta compression. In Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST’22). 247–264.

[48]

Youngjo Park and Jin-Soo Kim. 2011. zFTL: Power-efficient data compression support for NAND flash-based consumer electronics devices. IEEE Transactions on Consumer Electronics 57, 3 (2011), 1148–1156.

[49]

Yifan Qiao, Xubin Chen, Ning Zheng, Jiangpeng Li, Yang Liu, and Tong Zhang. 2022. Closing the B+-tree vs.\(\lbrace\)LSM-tree\(\rbrace\) write amplification gap on modern storage hardware with built-in transparent compression. In Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST’22). 69–82.

[50]

Mansour Shafaei, Peter Desnoyers, and Jim Fitzpatrick. 2016. Write amplification reduction in \(\lbrace\)Flash-Based\(\rbrace\)\(\lbrace\)SSDs\(\rbrace\) through \(\lbrace\)Extent-Based\(\rbrace\) temperature identification. In Proceedings of the 8th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’16).

Digital Library

[51]

Yunpeng Song, Yiyang Huang, Yina Lv, Yi Zhang, and Liang Shi. 2023. When F2FS meets compression-based SSD!. In Proceedings of the 15th ACM Workshop on Hot Topics in Storage and File Systems. 87–92.

Digital Library

[52]

Qiuping Wang, Jinhong Li, Patrick P. C. Lee, Tao Ouyang, Chao Shi, and Lilong Huang. 2022. Separating data via block invalidation time inference for write amplification reduction in \(\lbrace\)Log-Structured\(\rbrace\) storage. In Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST’22). 429–444.

[53]

Guanying Wu and Xubin He. 2012. Delta-FTL: Improving SSD lifetime via exploiting content locality. In Proceedings of the 7th ACM European Conference on Computer Systems. 253–266.

Digital Library

[54]

Fei Xie, Michael Condict, and Sandip Shete. 2013. Estimating duplication by content-based sampling. In Proceedings of the 2013 USENIX Annual Technical Conference (ATC’13). 181–186.

Digital Library

[55]

Lei Yang, Hong Wu, Tieying Zhang, Xuntao Cheng, Feifei Li, Lei Zou, Yujie Wang, Rongyao Chen, Jianying Wang, and Gui Huang. 2020. Leaper: A learned prefetcher for cache invalidation in LSM-tree based storage engines. Proceedings of the VLDB Endowment 13, 12 (2020), 1976–1989.

Digital Library

[56]

Zhifeng Yang, Quanqing Xu, Shanyan Gao, Chuanhui Yang, Guoping Wang, Yuzhong Zhao, Fanyu Kong, Hao Liu, Wanhong Wang, and Jinliang Xiao. 2023. OceanBase paetica: A hybrid shared-nothing/shared-everything database for supporting single machine and distributed cluster. Proceedings of the VLDB Endowment 16, 12 (2023), 3728–3740.

Digital Library

[57]

Zhenkun Yang, Chuanhui Yang, Fusheng Han, Mingqiang Zhuang, Bing Yang, Zhifeng Yang, Xiaojun Cheng, Yuzhong Zhao, Wenhui Shi, Huafeng Xi, Huang Yu, Bin Liu, Yi Pan, Boxue Yin, Junquan Chen, and Quanqing Xu. 2022. OceanBase: A 707 Million tpmC distributed relational database system. Proceedings of the VLDB Endowment 15, 12 (2022), 3385–3397.

Digital Library

[58]

Cristian Zambelli, Rino Micheloni, and Piero Olivo. 2019. Reliability challenges in 3D NAND flash memories. In Proceedings of the 2019 IEEE 11th International Memory Workshop (IMW’19). IEEE, 1–4.

[59]

Xuebin Zhang, Jiangpeng Li, Hao Wang, Danni Xiong, Jerry Qu, Hyunsuk Shin, Jung Pill Kim, and Tong Zhang. 2017. Realizing transparent OS/Apps compression in mobile devices at zero latency overhead. IEEE Transactions on Computers 66, 7 (2017), 1188–1199.

Digital Library

[60]

Xuebin Zhang, Jiangpeng Li, Hao Wang, Kai Zhao, and Tong Zhang. 2016. Reducing \(\lbrace\)Solid-State\(\rbrace\) storage device write stress through opportunistic in-place delta compression. In Proceedings of the 14th USENIX Conference on File and Storage Technologies (FAST’16). 111–124.

Digital Library

[61]

Yu Zhang, Ping Huang, Ke Zhou, Hua Wang, Jianying Hu, Yongguang Ji, and Bin Cheng. 2020. \(\lbrace\)OSCA\(\rbrace\): An \(\lbrace\)Online-Model\(\rbrace\) based cache allocation scheme in cloud block storage systems. In Proceedings of the 2020 USENIX Annual Technical Conference (ATC’20). 785–798.

[62]

Bin Zhou, Shenggang Wan, and Changsheng Xie. 2021. Isolation: Inexpensively separating cold data via garbage collection to improve the lifetime and performance of NAND flash SSDs. Concurrency and Computation: Practice and Experience 33, 15 (2021), e5460.

[63]

Ke Zhou, Shaofu Hu, Ping Huang, and Yuhong Zhao. 2017. LX-SSD: Enhancing the lifespan of NAND flash-based memory via recycling invalid pages. In Proceedings of the 2017 IEEE 33rd International Conference on Massive Storage Systems and Technology (MSST’17). 1–13.

[64]

Aviad Zuck, Sivan Toledo, Dmitry Sotnikov, and Danny Harnik. 2014. Compression and \(\lbrace\)SSDs\(\rbrace\): Where and how?. In Proceedings of the 2nd Workshop on Interactions of NVM/Flash with Operating Systems and Workloads (INFLOW’14).

Index Terms

Extremely-Compressed SSDs with I/O Behavior Prediction
1. Information systems
  1. Information storage systems
    1. Information storage technologies
      1. Storage class memory
        Flash memory
    2. Record storage systems
      1. Relational storage
        Compression strategies

Recommendations

Hi-ZNS: High Space Efficiency and Zero-Copy LSM-Tree Based Stores on ZNS SSDs
ICPP '24: Proceedings of the 53rd International Conference on Parallel Processing

The Zoned Namespace (ZNS) SSD is a newly introduced storage device and provides several new ZNS commands to upper-level applications. Zone-reset command is one of the ZNS commands to erase all the flash blocks within a zone. Since data is grouped and ...
ACS: an alternate coding scheme to improve degrade read performance for SSD-based RAID5 systems

To guarantee high performance and reliability, storage systems require better devices and data redundancy schemes, e.g., SSD-based RAID5. However, failures in the large-scale storage systems are common. In order to serve requests on a failed node, the SSD-...
High performance & low latency in solid-state drives through redundancy
INFLOW '13: Proceedings of the 1st Workshop on Interactions of NVM/FLASH with Operating Systems and Workloads

Solid-state drives are becoming increasingly popular in enterprise storage systems, playing the role of large caches and permanent storage. Although SSDs provide faster random access than hard-drives, their performance under read/write workloads is ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Storage

ACM Transactions on Storage Volume 20, Issue 4

November 2024

248 pages

EISSN:1553-3093

DOI:10.1145/3613729

Editor:
Erez Zadok
Stony Brook University, USA

Issue’s Table of Contents

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 August 2024

Online AM: 16 July 2024

Accepted: 01 July 2024

Revised: 22 June 2024

Received: 14 November 2023

Published in TOS Volume 20, Issue 4

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Key R&D Program of China
Natural Science Foundation of Xiamen
National Natural Science Foundation of China
China Fundamental Research Funds
Central Universities
Ant Group through CCF-Ant Research Fund

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
532
Total Downloads

Downloads (Last 12 months)532
Downloads (Last 6 weeks)63

Reflects downloads up to 05 Feb 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Full Text

View this article in Full Text.

Figures

Tables

Media

View full text|Download PDF

View Issue’s Table of Contents