Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Extremely-Compressed SSDs with I/O Behavior Prediction

Published: 06 August 2024 Publication History

Abstract

As the data volume continues to grow exponentially, there is an increasing demand for large storage system capacity. Data compression techniques effectively reduce the volume of written data, enhancing space efficiency. As a result, many modern SSDs have already incorporated data compression capabilities. However, data compression introduces additional processing overhead in critical I/O paths, potentially affecting system performance. Currently, most compression solutions in flash-based storage systems employ fixed compression algorithms for all incoming data without leveraging differences among various data access patterns. This leads to sub-optimal compression efficiency.
This article proposes a data-type-aware Flash Translation Layer (DAFTL) scheme to maximize space efficiency without compromising system performance. First, we propose an I/O behavior prediction method to forecast future access on specific data. Then, DAFTL matches data types with distinct I/O behaviors to compression algorithms of varying intensities, achieving an optimal balance between performance and space efficiency. Specifically, it employs higher-intensity compression algorithms for less frequently accessed data to maximize space efficiency. For frequently accessed data, it utilizes lower-intensity but faster compression algorithms to maintain system performance. Finally, an improved compact compression method is proposed to effectively eliminate page fragmentation and further enhance space efficiency. Extensive evaluations using a variety of real-world workloads, as well as the workloads with real data we collected on our platforms, demonstrate that DAFTL achieves more data reductions than other approaches. When compared to the state-of-the-art compression schemes, DAFTL reduces the total number of pages written to the SSD by an average of 8%, 21.3%, and 25.6% for data with high, medium, and low compressibility, respectively. In the case of workloads with real data, DAFTL achieves an average reduction of 10.4% in the total number of pages written to SSD. Furthermore, DAFTL exhibits comparable or even improved read and write performance compared to other solutions.

References

[1]
Przemyslaw Skibinski, Jinfei Han, Dmitry Atamanov, Andrea Bocci, and Chip Turner. 2015. Lzbench. Retrieved 30 June 2024 from https://github.com/inikep/lzbench
[2]
Milosz Krajewski. 2015. Silesia Compression Corpus. Retrieved 30 June 2024 from https://github.com/MiloszKrajewski/SilesiaCorpus
[3]
Facebook. 2016. ZSTD. Retrieved 30 June 2024 from https://github.com/facebook/zstd
[4]
Chao Shi and Qiuping Wang. 2018. Alibaba Block Traces. Retrieved 30 June 2024 from https://github.com/alibaba/block-traces
[5]
Ohad Rodeh, Josef Bacik, and Chris Mason. 2013. BTRFS: The linux B-tree filesystem. ACM Transactions on Storage (TOS) 9, 3 (2013), 1–32.
[6]
ScaleFlux. 2023. CSD3000 Solid State Drive. Retrieved 30 June 2024 from https://scaleflux.com/products/csd-3000
[7]
OpenZFS. 2023. OpenZFS on Linux and FreeBSD. Retrieved 30 June 2024 from https://github.com/openzfs/zfs
[8]
Abutalib Aghayev, Theodore Ts’o, Garth Gibson, and Peter Desnoyers. 2017. Evolving ext4 for shingled disks. In Proceedings of the 15th USENIX Conference on File and Storage Technologies (FAST’17). 105–120.
[9]
Tobias Ahrens, Mohammed Rajab, and Jürgen Freudenberger. 2016. Compression of short data blocks to improve the reliability of non-volatile flash memories. In Proceedings of the 2016 International Conference on Information and Digital Technologies (IDT’16). IEEE, 1–4.
[10]
Matěj Bartík, Sven Ubik, and Pavel Kubalik. 2015. LZ4 compression algorithm on FPGA. In Proceedings of the 2015 IEEE International Conference on Electronics, Circuits, and Systems (ICECS’15). IEEE, 179–182.
[11]
Chandranil Chakraborttii and Heiner Litz. 2021. Reducing write amplification in flash by death-time prediction of logical block addresses. In Proceedings of the 14th ACM International Conference on Systems and Storage. 1–12.
[12]
Chin-Hsing Chen, Chun-Ta Chen, and Wen-Tzeng Huang. 2008. The real-time compression layer for flash memory in mobile multimedia devices. Mobile Networks and Applications 13, 6 (2008), 547–554.
[13]
Feng Chen, David A. Koufaty, and Xiaodong Zhang. 2009. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. ACM SIGMETRICS Performance Evaluation Review 37, 1 (2009), 181–192.
[14]
Xubin Chen, Yin Li, Jingpeng Hao, Hyunsuk Shin, Michael Suh, and Tong Zhang. 2019. Simultaneously reducing cost and improving performance of NVM-based block devices via transparent data compression. In Proceedings of the International Symposium on Memory Systems. 331–341.
[15]
Xubin Chen, Yin Li, and Tong Zhang. 2018. Reducing flash memory write traffic by exploiting a few MBs of capacitor-powered write buffer inside solid-state drives (SSDs). IEEE Transactions on Computers 68, 3 (2018), 426–439.
[16]
Jinhua Cui, Youtao Zhang, Liang Shi, Chun Jason Xue, Weiguo Wu, and Jun Yang. 2018. ApproxFTL: On the performance and lifetime improvement of 3-D NAND flash-based SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 37, 10 (2018), 1957–1970. DOI:
[17]
Carlos Escuin, Asif Ali Khan, Pablo Ibánez, Teresa Monreal, Jeronimo Castrillon, and Víctor Viñals. 2023. Compression-aware and performance-efficient insertion policies for long-lasting hybrid LLCs. In Proceedings of the 2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA’23). IEEE, 179–192.
[18]
Raúl Gracia-Tinedo, Danny Harnik, Dalit Naor, Dmitry Sotnikov, Sivan Toledo, and Aviad Zuck. 2015. \(\lbrace\)SDGen\(\rbrace\): Mimicking datasets for content generation in storage benchmarks. In Proceedings of the 13th USENIX Conference on File and Storage Technologies (FAST’15). 317–330.
[19]
Jen-Wei Hsieh, Tei-Wei Kuo, and Li-Pin Chang. 2006. Efficient identification of hot data for flash memory storage systems. ACM Transactions on Storage (TOS) 2, 1 (2006), 22–40.
[20]
Xiaokang Hu, Fuzong Wang, Weigang Li, Jian Li, and Haibing Guan. 2019. \(\lbrace\)QZFS\(\rbrace\):\(\lbrace\)QAT\(\rbrace\) Accelerated compression in file system for application agnostic and cost efficient data storage. In Proceedings of the 2019 USENIX Annual Technical Conference (ATC’19). 163–176.
[21]
Yang Hu, Hong Jiang, Dan Feng, Lei Tian, Hao Luo, and Shuping Zhang. 2011. Performance impact and interplay of SSD parallelism through advanced commands, allocation strategy and data granularity. In Proceedings of the International Conference on Supercomputing. 96–107.
[22]
Cheng Ji, Li-Pin Chang, Riwei Pan, Chao Wu, Congming Gao, Liang Shi, Tei-Wei Kuo, and Chun Jason Xue. 2021. \(\lbrace\)Pattern-Guided\(\rbrace\) file compression with \(\lbrace\)User-Experience\(\rbrace\) enhancement for \(\lbrace\)Log-Structured\(\rbrace\) file system on mobile devices. In Proceedings of the 19th USENIX Conference on File and Storage Technologies (FAST’21). 127–140.
[23]
Cheng Ji, Li-Pin Chang, Liang Shi, Congming Gao, Chao Wu, Yuangang Wang, and Chun Jason Xue. 2017. Lightweight data compression for mobile flash storage. ACM Transactions on Embedded Computing Systems (TECS) 16, 5s (2017), 1–18.
[24]
Jeong-Uk Kang, Jeeseok Hyun, Hyunjoo Maeng, and Sangyeun Cho. 2014. The multi-streamed \(\lbrace\)Solid-State\(\rbrace\) drive. In Proceedings of the 6th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’14).
[25]
Mincheol Kang, Wonyoung Lee, Jinkwon Kim, and Soontae Kim. 2022. PR-SSD: Maximizing partial read potential by exploiting compression and channel-level parallelism. IEEE Transactions on Computers 72, 3 (2022), 772–785.
[26]
Mincheol Kang, Wonyoung Lee, and Soontae Kim. 2018. Subpage-aware solid state drive for improving lifetime and performance. IEEE Transactions on Computers 67, 10 (2018), 1492–1505.
[27]
Dohyun Kim, Kwangwon Min, Joontaek Oh, and Youjip Won. 2022. \(\lbrace\)ScaleXFS\(\rbrace\): Getting scalability of \(\lbrace\)XFS\(\rbrace\) back on the ring. In Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST’22). 329–344.
[28]
Jungrae Kim, Michael Sullivan, Seong-Lyong Gong, and Mattan Erez. 2015. Frugal ECC: Efficient and versatile memory error protection through fine-grained compression. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. 1–12.
[29]
Changman Lee, Dongho Sim, Jooyoung Hwang, and Sangyeun Cho. 2015. \(\lbrace\)F2FS\(\rbrace\): A new file system for flash storage. In Proceedings of the 13th USENIX Conference on File and Storage Technologies (FAST’15). 273–286.
[30]
Jongsung Lee and Jin-Soo Kim. 2013. An empirical study of hot/cold data separation policies in solid state drives (SSDs). In Proceedings of the 6th International Systems and Storage Conference. 1–6.
[31]
Sungjin Lee, Jihoon Park, Kermin Fleming, Arvind Arvind, and Jihong Kim. 2011. Improving performance and lifetime of solid-state drives using hardware-accelerated compression. In IEEE Transactions on Consumer Electronics 57, 4 (2011), 1732–1739.
[32]
Jinhong Li, Qiuping Wang, Patrick P. C. Lee, and Chao Shi. 2023. An in-depth comparative analysis of cloud block storage workloads: Findings and implications. ACM Transactions on Storage 19, 2 (2023), 1–32.
[33]
Jiangpeng Li, Kai Zhao, Xuebin Zhang, Jun Ma, Ming Zhao, and Tong Zhang. 2015. How much can data compressibility help to improve \(\lbrace\)NAND\(\rbrace\) flash memory lifetime?. In Proceedings of the 13th USENIX Conference on File and Storage Technologies (FAST’15). 227–240.
[34]
Qiao Li, Liang Shi, Riwei Pan, Cheng Ji, Xiaoqiang Li, and Chun Jason Xue. 2018. Selective compression scheme for read performance improvement on flash devices. In Proceedings of the 2018 IEEE 36th International Conference on Computer Design (ICCD’18). IEEE, 43–50.
[35]
Qiao Li, Liang Shi, Chun Jason Xue, Kaijie Wu, Cheng Ji, Qingfeng Zhuge, and Edwin H-M Sha. 2016. Access characteristic guided read and write cost regulation for performance improvement on flash memory. In Proceedings of the14th USENIX Conference on File and Storage Technologies (FAST’16). 125–132.
[36]
Qiang Li, Qiao Xiang, Yuxin Wang, Haohao Song, Ridi Wen, Wenhui Yao, Yuanyuan Dong, Shuqi Zhao, Shuo Huang, Zhaosheng Zhu, Huayong Wang, Shanyang Liu, Lulu Chen, Zhiwu Wu, Haonan Qiu, Derui Liu, Gexiao Tian, Chao Han, Shaozong Liu, Yaohui Wu, Zicheng Luo, Yuchao Shao, Junping Wu, Zheng Cao, Zhongjie Wu, Jiaji Zhu, Jinbo Wu, Jiwu Shu, and Jiesheng Wu. 2023. More than capacity: Performance-oriented evolution of Pangu in Alibaba. In Proceedings of the 21st USENIX Conference on File and Storage Technologies (FAST’23). 331–346.
[37]
Weiqiang Liu, Faqiang Mei, Chenghua Wang, Maire O’Neill, and Earl E. Swartzlander. 2018. Data compression device based on modified LZ4 algorithm. IEEE Transactions on Consumer Electronics 64, 1 (2018), 110–117.
[38]
Thanos Makatos, Yannis Klonatos, Manolis Marazakis, Michail D. Flouris, and Angelos Bilas. 2010. Using transparent compression to improve SSD-based I/O caches. In Proceedings of the 5th European Conference on Computer Systems. 1–14.
[39]
Bo Mao, Suzhen Wu, Hong Jiang, Yaodong Yang, and Zaifa Xi. 2018. EDC: Improving the performance and space efficiency of flash-based storage systems with elastic data compression. IEEE Transactions on Parallel and Distributed Systems 29, 6 (2018), 1261–1274.
[40]
Nimrod Megiddo and Dharmendra S. Modha. 2004. Outperforming LRU with an adaptive replacement cache algorithm. Computer 37, 4 (2004), 58–65.
[41]
Changwoo Min, Sanidhya Kashyap, Steffen Maass, and Taesoo Kim. 2016. Understanding manycore scalability of file systems. In Proceedings of the 2016 USENIX Annual Technical Conference (ATC’16). 71–85.
[42]
Dushyanth Narayanan, Austin Donnelly, and Antony Rowstron. 2008. Write off-loading: Practical power management for enterprise storage. ACM Transactions on Storage (TOS) 4, 3 (2008), 1–23.
[43]
Kazuichi Oe, Kazutaka Ogihara, and Takeo Honda. 2018. Analysis of commercial cloud workload and study on how to apply cache methods. IEICE Technical Report; IEICE Tech. Rep. 118, 165 (2018), 7–12.
[44]
Elizabeth J. O’neil, Patrick E. O’neil, and Gerhard Weikum. 1993. The LRU-K page replacement algorithm for database disk buffering. ACM SIGMOD Record 22, 2 (1993), 297–306.
[45]
Yubiao Pan, Yongkun Li, Huizhen Zhang, and Yinlong Xu. 2019. Lifetime-aware FTL to improve the lifetime and performance of solid-state drives. Future Generation Computer Systems 93 (2019), 58–67.
[46]
Lu Pang, Anis Alazzawe, Krishna Kant, and Jeremy Swift. 2019. Data heat prediction in storage systems using behavior specific prediction models. In Proceedings of the 2019 IEEE 38th International Performance Computing and Communications Conference (IPCCC’19). IEEE, 1–8.
[47]
Jisung Park, Jeonggyun Kim, Yeseong Kim, Sungjin Lee, and Onur Mutlu. 2022. \(\lbrace\)DeepSketch\(\rbrace\): A new machine \(\lbrace\)Learning-Based\(\rbrace\) reference search technique for \(\lbrace\)Post-Deduplication\(\rbrace\) delta compression. In Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST’22). 247–264.
[48]
Youngjo Park and Jin-Soo Kim. 2011. zFTL: Power-efficient data compression support for NAND flash-based consumer electronics devices. IEEE Transactions on Consumer Electronics 57, 3 (2011), 1148–1156.
[49]
Yifan Qiao, Xubin Chen, Ning Zheng, Jiangpeng Li, Yang Liu, and Tong Zhang. 2022. Closing the B+-tree vs.\(\lbrace\)LSM-tree\(\rbrace\) write amplification gap on modern storage hardware with built-in transparent compression. In Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST’22). 69–82.
[50]
Mansour Shafaei, Peter Desnoyers, and Jim Fitzpatrick. 2016. Write amplification reduction in \(\lbrace\)Flash-Based\(\rbrace\)\(\lbrace\)SSDs\(\rbrace\) through \(\lbrace\)Extent-Based\(\rbrace\) temperature identification. In Proceedings of the 8th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’16).
[51]
Yunpeng Song, Yiyang Huang, Yina Lv, Yi Zhang, and Liang Shi. 2023. When F2FS meets compression-based SSD!. In Proceedings of the 15th ACM Workshop on Hot Topics in Storage and File Systems. 87–92.
[52]
Qiuping Wang, Jinhong Li, Patrick P. C. Lee, Tao Ouyang, Chao Shi, and Lilong Huang. 2022. Separating data via block invalidation time inference for write amplification reduction in \(\lbrace\)Log-Structured\(\rbrace\) storage. In Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST’22). 429–444.
[53]
Guanying Wu and Xubin He. 2012. Delta-FTL: Improving SSD lifetime via exploiting content locality. In Proceedings of the 7th ACM European Conference on Computer Systems. 253–266.
[54]
Fei Xie, Michael Condict, and Sandip Shete. 2013. Estimating duplication by content-based sampling. In Proceedings of the 2013 USENIX Annual Technical Conference (ATC’13). 181–186.
[55]
Lei Yang, Hong Wu, Tieying Zhang, Xuntao Cheng, Feifei Li, Lei Zou, Yujie Wang, Rongyao Chen, Jianying Wang, and Gui Huang. 2020. Leaper: A learned prefetcher for cache invalidation in LSM-tree based storage engines. Proceedings of the VLDB Endowment 13, 12 (2020), 1976–1989.
[56]
Zhifeng Yang, Quanqing Xu, Shanyan Gao, Chuanhui Yang, Guoping Wang, Yuzhong Zhao, Fanyu Kong, Hao Liu, Wanhong Wang, and Jinliang Xiao. 2023. OceanBase paetica: A hybrid shared-nothing/shared-everything database for supporting single machine and distributed cluster. Proceedings of the VLDB Endowment 16, 12 (2023), 3728–3740.
[57]
Zhenkun Yang, Chuanhui Yang, Fusheng Han, Mingqiang Zhuang, Bing Yang, Zhifeng Yang, Xiaojun Cheng, Yuzhong Zhao, Wenhui Shi, Huafeng Xi, Huang Yu, Bin Liu, Yi Pan, Boxue Yin, Junquan Chen, and Quanqing Xu. 2022. OceanBase: A 707 Million tpmC distributed relational database system. Proceedings of the VLDB Endowment 15, 12 (2022), 3385–3397.
[58]
Cristian Zambelli, Rino Micheloni, and Piero Olivo. 2019. Reliability challenges in 3D NAND flash memories. In Proceedings of the 2019 IEEE 11th International Memory Workshop (IMW’19). IEEE, 1–4.
[59]
Xuebin Zhang, Jiangpeng Li, Hao Wang, Danni Xiong, Jerry Qu, Hyunsuk Shin, Jung Pill Kim, and Tong Zhang. 2017. Realizing transparent OS/Apps compression in mobile devices at zero latency overhead. IEEE Transactions on Computers 66, 7 (2017), 1188–1199.
[60]
Xuebin Zhang, Jiangpeng Li, Hao Wang, Kai Zhao, and Tong Zhang. 2016. Reducing \(\lbrace\)Solid-State\(\rbrace\) storage device write stress through opportunistic in-place delta compression. In Proceedings of the 14th USENIX Conference on File and Storage Technologies (FAST’16). 111–124.
[61]
Yu Zhang, Ping Huang, Ke Zhou, Hua Wang, Jianying Hu, Yongguang Ji, and Bin Cheng. 2020. \(\lbrace\)OSCA\(\rbrace\): An \(\lbrace\)Online-Model\(\rbrace\) based cache allocation scheme in cloud block storage systems. In Proceedings of the 2020 USENIX Annual Technical Conference (ATC’20). 785–798.
[62]
Bin Zhou, Shenggang Wan, and Changsheng Xie. 2021. Isolation: Inexpensively separating cold data via garbage collection to improve the lifetime and performance of NAND flash SSDs. Concurrency and Computation: Practice and Experience 33, 15 (2021), e5460.
[63]
Ke Zhou, Shaofu Hu, Ping Huang, and Yuhong Zhao. 2017. LX-SSD: Enhancing the lifespan of NAND flash-based memory via recycling invalid pages. In Proceedings of the 2017 IEEE 33rd International Conference on Massive Storage Systems and Technology (MSST’17). 1–13.
[64]
Aviad Zuck, Sivan Toledo, Dmitry Sotnikov, and Danny Harnik. 2014. Compression and \(\lbrace\)SSDs\(\rbrace\): Where and how?. In Proceedings of the 2nd Workshop on Interactions of NVM/Flash with Operating Systems and Workloads (INFLOW’14).

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Storage
ACM Transactions on Storage  Volume 20, Issue 4
November 2024
248 pages
EISSN:1553-3093
DOI:10.1145/3613729
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 August 2024
Online AM: 16 July 2024
Accepted: 01 July 2024
Revised: 22 June 2024
Received: 14 November 2023
Published in TOS Volume 20, Issue 4

Check for updates

Author Tags

  1. Solid-state drives
  2. I/O behavior prediction
  3. data compression
  4. space efficiency

Qualifiers

  • Research-article

Funding Sources

  • National Key R&D Program of China
  • Natural Science Foundation of Xiamen
  • National Natural Science Foundation of China
  • China Fundamental Research Funds
  • Central Universities
  • Ant Group through CCF-Ant Research Fund

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 532
    Total Downloads
  • Downloads (Last 12 months)532
  • Downloads (Last 6 weeks)63
Reflects downloads up to 05 Feb 2025

Other Metrics

Citations

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media