Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3456727.3463774acmconferencesArticle/Chapter ViewAbstractPublication PagessystorConference Proceedingsconference-collections
research-article
Public Access

HeuristicDB: a hybrid storage database system using a non-volatile memory block device

Published: 14 June 2021 Publication History

Abstract

Hybrid storage systems are widely used in big data fields to balance system performance and cost. However, due to a poor understanding of the characteristics of database block requests, past studies in this area cannot fully utilize the performance gain from emerging storage devices. This study presents a hybrid storage database system, called HeuristicDB, which uses an emerging non-volatile memory (NVM) block device as an extension of the database buffer pool. To consider the unique performance behaviors of NVM block devices and the block-level characteristics of database requests, a set of heuristic rules that associate database (block) requests with the appropriate quality of service for the purpose of caching priority are proposed. Using online analytical processing (OLAP) and online transactional processing (OLTP) benchmarks, both trace-based examination and system implementation on MySQL are carried out to evaluate the effectiveness of the proposed design. The experimental results indicate that HeuristicDB provides up to 75% higher performance and migrates 18X fewer data between storage and the NVM block device than existing systems.

References

[1]
2001. TPC Benchmark H. http://www.tpc.org/tpch/.
[2]
2007. TPC Benchmark E. http://www.tpc.org/tpce/.
[3]
2009. TeraData Virtual Storage System. http://assets.teradata.com/resourceCenter/downloads/Brochures/EB5944.pdf?processed=1.
[4]
2014. Intel SSD DC P3700 Series. https://www.intel.com/content/www/us/en/products/memory-storage/solid-state-drives/data-center-ssds/dc-p3700-series.html.
[5]
2018. Intel Optane SSD DC P4800X Series. https://www.intel.com/content/www/us/en/solid-state-drives/optane-ssd-dc-p4800x-brief.html.
[6]
2019. MySQL 5.7. https://dev.mysql.com/doc/refman/5.7/en/.
[7]
2019. Persistent memory development kit. https://https://github.com/pmem/pmdk.
[8]
2020. Optimizing InnoDB Disk I/O. https://dev.mysql.com/doc/refman/5.7/en/optimizing-innodb-diskio.html.
[9]
Nitin Agrawal, Vijayan Prabhakaran, Ted Wobber, John D Davis, Mark S Manasse, and Rina Panigrahy. 2008. Design tradeoffs for SSD performance. In USENIX Annual Technical Conference, Vol. 57.
[10]
Mohamed Arafa, Bahaa Fahim, Sailesh Kottapalli, Akhilesh Kumar, Lily P Looi, Sreenivas Mandava, Andy Rudoff, Ian M Steiner, Bob Valentine, Geetha Vedaraman, et al. 2019. Cascade lake: Next generation intel xeon scalable processor. IEEE Micro 39, 2 (2019), 29--36.
[11]
Joy Arulraj, Andy Pavlo, and Krishna Teja Malladi. 2019. Multi-tier buffer management and storage system design for non-volatile memory. arXiv preprint arXiv:1901.10938 (2019).
[12]
Ryan Bannon, Alvin Chin, Faryaaz Kassam, Andrew Roszko, and Ric Holt. 2002. Innodb concrete architecture. University of Waterloo (2002).
[13]
Charles Andrew Bell and Sven Sandberg. 2012. Expert MySQL. Vol. 3. Springer.
[14]
Curtis Neal Boger, John Francis Edwards, Randy Lynn Egan, and Michael S Faunce. 2006. Metadata manager for database query optimizer. US Patent 6,996,556.
[15]
Michael L Brundage and Andrew E Kimball. 2006. Query optimizer system and method. US Patent 7,146,352.
[16]
Alan D Brunelle. 2006. Block i/o layer tracing: blktrace. HP, Gelato-Cupertino, CA, USA (2006).
[17]
Geoffrey W Burr, Matthew J Breitwisch, Michele Franceschini, Davide Garetto, Kailash Gopalakrishnan, Bryan Jackson, Bülent Kurdi, Chung Lam, Luis A Lastras, Alvaro Padilla, et al. 2010. Phase change memory technology. Journal of Vacuum Science & Technology B, Nanotechnology and Microelectronics: Materials, Processing, Measurement, and Phenomena 28, 2 (2010), 223--262.
[18]
Mustafa Canim, George A Mihaila, Bishwaranjan Bhattacharjee, Kenneth A Ross, and Christian A Lang. 2009. An object placement advisor for DB2 using solid state storage. Proceedings of the VLDB Endowment 2, 2 (2009), 1318--1329.
[19]
Mustafa Canim, George A Mihaila, Bishwaranjan Bhattacharjee, Kenneth A Ross, and Christian A Lang. 2010. SSD bufferpool extensions for database systems. Proceedings of the VLDB Endowment 3, 1-2 (2010), 1435--1446.
[20]
Jaeyoung Do, Donghui Zhang, Jignesh M Patel, David J DeWitt, Jeffrey F Naughton, and Alan Halverson. 2011. Turbocharging DBMS buffer pool using SSDs. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of data. ACM, 1113--1124.
[21]
Assaf Eisenman, Darryl Gardner, Islam AbdelRahman, Jens Axboe, Siying Dong, Kim Hazelwood, Chris Petersen, Asaf Cidon, and Sachin Katti. 2018. Reducing DRAM footprint with NVM in Facebook. In Proceedings of the Thirteenth EuroSys Conference. ACM, 42.
[22]
Hiroki Fujii, Kousuke Miyaji, Koh Johguchi, Kazuhide Higuchi, Chao Sun, and Ken Takeuchi. 2012. x11 performance increase, x6. 9 endurance enhancement, 93% energy reduction of 3D TSV-integrated hybrid ReRAM/MLC NAND SSDs by data fragmentation suppression. In 2012 symposium on VLSI circuits (VLSIC). IEEE, 134--135.
[23]
Frank T Hady, Annie Foong, Bryan Veal, and Dan Williams. 2017. Platform storage performance with 3D XPoint technology. Proc. IEEE 105, 9 (2017), 1822--1833.
[24]
Joseph Izraelevitz, Jian Yang, Lu Zhang, Juno Kim, Xiao Liu, Amirsaman Memaripour, Yun Joon Soh, Zixuan Wang, Yi Xu, Subramanya R Dulloor, et al. 2019. Basic performance measurements of the intel optane DC persistent memory module. arXiv preprint arXiv:1903.05714 (2019).
[25]
John R Jameson, Philippe Blanchard, John Dinh, Nathan Gonzales, Vasudevan Gopalakrishnan, Berenice Guichet, Shane Hollmer, Sue Hsu, Gideon Intrater, Deepak Kamalanathan, et al. 2016. Conductive bridging RAM (CBRAM): then, now, and tomorrow. ECS Transactions 75, 5 (2016), 41.
[26]
Jaeyong Jeong, Sangwook Shane Hahn, Sungjin Lee, and Jihong Kim. 2014. Lifetime Improvement of {NAND} Flash-based Storage Systems Using Dynamic Program and Erase Scaling. In 12th {USENIX} Conference on File and Storage Technologies ({FAST} 14). 61--74.
[27]
Insoon Jo, Duck-Ho Bae, Andre S Yoon, Jeong-Uk Kang, Sangyeun Cho, Daniel DG Lee, and Jaeheon Jeong. 2016. YourSQL: a high-performance database system leveraging in-storage computing. Proceedings of the VLDB Endowment 9, 12 (2016), 924--935.
[28]
Sungchan Kim, Hyunok Oh, Chanik Park, Sangyeun Cho, and SangWon Lee. 2011. Fast, energy efficient scan inside flash memory SSDs. In Proceeedings of the International Workshop on Accelerating Data Management Systems (ADMS).
[29]
Sungchan Kim, Hyunok Oh, Chanik Park, Sangyeun Cho, Sang-Won Lee, and Bongki Moon. 2016. In-storage processing of database scans and joins. Information Sciences 327 (2016), 183--200.
[30]
Michael Kund, Gerhard Beitel, C-U Pinnow, Thomas Rohr, Jorg Schumann, Ralf Symanczyk, K Ufert, and Gerhard Muller. 2005. Conductive bridging RAM (CBRAM): An emerging non-volatile memory technology scalable to sub 20nm. In Electron Devices Meeting, 2005. IEDM Technical Digest. IEEE International. IEEE, 754--757.
[31]
Youngjin Kwon, Henrique Fingler, Tyler Hunt, Simon Peter, Emmett Witchel, and Thomas Anderson. 2017. Strata: A cross media file system. In Proceedings of the 26th Symposium on Operating Systems Principles. 460--477.
[32]
Tapio Lahdenmaki and Mike Leach. 2005. Relational Database Index Design and the Optimizers: DB2, Oracle, SQL Server, et al. John Wiley & Sons.
[33]
Stefan Lai. 2003. Current status of the phase change memory and its future. In IEEE International Electron Devices Meeting 2003. IEEE, 10--1.
[34]
Xuhui Li, Ashraf Aboulnaga, Kenneth Salem, Aamer Sachedina, and Shaobo Gao. 2005. Second-Tier Cache Management Using Write Hints. In FAST, Vol. 5. 9--9.
[35]
Hongjun Lu, Yuet Yeung Ng, and Zengping Tian. 2000. T-tree or b-tree: Main memory database index structure revisited. In Proceedings 11th Australasian Database Conference. ADC 2000 (Cat. No. PR00528). IEEE, 65--73.
[36]
Tian Luo, Rubao Lee, Michael Mesnier, Feng Chen, and Xiaodong Zhang. 2012. hStorage-DB: heterogeneity-aware data management to exploit the full capability of hybrid storage systems. Proceedings of the VLDB Endowment 5, 10 (2012), 1076--1087.
[37]
Nimrod Megiddo and Dharmendra S Modha. 2004. Outperforming LRU with an adaptive replacement cache algorithm. Computer 37, 4 (2004), 58--65.
[38]
Paul Saab Mohan Srinivasan. 2010. Flashcache. https://github.com/facebookarchive/flashcache.
[39]
Elizabeth J O'neil, Patrick E O'neil, and Gerhard Weikum. 1993. The LRU-K page replacement algorithm for database disk buffering. Acm Sigmod Record 22, 2 (1993), 297--306.
[40]
Jiaxin Ou, Jiwu Shu, Youyou Lu, Letian Yi, and Wei Wang. 2014. EDM: An endurance-aware data migration scheme for load balancing in SSD storage clusters. In 2014 IEEE 28th International Parallel and Distributed Processing Symposium. IEEE, 787--796.
[41]
Moinuddin K Qureshi, Vijayalakshmi Srinivasan, and Jude A Rivers. 2009. Scalable high performance main memory system using phase-change memory technology. In Proceedings of the 36th annual international symposium on Computer architecture. 24--33.
[42]
Reza Salkhordeh, Mostafa Hadizadeh, and Hossein Asadi. 2018. An Efficient Hybrid I/O Caching Architecture Using Heterogeneous SSDs. IEEE Transactions on Parallel and Distributed Systems 30, 6 (2018), 1238--1250.
[43]
Kyungjune Son, Kyungjun Cho, Subin Kim, Gapyeol Park, Kyunghwan Song, and Journ Kim. 2018. Modeling and signal integrity analysis of 3D XPoint memory cells and interconnections with memory size variations during read operation. In 2018 IEEE Symposium on Electromagnetic Compatibility, Signal Integrity and Power Integrity (EMC, SI & PI). IEEE, 223--227.
[44]
Gokul Soundararajan, Vijayan Prabhakaran, Mahesh Balakrishnan, and Ted Wobber. 2010. Extending SSD Lifetimes with Disk-Based Write Caches. In FAST, Vol. 10. 101--114.
[45]
Ralf Symanczyk, Jan Keller, Michael Kund, Gerhard Muller, Bernhard Ruf, Paul-Henri Albarede, Serge Bournat, Laurent Bouteille, Alexander Duch, et al. 2007. Conductive bridging memory development from single cells to 2Mbit memory arrays. In 2007 Non-Volatile Memory Technology Symposium. IEEE, 71--75.
[46]
H-S Philip Wong, Simone Raoux, SangBum Kim, Jiale Liang, John P Reifenberg, Bipin Rajendran, Mehdi Asheghi, and Kenneth E Goodson. 2010. Phase change memory. Proc. IEEE 98, 12 (2010), 2201--2227.
[47]
Shiqin Yan, Huaicheng Li, Mingzhe Hao, Michael Hao Tong, Swaminathan Sundararaman, Andrew A Chien, and Haryadi S Gunawi. 2017. Tiny-tail flash: Near-perfect elimination of garbage collection tail latencies in NAND SSDs. ACM Transactions on Storage (TOS) 13, 3 (2017), 1--26.
[48]
Jian Yang, Juno Kim, Morteza Hoseinzadeh, Joseph Izraelevitz, and Steve Swanson. 2020. An empirical guide to the behavior and use of scalable persistent memory. In 18th {USENIX} Conference on File and Storage Technologies ({FAST} 20). 169--182.
[49]
Jinfeng Yang, Bingzhe Li, and David J Lilja. 2020. Exploring Performance Characteristics of the Optane 3D Xpoint Storage Technology. ACM Transactions on Modeling and Performance Evaluation of Computing Systems (TOMPECS) 5, 1 (2020), 1--28.
[50]
Jinfeng Yang and David J Lilja. 2018. Reducing Relational Database Performance Bottlenecks Using 3D XPoint Storage Technology. In 2018 17th IEEE International Conference On Trust, Security And Privacy In Computing And Communications/12th IEEE International Conference On Big Data Science And Engineering (TrustCom/BigDataSE). IEEE, 1804--1808.
[51]
Zhengyu Yang, Morteza Hoseinzadeh, Allen Andrews, Clay Mayers, David Thomas Evans, Rory Thomas Bolt, Janki Bhimani, Ningfang Mi, and Steven Swanson. 2017. AutoTiering: automatic data placement manager in multi-tier all-flash datacenter. In 2017 IEEE 36th International Performance Computing and Communications Conference (IPCCC). IEEE, 1--8.
[52]
Ting Yao, Yiwen Zhang, Jiguang Wan, Qiu Cui, Liu Tang, Hong Jiang, Changsheng Xie, and Xubin He. 2020. MatrixKV: Reducing Write Stalls and Write Amplification in LSM-tree Based {KV} Stores with Matrix Container in {NVM}. In 2020 {USENIX} Annual Technical Conference ({USENIX}{ATC} 20). 17--31.
[53]
Qian Zhao and Hao Chen. 2010. PCI Express interface. US Patent 7,673,092.
[54]
Yuanyuan Zhou, James Philbin, and Kai Li. 2001. The Multi-Queue Replacement Algorithm for Second Level Buffer Caches. In USENIX Annual Technical Conference, General Track. 91--104.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SYSTOR '21: Proceedings of the 14th ACM International Conference on Systems and Storage
June 2021
226 pages
ISBN:9781450383981
DOI:10.1145/3456727
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

In-Cooperation

  • Technion: Israel Institute of Technology
  • USENIX Assoc: USENIX Assoc

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 June 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. hybrid storage database management system
  2. non-volatile memory block device
  3. relational database management system

Qualifiers

  • Research-article

Funding Sources

Conference

SYSTOR '21
Sponsor:

Acceptance Rates

SYSTOR '21 Paper Acceptance Rate 18 of 63 submissions, 29%;
Overall Acceptance Rate 108 of 323 submissions, 33%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 476
    Total Downloads
  • Downloads (Last 12 months)119
  • Downloads (Last 6 weeks)20
Reflects downloads up to 20 Feb 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media