Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

SSD bufferpool extensions for database systems

Published: 01 September 2010 Publication History

Abstract

High-end solid state disks (SSDs) provide much faster access to data compared to conventional hard disk drives. We present a technique for using solid-state storage as a caching layer between RAM and hard disks in database management systems. By caching data that is accessed frequently, disk I/O is reduced. For random I/O, the potential performance gains are particularly significant. Our system continuously monitors the disk access patterns to identify hot regions of the disk. Temperature statistics are maintained at the granularity of an extent, i.e., 32 pages, and are kept current through an aging mechanism. Unlike prior caching methods, once the SSD is populated with pages from warm regions cold pages are not admitted into the cache, leading to low levels of cache pollution. Simulations based on DB2 I/O traces, and a prototype implementation within DB2 both show substantial performance improvements.

References

[1]
L. Belady. A study of replacement algorithms for virtual storage computers. IBM Systems Journal, 5(2):78--101, 1966.
[2]
M. Canim, G. A. Mihaila, B. Bhattacharjee, K. A. Ross, and C. A. Lang. An object placement advisor for DB2 using solid state storage. Proc. VLDB Endow., 2(2):1318--1329, 2009.
[3]
F. Chen, D. A. Koufaty, and X. Zhang. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. In SIGMETRICS '09: Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems, pages 181--192, New York, NY, USA, 2009. ACM.
[4]
Z. Chen, Y. Zhang, Y. Zhou, H. Scott, and B. Schiefer. Empirical evaluation of multi-level buffer cache collaboration for storage systems. SIGMETRICS Perform. Eval. Rev., 33(1):145--156, 2005.
[5]
DB2 for Linux, UNIX and Windows. http://www-01.ibm.com/software/data/db2/linux-unix-windows.
[6]
P. Frazier, P. Andersen, G. Boggs, C. Carrillo, D. Holtzman, J. M. Morris, P. K. Muller, and P. Rubio. Decoupled logical and physical data storage within a database management system. Patent Application no. 20080281939, May 2007.
[7]
P. Gray and H. J. Watson. Present and future directions in data warehousing. SIGMIS Database, 29(3):83--90, 1998.
[8]
B. S. Greenberg and S. H. Webber. The multics multilevel paging hierarchy. In IEEE, editor, Proc 1975 IEEE Intercon, 1975.
[9]
Performance value of solid state drives using IBM i, May 2009. http://www-03.ibm.com/systems/resources/ssd_ibmi.pdf.
[10]
W. H. Inmon. Building the Operational Data Store. John Wiley & Sons, Inc., New York, NY, USA, 1999.
[11]
S. Jiang, K. Davis, and X. Zhang. Coordinated multilevel buffer cache management with consistent access locality quantification. IEEE Trans. Comput., 56(1):95--108, 2007.
[12]
S. Jiang, F. Petrini, X. Ding, and X. Zhang. A locality-aware cooperative cache management protocol to improve network file system performance. In ICDCS '06: Proceedings of the 26th IEEE International Conference on Distributed Computing Systems, page 42, Washington, DC, USA, 2006. IEEE Computer Society.
[13]
T. L. Johnson and W. mei W. Hsu. Run-time adaptive cache hierarchy management via reference analysis. In Proceedings of 24th Annual International Symposium on Computer Architecture, June 1997.
[14]
S.-H. Kim, D. Jung, J.-S. Kim, and S. Maeng. HeteroDrive: Re-shaping the storage access pattern of oltp workload using ssd. In Proceedings of 4th International Workshop on Software Support for Portable Storage (IWSSPS 2009), pages 13--17, October 2009.
[15]
I. Koltsidas and S. Viglas. The case for flash-aware multi level caching. Internet Publication, 2009. http://homepages.inf.ed.ac.uk/s0679010/mfcache-TR.pdf.
[16]
I. Koltsidas and S. Viglas. Utility-aware multi-device caching, 2009.
[17]
C. A. Lang, B. Bhattacharjee, T. Malkemus, S. Padmanabhan, and K. Wong. Increasing buffer-locality for multiple relational table scans through grouping and throttling. Proc. of International Conference on Data Engineering, 0:1136--1145, 2007.
[18]
A. Leventhal. Flash storage memory. Commun. ACM, 51(7):47--51, 2008.
[19]
X. Li, A. Aboulnaga, K. Salem, A. Sachedina, and S. Gao. Second-tier cache management using write hints. In FAST'05: Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies, pages 9--9, Berkeley, CA, USA, 2005. USENIX Association.
[20]
N. Megiddo and D. Modha. ARC: A self-tuning, low overhead replacement cache. In In Proceedings of the 2003 Conference on File and Storage Technologies (FAST, pages 115--130, 2003.
[21]
N. Megiddo and D. S. Modha. Outperforming LRU with an adaptive replacement cache algorithm. Computer, 37(4):58--65, 2004.
[22]
A technical overview of the Sun Oracle Exadata storage server and database machine. Internet Publication, September 2009. http://www.oracle.com/technology/products/bi/db/exadata/pdf/Exadata_Smart_Flash_Cache_TWP_v5.pdf.
[23]
S. Padmanabhan, B. Bhattacharjee, T. Malkemus, L. Cranston, and M. Huras. Multi-dimensional clustering: A new data layout scheme in DB2. In A. Y. Halevy, Z. G. Ives, and A. Doan, editors, SIGMOD Conference, pages 637--641. ACM, 2003.
[24]
R. Ramakrishnan and J. Gehrke. Database Management Systems. McGraw-Hill, 2003.
[25]
J. Samos, F. Saltor, J. Sistac, and A. Bardés. Database architecture for data warehousing: An evolutionary approach. In DEXA '98: Proceedings of the 9th International Conference on Database and Expert Systems Applications, pages 746--756, 1998.
[26]
TPC-C, On-Line Transaction Processing Benchmark. http://www.tpc.org/tpcc/.
[27]
TPC-H, Decision Support Benchmark. http://www.tpc.org/tpch/.
[28]
C. Williamson. On filter effects in web caching hierarchies. ACM Trans. Internet Technol., 2(1):47--77, 2002.
[29]
Y. Zhou, Z. Chen, and K. Li. Second-level buffer cache management. IEEE Transactions on Parallel and Distributed Systems, 15:2004, 2004.

Cited By

View all
  • (2024)PolarDB-MP: A Multi-Primary Cloud-Native Database via Disaggregated Shared MemoryCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3653377(295-308)Online publication date: 9-Jun-2024
  • (2023)What Modern NVMe Storage Can Do, and How to Exploit it: High-Performance I/O for High-Performance Storage EnginesProceedings of the VLDB Endowment10.14778/3598581.359858416:9(2090-2102)Online publication date: 1-May-2023
  • (2022)Writes hurtProceedings of the 13th Symposium on Cloud Computing10.1145/3542929.3563461(110-125)Online publication date: 7-Nov-2022
  • Show More Cited By
  1. SSD bufferpool extensions for database systems

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 3, Issue 1-2
    September 2010
    1658 pages
    ISSN:2150-8097
    • Editors:
    • Elisa Bertino,
    • Paolo Atzeni,
    • Kian Lee Tan,
    • Yi Chen,
    • Y. C. Tay
    Issue’s Table of Contents

    Publisher

    VLDB Endowment

    Publication History

    Published: 01 September 2010
    Published in PVLDB Volume 3, Issue 1-2

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)51
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 22 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)PolarDB-MP: A Multi-Primary Cloud-Native Database via Disaggregated Shared MemoryCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3653377(295-308)Online publication date: 9-Jun-2024
    • (2023)What Modern NVMe Storage Can Do, and How to Exploit it: High-Performance I/O for High-Performance Storage EnginesProceedings of the VLDB Endowment10.14778/3598581.359858416:9(2090-2102)Online publication date: 1-May-2023
    • (2022)Writes hurtProceedings of the 13th Symposium on Cloud Computing10.1145/3542929.3563461(110-125)Online publication date: 7-Nov-2022
    • (2022)HPCache: Memory-Efficient OLAP Through Proportional CachingProceedings of the 18th International Workshop on Data Management on New Hardware10.1145/3533737.3535100(1-9)Online publication date: 12-Jun-2022
    • (2022)Building a Fast and Efficient LSM-tree Store by Integrating Local Storage with Cloud StorageACM Transactions on Architecture and Code Optimization10.1145/352745219:3(1-26)Online publication date: 25-May-2022
    • (2021)HeuristicDBProceedings of the 14th ACM International Conference on Systems and Storage10.1145/3456727.3463774(1-12)Online publication date: 14-Jun-2021
    • (2021)HyR-tree: a spatial index for hybrid flash/3D XPoint storageNeural Computing and Applications10.1007/s00521-021-05804-235:1(133-145)Online publication date: 25-Feb-2021
    • (2020)MosaicProceedings of the VLDB Endowment10.14778/3407790.340785213:12(2662-2675)Online publication date: 1-Jul-2020
    • (2020)System co-design and data management for flash devicesProceedings of the VLDB Endowment10.14778/3402755.34028074:12(1504-1505)Online publication date: 3-Jun-2020
    • (2020)Exploring Performance Characteristics of the Optane 3D Xpoint Storage TechnologyACM Transactions on Modeling and Performance Evaluation of Computing Systems10.1145/33727835:1(1-28)Online publication date: 4-Feb-2020
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media