Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Native store extension for SAP HANA

Published: 01 August 2019 Publication History

Abstract

We present an overview of SAP HANA's Native Store Extension (NSE). This extension substantially increases database capacity, allowing to scale far beyond available system memory. NSE is based on a hybrid in-memory and paged column store architecture composed from data access primitives. These primitives enable the processing of hybrid columns using the same algorithms optimized for traditional HANA's in-memory columns. Using only three key primitives, we fabricated byte-compatible counterparts for complex memory resident data structures (e.g. dictionary and hash-index), compressed schemes (e.g. sparse and run-length encoding), and exotic data types (e.g. geo-spatial). We developed a new buffer cache which optimizes the management of paged resources by smart strategies sensitive to page type and access patterns. The buffer cache integrates with HANA's new execution engine that issues pipelined prefetch requests to improve disk access patterns. A novel load unit configuration, along with a unified persistence format, allows the hybrid column store to dynamically switch between in-memory and paged data access to balance performance and storage economy according to application demands while reducing Total Cost of Ownership (TCO). A new partitioning scheme supports load unit specification at table, partition, and column level. Finally, a new advisor recommends optimal load unit configurations. Our experiments illustrate the performance and memory footprint improvements on typical customer scenarios.

References

[1]
Oracle Database 19c In-Memory Guide. https://docs.oracle.com/en/database/oracle/oracle-database/19/inmem/, 2019. {Online; accessed 03-February-2019}.
[2]
D. Abadi, P. Boncz, and S. Harizopoulos. The Design and Implementation of Modern Column-Oriented Database Systems. Now Publishers Inc., 2013.
[3]
D. V. Aken, A. Pavlo, G. J. Gordon, and B. Zhang. Automatic Database Management System Tuning Through Large-scale Machine Learning. In ACM SIGMOD, pages 1009--1024, 2017.
[4]
K. Alexiou, D. Kossmann, and P.-Å. Larson. Adaptive Range Filters for Cold Data: Avoiding Trips to Siberia. PVLDB, 6(14):1714--1725, 2013.
[5]
T. Anderson. Microsoft SQL Server 14 man: Nothing stops a Hekaton transaction. http://www.theregister.co.uk/2013/06/03/microsoft_sql_server_14_teched/, 2013. {Online; accessed 03-February-2019}.
[6]
M. Andrei, C. Lemke, G. Radestock, R. Schulze, C. Thiel, R. Blanco, A. Meghlan, M. Sharique, S. Seifert, S. Vishnoi, D. Booss, T. Peh, I. Schreter, W. Thesing, M. Wagle, and T. Willhalm. SAP HANA Adoption of Non-Volatile Memory. PVLDB, 10(12):1754--1765, 2017.
[7]
J. Arulraj, A. Pavlo, and K. T. Malladi. Multi-Tier Buffer Management and Storage System Design for Non-Volatile Memory. CoRR, abs/1901.10938, 2019.
[8]
B. Bhattacharjee, L. Lim, T. Malkemus, G. Mihaila, K. Ross, S. Lau, C. McArthur, Z. Toth, and R. Sherkat. Efficient Index Compression in DB2 LUW. PVLDB, 2(2):1462--1473, 2009.
[9]
J. Do, D. Zhang, J. M. Patel, D. J. DeWitt, J. Naughton, and A. Halverson. Turbocharging DBMS Buffer Pool Using SSDs. In ACM SIGMOD, pages 1113--1124, 2011.
[10]
A. Eldawy, L. Alarabi, and M. F. Mokbel. Spatial Partitioning Techniques in SpatialHadoop. PVLDB, 8(12):1602--1605, 2015.
[11]
S. Finkelstein, M. Schkolnick, and P. Tiberio. Physical Database Design for Relational Databases. ACM TODS, 13(1):91--128, 1988.
[12]
G. Graefe, H. Volos, H. Kimura, H. Kuno, J. Tucek, M. Lillibridge, and A. Veitch. In-memory Performance for Big Data. PVLDB, 8(1):37--48, 2014.
[13]
A. Gurajada, D. Gala, F. Zhou, A. Pathak, and Z. F. Ma. BTrim: Hybrid In-memory Database Architecture for Extreme Transaction Processing in VLDBs. PVLDB, 11(12):1889--1901, 2018.
[14]
H. Lang, T. Mühlbauer, F. Funke, P. Boncz, T. Neumann, and A. Kemper. Data Blocks: Hybrid OLTP and OLAP on Compressed Storage Using Both Vectorization and Compilation. In ACM SIGMOD, pages 311--326, 2016.
[15]
P.-Å. Larson, A. Birka, E. N. Hanson, W. Huang, M. Nowakiewicz, and V. Papadimos. Real-time Analytical Processing with SQL Server. PVLDB, 8(12):1740--1751, 2015.
[16]
J. Lee, H. Shin, C. G. Park, S. Ko, J. Noh, Y. Chuh, W. Stephan, and W.-S. Han. Hybrid Garbage Collection for Multi-Version Concurrency Control in SAP HANA. In ACM SIGMOD, pages 1307--1318, 2016.
[17]
V. Leis, M. Haubenschild, A. Kemper, and T. Neumann. LeanStore: In-Memory Data Management beyond Main Memory. In IEEE ICDE, pages 185--196, 2018.
[18]
C. Lemke, K. Sattler, F. Faerber, and A. Zeier. Speeding Up Queries in Column Stores - A Case for Compression. In DAWAK, pages 117--129, 2010.
[19]
X. Liu and K. Salem. Hybrid Storage Management for Database Systems. PVLDB, 6(8):541--552, 2013.
[20]
N. May, A. Böhm, and W. Lehner. SAP HANA - The Evolution of an In-Memory DBMS from Pure OLAP Processing Towards Mixed Workloads. In BTW, pages 545--563, 2017.
[21]
P. Menon, T. C. Mowry, and A. Pavlo. Relaxed Operator Fusion for In-memory Databases: Making Compilation, Vectorization, and Prefetching Work Together at Last. PVLDB, 11(1):1--13, 2017.
[22]
R. Nehme and N. Bruno. Automated Partitioning Design in Parallel Database Systems. In ACM SIGMOD, pages 1137--1148, 2011.
[23]
T. Neumann. Efficiently Compiling Efficient Query Plans for Modern Hardware. PVLDB, 4(9):539--550, 2011.
[24]
A. Nica, R. Sherkat, M. Andrei, X. Cheng, M. Heidel, C. Bensberg, and H. Gerwens. Statisticum: Data Statistics Management in SAP HANA. PVLDB, 10(12):1658--1669, 2017.
[25]
S. T. On, Y. Li, B. He, M. Wu, Q. Luo, and J. Xu. FD-buffer: A Buffer Manager for Databases on Flash Disks. In ACM CIKM, pages 1297--1300, 2010.
[26]
H. Plattner. The Impact of Columnar In-memory Databases on Enterprise Systems: Implications of Eliminating Transaction-maintained Aggregates. PVLDB, 7(13):1722--1729, 2014.
[27]
M. Poess and D. Potapov. Data Compression in Oracle. In VLDB, pages 937--947, 2003.
[28]
G. M. Sacco and M. Schkolnick. A Mechanism for Managing the Buffer Pool in a Relational Database System Using the Hot Set Model. In VLDB, pages 257--262, 1982.
[29]
R. Sherkat, C. Florendo, M. Andrei, A. Goel, A. Nica, P. Bumbulis, I. Schreter, G. Radestock, C. Bensberg, D. Booss, and H. Gerwens. Page As You Go: Piecewise Columnar Access In SAP HANA. In ACM SIGMOD, pages 1295--1306, 2016.
[30]
R. Stoica and A. Ailamaki. Enabling Efficient OS Paging for Main-memory OLTP Databases. In ACM DaMoN, pages 7:1--7:7, 2013.
[31]
T. Willhalm, I. Oukid, I. Müller, and F. Faerber. Vectorizing Database Column Scans with Complex Predicates. In ADMS@VLDB, pages 1--12, 2013.
[32]
D. Xie, F. Li, B. Yao, G. Li, L. Zhou, and M. Guo. Simba: Efficient In-Memory Spatial Analytics. In ACM SIGMOD, pages 1071--1085, 2016.
[33]
D. C. Zilio, J. Rao, S. Lightstone, G. Lohman, A. Storm, C. Garcia-Arellano, and S. Fadden. DB2 Design Advisor: Integrated Automatic Physical Database Design. In VLDB, pages 1087--1097, 2004.

Cited By

View all
  • (2023)PolarDB-SCC: A Cloud-Native Database Ensuring Low Latency for Strongly Consistent ReadsProceedings of the VLDB Endowment10.14778/3611540.361156216:12(3754-3767)Online publication date: 1-Aug-2023
  • (2023)Elastic Use of Far Memory for In-Memory Database Management SystemsProceedings of the 19th International Workshop on Data Management on New Hardware10.1145/3592980.3595311(35-43)Online publication date: 18-Jun-2023
  • (2022)ByteHTAPProceedings of the VLDB Endowment10.14778/3554821.355483215:12(3411-3424)Online publication date: 29-Sep-2022
  • Show More Cited By
  1. Native store extension for SAP HANA

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 12, Issue 12
    August 2019
    547 pages

    Publisher

    VLDB Endowment

    Publication History

    Published: 01 August 2019
    Published in PVLDB Volume 12, Issue 12

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)37
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 03 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)PolarDB-SCC: A Cloud-Native Database Ensuring Low Latency for Strongly Consistent ReadsProceedings of the VLDB Endowment10.14778/3611540.361156216:12(3754-3767)Online publication date: 1-Aug-2023
    • (2023)Elastic Use of Far Memory for In-Memory Database Management SystemsProceedings of the 19th International Workshop on Data Management on New Hardware10.1145/3592980.3595311(35-43)Online publication date: 18-Jun-2023
    • (2022)ByteHTAPProceedings of the VLDB Endowment10.14778/3554821.355483215:12(3411-3424)Online publication date: 29-Sep-2022
    • (2022)Cost modelling for optimal data placement in heterogeneous main memoryProceedings of the VLDB Endowment10.14778/3551793.355183715:11(2867-2880)Online publication date: 1-Jul-2022
    • (2022)Memory-optimized multi-version concurrency control for disk-based database systemsProceedings of the VLDB Endowment10.14778/3551793.355183215:11(2797-2810)Online publication date: 1-Jul-2022
    • (2021)Towards cost-effective and elastic cloud database deployment via memory disaggregationProceedings of the VLDB Endowment10.14778/3467861.346787714:10(1900-1912)Online publication date: 1-Jun-2021
    • (2021)Workload-Driven Placement of Column-Store Data Structures on DRAM and NVMProceedings of the 17th International Workshop on Data Management on New Hardware10.1145/3465998.3466008(1-8)Online publication date: 20-Jun-2021
    • (2021)Small Selectivities Matter: Lifting the Burden of Empty SamplesProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3452805(697-709)Online publication date: 9-Jun-2021
    • (2020)Industrial-strength OLTP using main memory and many coresProceedings of the VLDB Endowment10.14778/3415478.341553713:12(3099-3111)Online publication date: 14-Sep-2020
    • (2020)Analyzing memory accesses with modern processorsProceedings of the 16th International Workshop on Data Management on New Hardware10.1145/3399666.3399896(1-9)Online publication date: 15-Jun-2020
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media