Article (DOI: 10.1007/978-981-97-0316-6_7)

Benchmarking Modern Databases for Storing and Profiling Very Large Scale HPC Communication Data

Published: 14 February 2024

Abstract

Capturing cross-stack communication profiles on HPC systems at fine granularity is critical for gaining insight into the detailed performance trade-offs and interplay among the components of the HPC ecosystem. To enable this, one needs to be able to collect, store, and retrieve system-wide data at high fidelity. As modern HPC systems expand, ensuring high-fidelity, real-time communication profiling becomes more challenging, especially with the growing number of users employing profiling tools to monitor their workloads. We take on this challenge in this paper and identify the key performance metrics that make a database amenable to these needs. We then design benchmarks to measure and understand the performance of multiple popular open-source databases. Through rigorous experimental analysis, we demonstrate the performance and scalability trends of the selected databases when performing different types of fundamental storage and retrieval operations under various conditions. Through this work, we achieve sub-second complex data querying while serving up to 64 users, and we demonstrate a 9× improvement in insertion latency through parallel data insertion, reaching a latency of 55 ms and using 50% less disk space when inserting 200,000 rows of profiling data collected from a potential system that is 4× the size of the state-of-the-art, 19th-ranked Frontera supercomputing system at TACC, which has 8,368 nodes.


Published In

Benchmarking, Measuring, and Optimizing: 15th BenchCouncil International Symposium, Bench 2023, Sanya, China, December 3–5, 2023, Revised Selected Papers
Dec 2023
200 pages
ISBN: 978-981-97-0315-9
DOI: 10.1007/978-981-97-0316-6
Editors: Sascha Hunold, Biwei Xie, Kai Shu

Publisher

Springer-Verlag, Berlin, Heidelberg

Author Tags

1. HPC
2. Storage
3. Database
4. Profiling
5. Communication
