Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3661167.3661174acmotherconferencesArticle/Chapter ViewAbstractPublication PageseaseConference Proceedingsconference-collections
research-article
Open access

Towards Comprehending Energy Consumption of Database Management Systems - A Tool and Empirical Study

Published: 18 June 2024 Publication History

Abstract

In the dynamic landscape of contemporary data-driven technologies, software systems depend significantly on vast datasets and ongoing data center operations that utilize diverse database systems to facilitate computationally intensive tasks. The management of vast amounts of data also introduces challenges related to energy efficiency. With the growing concern over energy consumption in software systems, the selection of a Green database system for its energy efficiency becomes crucial. While various software components have been scrutinized for their energy consumption, there exists a gap in the software engineering literature concerning the energy efficiency of database management systems. To bridge this gap, we performed an empirical study to investigate the energy consumption of queries associated with popular database systems namely MySQL, PostgreSQL, MongoDB, and Couchbase. Our assessments, performed on three commonly used datasets, uncover substantial variations in the energy consumption of these database systems. The study suggests a potential need for optimizing energy usage in various database systems, enhancing developer awareness of the impact of running queries on energy consumption. This empowers them to make informed, sustainable choices, warranting further research in this area.

References

[1]
Sarthak Agarwal and KS Rajan. 2017. Analyzing the performance of NoSQL vs. SQL databases for Spatial and Aggregate queries. In Free and Open Source Software for Geospatial (FOSS4G) Conference Proceedings, Vol. 17. 4.
[2]
Kazi Main Uddin Ahmed, Math HJ Bollen, and Manuel Alvarez. 2021. A review of data centers energy consumption and reliability modeling. IEEE Access 9 (2021), 152536–152563.
[3]
Lasse F Wolff Anthony, Benjamin Kanding, and Raghavendra Selvan. 2020. Carbontracker: Tracking and predicting the carbon footprint of training deep learning models. arXiv preprint arXiv:2007.03051 (2020).
[4]
Indu Arora and Anu Gupta. 2012. Cloud databases: a paradigm shift in databases. International Journal of Computer Science Issues (IJCSI) 9, 4 (2012), 77.
[5]
Béchir Bani, Foutse Khomh, and Yann-Gaël Guéhéneuc. 2016. A study of the energy consumption of databases and cloud patterns. In Service-Oriented Computing: 14th International Conference, ICSOC 2016, Banff, AB, Canada, October 10-13, 2016, Proceedings 14. Springer, 606–614.
[6]
Sneha Binani, Ajinkya Gutti, and Shivam Upadhyay. 2016. SQL vs. NoSQL vs. NewSQL-a comparative study. database 6, 1 (2016), 1–4.
[7]
James Bornholt, Todd Mytkowicz, and Kathryn S McKinley. 2012. The model is not enough: Understanding energy consumption in mobile devices. In 2012 IEEE Hot Chips 24 Symposium (HCS). IEEE, 1–3.
[8]
Shannon Bradshaw, Eoin Brazil, and Kristina Chodorow. 2019. MongoDB: the definitive guide: powerful and scalable data storage. O’Reilly Media.
[9]
Semen Andreevich Budennyy, Vladimir Dmitrievich Lazarev, Nikita Nikolaevich Zakharenko, Aleksei N Korovin, OA Plosskaya, Denis Valer’evich Dimitrov, VS Akhripkin, IV Pavlov, Ivan Valer’evich Oseledets, Ivan Segundovich Barsola, 2022. Eco2ai: carbon emissions tracking of machine learning models as the first step towards sustainable ai. In Doklady Mathematics, Vol. 106. Springer, S118–S128.
[10]
Rick Cattell. 2011. Scalable SQL and NoSQL data stores. Acm Sigmod Record 39, 4 (2011), 12–27.
[11]
Antonio Celesti, Maria Fazio, and Massimo Villari. 2019. A study on join operations in MongoDB preserving collections data models for future internet applications. Future Internet 11, 4 (2019), 83.
[12]
Rajrupa Chattaraj and Sridhar Chimalakonda. 2023. RJoules: An Energy Measurement Tool for R. In 2023 38th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 2026–2029.
[13]
Marco Couto, Rui Pereira, Francisco Ribeiro, Rui Rua, and João Saraiva. 2017. Towards a green ranking for programming languages. In Proceedings of the 21st Brazilian Symposium on Programming Languages. 1–8.
[14]
Vitor Furlan de Oliveira, Marcosiris Amorim de Oliveira Pessoa, Fabrício Junqueira, and Paulo Eigi Miyagi. 2021. SQL and NoSQL Databases in the Context of Industry 4.0. Machines 10, 1 (2021), 20.
[15]
Yuetang Deng, Phyllis Frankl, and Zhongqiang Chen. 2003. Testing database transaction concurrency. In 18th IEEE International Conference on Automated Software Engineering, 2003. Proceedings. IEEE, 184–193.
[16]
Paul DuBois. 2013. MySQL. Addison-Wesley.
[17]
Avrilia Floratou, Umar Farooq Minhas, and Fatma Özcan. 2014. Sql-on-hadoop: Full circle back to shared-nothing database architectures. Proceedings of the VLDB Endowment 7, 12 (2014), 1295–1306.
[18]
Stefanos Georgiou, Maria Kechagia, Tushar Sharma, Federica Sarro, and Ying Zou. 2022. Green ai: Do deep learning frameworks have different costs?. In Proceedings of the 44th International Conference on Software Engineering. 1082–1094.
[19]
Binglei Guo, Jiong Yu, Bin Liao, Dexian Yang, and Liang Lu. 2017. A green framework for DBMS based on energy-aware query optimization and energy-efficient query processing. Journal of Network and Computer Applications 84 (2017), 118–130.
[20]
Peter Henderson, Jieru Hu, Joshua Romoff, Emma Brunskill, Dan Jurafsky, and Joelle Pineau. 2020. Towards the systematic reporting of the energy and carbon footprints of machine learning. The Journal of Machine Learning Research 21, 1 (2020), 10039–10081.
[21]
Ralph Hintemann and Simon Hinterholzer. 2019. Energy consumption of data centers worldwide. Business, Computer Science (ICT4S) (2019).
[22]
Stefan Igescu, Giovanni Monea, and Vincenzo Pecorella. [n. d.]. EcoML: A tool to track and predict the carbon footprint of machine learning tasks. ([n. d.]).
[23]
Ylber Januzaj, Jaumin Ajdari, and Besnik Selimi. 2015. DBMS as a Cloud service: Advantages and Disadvantages. Procedia-Social and Behavioral Sciences 195 (2015), 1851–1859.
[24]
Mohammad A Kausar and Mohammad Nasar. 2021. SQL versus NoSQL databases to assess their appropriateness for big data application. Recent Advances in Computer Science and Communications (Formerly: Recent Patents on Computer Science) 14, 4 (2021), 1098–1108.
[25]
Kashif Nizam Khan, Mikael Hirki, Tapio Niemi, Jukka K Nurminen, and Zhonghong Ou. 2018. Rapl in action: Experiences in using rapl for power measurements. ACM Transactions on Modeling and Performance Evaluation of Computing Systems (TOMPECS) 3, 2 (2018), 1–26.
[26]
Wisal Khan, Teerath Kumar, Cheng Zhang, Kislay Raj, Arunabha M Roy, and Bin Luo. 2023. SQL and NoSQL Database Software Architecture Performance Analysis and Assessments—A Systematic Literature Review. Big Data and Cognitive Computing 7, 2 (2023), 97.
[27]
Douglas Kunda and Hazael Phiri. 2017. A comparative study of nosql and relational database. Zambia ICT Journal 1, 1 (2017), 1–4.
[28]
Loïc Lannelongue, Jason Grealey, and Michael Inouye. 2021. Green algorithms: quantifying the carbon footprint of computation. Advanced science 8, 12 (2021), 2100707.
[29]
Yishan Li and Sathiamoorthy Manoharan. 2013. A performance comparison of SQL and NoSQL databases. In 2013 IEEE Pacific Rim conference on communications, computers and signal processing (PACRIM). IEEE, 15–19.
[30]
Kenan Liu, Gustavo Pinto, and Yu David Liu. 2015. Data-oriented characterization of application-level energy optimization. In Fundamental Approaches to Software Engineering: 18th International Conference, FASE 2015, Held as Part of the European Joint Conferences on Theory and Practice of Software, ETAPS 2015, London, UK, April 11-18, 2015, Proceedings 18. Springer, 316–331.
[31]
Guy Lohman. 2014. Is query optimization a “solved” problem. In Proc. Workshop on Database Query Optimization, Vol. 13. Oregon Graduate Center Comp. Sci. Tech. Rep, 10.
[32]
DA Maevsky, EJ Maevskaya, and ED Stetsuyk. 2017. Evaluating the RAM energy consumption at the stage of software development. Green IT Engineering: Concepts, Models, Complex Systems Architectures (2017), 101–121.
[33]
Divya Mahajan, Cody Blakeney, and Ziliang Zong. 2019. Improving the energy efficiency of relational and NoSQL databases via query optimizations. Sustainable Computing: Informatics and Systems 22 (2019), 120–133.
[34]
Phan Thi Anh Mai, Jukka K Nurminen, and Mario Di Francesco. 2014. Cloud databases for internet-of-things data. In 2014 IEEE International Conference on Internet of Things (iThings), and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom). IEEE, 117–124.
[35]
Eric Masanet, Arman Shehabi, Nuoa Lei, Sarah Smith, and Jonathan Koomey. 2020. Recalibrating global data center energy-use estimates. Science 367, 6481 (2020), 984–986.
[36]
Houcine Matallah, Ghalem Belalem, and Karim Bouamrane. 2021. Comparative study between the MySQL relational database and the MongoDB NoSQL database. International Journal of Software Science and Computational Intelligence (IJSSCI) 13, 3 (2021), 38–63.
[37]
Rui Pereira, Marco Couto, João Saraiva, Jácome Cunha, and João Paulo Fernandes. 2016. The influence of the java collection framework on overall energy consumption. In Proceedings of the 5th International Workshop on Green and Sustainable Software. 15–21.
[38]
Wittawat Puangsaijai and Sutheera Puntheeranurak. 2017. A comparative study of relational database and key-value database for big data applications. In 2017 International Electrical Engineering Congress (iEECON). IEEE, 1–4.
[39]
Philip Sedgwick. 2014. Spearman’s rank correlation coefficient. Bmj 349 (2014).
[40]
Anil Shanbhag, Samuel Madden, and Xiangyao Yu. 2020. A study of the fundamental performance characteristics of GPUs and CPUs for database analytics. In Proceedings of the 2020 ACM SIGMOD international conference on Management of data. 1617–1632.
[41]
Shriram Shanbhag and Sridhar Chimalakonda. 2023. An Exploratory Study on Energy Consumption of Dataframe Processing Libraries. In 2023 IEEE/ACM 20th International Conference on Mining Software Repositories (MSR). IEEE, 284–295.
[42]
Vatika Sharma and Meenu Dave. 2012. Sql and nosql databases. International Journal of Advanced Research in Computer Science and Software Engineering 2, 8 (2012).
[43]
Spirals Research Group. 2021. pyJoules: A Python library to capture the energy consumption of code snippets. University of Lille and Inria (2021). https://github.com/powerapi-ng/pyJoules
[44]
Chanchai Supaartagorn. 2011. PHP Framework for database management based on MVC pattern. AIRCC’s International Journal of Computer Science and Information Technology (2011), 251–258.
[45]
Wojciech Truskowski, Rafał Klewek, and Maria Skublewska-Paszkowska. 2020. Comparison of MySQL, MSSQL, PostgreSQL, Oracle databases performance, including virtualization. Journal of Computer Sciences Institute 16 (2020), 279–284.
[46]
Yi-Cheng Tu, Xiaorui Wang, Bo Zeng, and Zichen Xu. 2014. A system for energy-efficient data management. ACM SIGMOD Record 43, 1 (2014), 21–26.
[47]
Sitalakshmi Venkatraman, Kiran Fahd, Samuel Kaspi, and Ramanathan Venkatraman. 2016. SQL versus NoSQL movement with big data analytics. International Journal of Information Technology and Computer Science 8, 12 (2016), 59–66.
[48]
Roberto Verdecchia, Luís Cruz, June Sallou, Michelle Lin, James Wickenden, and Estelle Hotellier. 2022. Data-centric green ai an exploratory empirical study. In 2022 International Conference on ICT for Sustainability (ICT4S). IEEE, 35–45.
[49]
Claes Wohlin, Per Runeson, Martin Höst, Magnus C Ohlsson, Björn Regnell, and Anders Wesslén. 2012. Experimentation in software engineering. Springer Science & Business Media.

Cited By

View all
  • (2024)Method for Calculating the Uncertainty range of Avoided Primary Energy Consumption and Environmental Impact applied to Data Analysis Software Services and Solar ElectricityInternational Journal of Environmental Engineering and Development10.37394/232033.2024.2.252(283-289)Online publication date: 30-Dec-2024

Index Terms

  1. Towards Comprehending Energy Consumption of Database Management Systems - A Tool and Empirical Study

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    EASE '24: Proceedings of the 28th International Conference on Evaluation and Assessment in Software Engineering
    June 2024
    728 pages
    ISBN:9798400717017
    DOI:10.1145/3661167
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 18 June 2024

    Check for updates

    Author Tags

    1. Couchbase
    2. Energy consumption
    3. MongoDB
    4. MySQL
    5. PostgreSQL

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    EASE 2024

    Acceptance Rates

    Overall Acceptance Rate 71 of 232 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)450
    • Downloads (Last 6 weeks)75
    Reflects downloads up to 25 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Method for Calculating the Uncertainty range of Avoided Primary Energy Consumption and Environmental Impact applied to Data Analysis Software Services and Solar ElectricityInternational Journal of Environmental Engineering and Development10.37394/232033.2024.2.252(283-289)Online publication date: 30-Dec-2024

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Login options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media