research-article

ACT-SAGAN: Automatic Configuration Tuning for Kafka with Self-Attention Generative Adversarial Networks

Published: 20 December 2022

Abstract

Kafka exposes a large number of configuration parameters so that users in production environments can tune it to a specific application environment and obtain better performance. However, configuring these parameters well requires in-depth knowledge that is far beyond the average user, which prevents Kafka from reaching better performance. To address this problem, we propose ACT-SAGAN, a method that adds a self-attention mechanism to a generative adversarial network model to capture both the hidden structures in good configuration combinations and the associations between configuration parameters, and then uses these hidden structures and associations to generate better configuration combinations. Experimental results show that deploying the configuration combinations generated by ACT-SAGAN improves Kafka's throughput and reduces its latency.
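
As a rough illustration of how self-attention can expose associations between configuration parameters, the sketch below applies generic scaled dot-product self-attention to one embedding vector per parameter. It is not the ACT-SAGAN architecture, which the abstract does not specify; the function name `self_attention`, the embedding size, and the random weight matrices are all assumptions made for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(params, w_q, w_k, w_v):
    """Generic scaled dot-product self-attention (illustrative only).

    params: array of shape (n_params, d), one hypothetical embedding row
            per configuration parameter.
    Returns the attended features and the (n_params, n_params) weight matrix.
    """
    q = params @ w_q                          # queries
    k = params @ w_k                          # keys
    v = params @ w_v                          # values
    scores = q @ k.T / np.sqrt(k.shape[-1])   # pairwise parameter affinities
    weights = softmax(scores, axis=-1)        # each row sums to 1
    return weights @ v, weights

# Hypothetical setup: 4 configuration parameters, 8-dimensional embeddings.
rng = np.random.default_rng(0)
n_params, d = 4, 8
params = rng.normal(size=(n_params, d))
w_q, w_k, w_v = (rng.normal(size=(d, d)) for _ in range(3))

out, weights = self_attention(params, w_q, w_k, w_v)
```

Row `i` of `weights` indicates how strongly parameter `i` attends to every other parameter. In a GAN setting, a layer of this kind would presumably sit inside the generator or discriminator so that each generated parameter value can depend on the others.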

Cited By

  • (2024) Efficient topic partitioning of Apache Kafka for high-reliability real-time data streaming applications. Future Generation Computer Systems, 154:C (173-188). DOI: 10.1016/j.future.2023.12.028. Online publication date: 25-Jun-2024

        Published In

        CSSE '22: Proceedings of the 5th International Conference on Computer Science and Software Engineering
        October 2022
        753 pages
        ISBN:9781450397780
        DOI:10.1145/3569966

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Author Tags

        1. Generative Adversarial Networks
        2. Kafka
        3. Self-Attention

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Conference

        CSSE 2022

        Acceptance Rates

        Overall Acceptance Rate 33 of 74 submissions, 45%
