Systematic Evaluation of Deep Learning Models for Log-based Failure Prediction

Hadadi, Fatemeh; Dawes, Joshua H.; Shin, Donghwan; Bianculli, Domenico; Briand, Lionel

doi:10.1007/s10664-024-10501-4

Computer Science > Software Engineering

arXiv:2303.07230 (cs)

[Submitted on 13 Mar 2023 (v1), last revised 24 Jun 2024 (this version, v4)]

Title:Systematic Evaluation of Deep Learning Models for Log-based Failure Prediction

Authors:Fatemeh Hadadi, Joshua H. Dawes, Donghwan Shin, Domenico Bianculli, Lionel Briand

View PDF HTML (experimental)

Abstract:With the increasing complexity and scope of software systems, their dependability is crucial. The analysis of log data recorded during system execution can enable engineers to automatically predict failures at run time. Several Machine Learning (ML) techniques, including traditional ML and Deep Learning (DL), have been proposed to automate such tasks. However, current empirical studies are limited in terms of covering all main DL types -- Recurrent Neural Network (RNN), Convolutional Neural network (CNN), and transformer -- as well as examining them on a wide range of diverse datasets.
In this paper, we aim to address these issues by systematically investigating the combination of log data embedding strategies and DL types for failure prediction. To that end, we propose a modular architecture to accommodate various configurations of embedding strategies and DL-based encoders. To further investigate how dataset characteristics such as dataset size and failure percentage affect model accuracy, we synthesised 360 datasets, with varying characteristics, for three distinct system behavioral models, based on a systematic and automated generation approach. Using the F1 score metric, our results show that the best overall performing configuration is a CNN-based encoder with Logkey2vec. Additionally, we provide specific dataset conditions, namely a dataset size >350 or a failure percentage >7.5%, under which this configuration demonstrates high accuracy for failure prediction.

Comments:	Accepted by EMSE'24
Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2303.07230 [cs.SE]
	(or arXiv:2303.07230v4 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2303.07230
Journal reference:	Empir Software Eng 29, 105 (2024)
Related DOI:	https://doi.org/10.1007/s10664-024-10501-4

Submission history

From: Fatemeh Hadadi [view email]
[v1] Mon, 13 Mar 2023 16:04:14 UTC (622 KB)
[v2] Thu, 26 Oct 2023 20:07:45 UTC (548 KB)
[v3] Tue, 30 Apr 2024 16:25:17 UTC (386 KB)
[v4] Mon, 24 Jun 2024 04:36:05 UTC (1,366 KB)

Computer Science > Software Engineering

Title:Systematic Evaluation of Deep Learning Models for Log-based Failure Prediction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Systematic Evaluation of Deep Learning Models for Log-based Failure Prediction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators