Introduction

With the rapid development of science and technology, mechanical equipment is widely used in modern industry, and the health monitoring of mechanical systems has become an essential issue1. Rotating machinery is frequently used in industrial equipment, and its condition monitoring and fault diagnosis are of great significance in ensuring the reliability and safety of machinery in modern industrial systems2. The rolling bearing is a crucial component of rotating machinery. Due to its particular working environment, a rolling bearing is prone to accidental failure and damage under high speed, heavy load, and repeated high-temperature contact, which directly affects the whole machine's performance and leads to serious safety hazards and high maintenance costs3. Bearing failure is the most common type of failure in rotating machinery systems; according to statistics, 30–40% of rotating machinery failures are caused by bearing defects4. Therefore, efficient intelligent fault diagnosis techniques for rolling bearings have been a vital research topic in mechanical fault diagnosis over the past decades.

Bearing fault diagnosis technology has undergone three stages: manual experience, signal processing, and intelligence5. Traditional mechanical fault diagnosis methods and theories work well for simple systems with a single process, a single fault, and gradual faults, but they have significant limitations for multi-process, multi-fault, and sudden faults, as well as for complex, large, highly automated equipment and systems6. Nowadays, with the development of sensors and computer systems, the amount of data describing the status of mechanical equipment has increased exponentially. Artificial intelligence methods can extract hidden fault features from these large-scale datasets and learn new fault types, significantly improving fault diagnosis accuracy while reducing the labor cost and diagnostic uncertainty of traditional methods7,8,9,10. However, deep learning-based methods treat the objects as independently and identically distributed and cannot take into account the correlations between objects during training, so it remains difficult to identify some early fault signals. In this paper, we propose a graph neural network-based bearing fault detection method to improve the accuracy of bearing fault detection.

Our main contributions are summarized as follows:

  1.

    We convert the time-series signal of vibration into non-Euclidean structured graph data by methods such as feature transformation and similarity measurement.

  2.

    We propose a method for extracting features of vibration signals using graph neural networks. By feeding the vibration signals, together with the constructed graph, into the graph neural network for training, each trained object comes to contain information from a wider neighborhood.

  3.

    To improve the usability of the algorithm in real-world settings, an ensemble learning approach is adopted to enhance its robustness.

Related work

Up to now, a great deal of research has been conducted on the intelligent diagnosis of bearing faults. Early, widely used machine learning algorithms, such as PCA (Principal Component Analysis), SVM (Support Vector Machine), and KNN (K-Nearest Neighbor), achieved satisfactory results in intelligent diagnosis, and their classification accuracy improved significantly compared with traditional methods. However, classical machine learning algorithms struggle to learn nonlinear relationships11, and it is not easy to find suitable shallow machine learning methods when the nonlinear relationships between input data (samples) and output data (labels) are highly complex and difficult to understand. As a branch of machine learning, deep learning is highly capable of modeling nonlinearities with great flexibility and performs much better on realistic, complex problems. Therefore, deep learning has been introduced into bearing fault diagnosis to obtain higher diagnostic accuracy in complex environments12,13.

Many deep learning-based bearing fault diagnosis algorithms have been proposed as deep learning has evolved.

CNN (Convolutional Neural Network) is the most representative deep learning model. Janssens et al.14 were the first to apply convolutional neural networks to bearing fault diagnosis, using the spatial structure in the data to effectively capture the covariance of the frequency decomposition of accelerometer signals. To balance training speed and model accuracy, Guo et al.15 improved the traditional convolutional neural network model by adding an adaptive learning rate and momentum components to the weight update process. Xia et al.16 considered both temporal and spatial information of the raw data from multiple sensors during the training of convolutional neural networks. To deal with mechanical vibration signals of variable sequence length, Zhang et al.17 proposed a bearing fault diagnosis method based on a residual learning algorithm, where the whole network uses 1-dimensional convolutional layers to obtain local sequence features of the data stream. For data whose labels are difficult to obtain in practical situations, Meng et al.18 proposed a data augmentation technique, using a deep convolutional neural network with a residual learning algorithm as the main structure to obtain higher diagnostic accuracy with limited training data. Zhang et al.19 used a deep fully convolutional neural network (DFCNN) containing four pairs of convolution and pooling layers, converting vibration signals into images as input. Choudhary et al.8 proposed a fault diagnosis method for rotating machinery bearings combining CNN and thermal images, exploring the usefulness of thermal imaging in bearing fault diagnosis under various fault conditions. Xu et al.20 proposed a rolling bearing fault diagnosis model based on an online transfer convolutional neural network (OTCNN) with a pre-trained network model and source domain features.

AE (Autoencoder) is an unsupervised deep learning approach. In Ref.21, the maximum correlation entropy was used as the loss function of a deep autoencoder, and the critical parameters of the deep autoencoder were optimized to fit the signal characteristics using an artificial fish swarm algorithm. Wang et al.22 used a Gaussian radial basis kernel function and the acoustic emission method for bearing fault diagnosis with high diagnostic accuracy and applicability. Shao et al.23 proposed an ensemble deep autoencoder (EDAE) method for intelligent fault diagnosis of rolling bearings based on unsupervised feature learning from measured vibration signals. Similarly, Refs.24,25,26 improved on SAEs (Stacked Autoencoders) for bearing fault diagnosis, all with improved detection results compared to traditional SAEs. Zhang et al.27 proposed a semi-supervised learning method based on the deep generative model of the variational autoencoder (VAE), using the VAE's generative capability to improve classification performance when only a tiny portion of the data is labeled. Cui et al.28 proposed a rolling bearing fault detection and classification method combining a feature distance stacked autoencoder (FD-SAE) and support vector machines, organically combining machine learning and deep learning methods. Shao et al.29 used the Morlet wavelet activation function in a stacked autoencoder to establish an accurate nonlinear mapping between the original non-stationary vibration data and the various fault states. Ma et al.30 applied the weak magnetic detection method to whole-life-cycle monitoring of rolling bearings with an improved variational autoencoder. Li et al.31 proposed a unified framework combining a predictive generative denoising autoencoder (PGDAE) and a deep CORAL network (DCN).

DBN (Deep Belief Network) is a simple stack of unsupervised networks. Chen and Li32 first applied deep belief networks to bearing fault diagnosis and proposed a multi-sensor feature fusion diagnosis method based on stacked autoencoders and deep belief networks. Hoang et al.33 automatically extracted bearing fault features from signals using a DBN and then used Dempster-Shafer evidence theory to combine information from different sensors to predict bearing fault types. Liang et al.34 implemented a four-layer DBN that processes sensor data through multiple DBNs for feature extraction. Xu et al.35 combined the affinity propagation (AP) clustering model with a DBN containing multiple hidden layers for fault diagnosis. Yu et al.36 combined the maximum overlap discrete wavelet packet transform (MODWPT) with a deep belief network to analyze rolling bearing fault features and identify fault states. Zhu et al.37 used principal component analysis to extract fault features and then used a DBN for bearing fault diagnosis. Gao et al.38 focused on the structure and momentum of neural networks and used an optimization algorithm to optimize the network structure of the DBN. Niu et al.39 used particle swarm optimization (PSO) and an adaptive training strategy to improve the DBN, achieving higher accuracy and faster convergence.

In addition to CNNs, AEs, and DBNs, other common deep learning methods have also been applied to bearing fault diagnosis. For example, Refs.40,41,42 used generative adversarial networks and their variants for bearing fault diagnosis; with the advent of LSTM (Long Short-Term Memory) networks, Refs.43,44,45 improved RNNs (Recurrent Neural Networks) and applied them to bearing fault diagnosis; and Refs.46,47 proposed bearing fault diagnosis methods with higher diagnostic accuracy based on reinforcement learning. In recent years, researchers have borrowed ideas from convolutional networks, recurrent networks, and deep autoencoders to define and design neural network structures for processing graph data, giving rise to graph neural networks; however, up to now there has been almost no research applying graph neural networks to bearing fault diagnosis.

Model

The collected bearing vibration signals contain both normal and faulty vibration signals. The vibration signals are converted into nodes in a graph by means of data slicing and feature transformation, which turns bearing fault detection into a node classification task in machine learning. To address the problem that early bearing fault signals are weak and difficult to distinguish from normal signals, we propose a graph neural network-based bearing fault detection method (GNNBFD). This method contains five main parts: (1) dataset process, (2) construct graph, (3) graph neural network, (4) ensemble, and (5) outlier score. In this section, we describe each part of the method in detail. Figure 1 shows the detection process of the GNNBFD algorithm.

Figure 1. Flow chart of bearing fault detection based on graph neural network.

Dataset process

Since it is difficult for neural networks to extract features from the raw dataset, the dataset needs to be processed first to improve the detection accuracy of the algorithm. The processing consists of two main steps: (1) slicing the dataset; (2) feature transformation. The flow chart of the dataset process is shown in Fig. 2.

Figure 2. Flow chart of data process.

Slicing the dataset

The dataset for bearing fault detection is usually an N×1 time series. The series is sliced into segments of 300 data points, transforming the original data into a 300×(N/300) matrix; the transformed dataset thus contains N/300 subsamples of 300 data points each. It is worth noting that the slicing uses a non-overlapping moving window.
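As a concrete illustration, the non-overlapping slicing step can be sketched as follows (a minimal Python/NumPy sketch; the paper's implementation is in Matlab, and the function name is ours):

```python
import numpy as np

def slice_signal(signal: np.ndarray, window: int = 300) -> np.ndarray:
    """Split an N x 1 time series into N // window non-overlapping
    subsamples of `window` points; trailing points are discarded."""
    n_sub = len(signal) // window
    return signal[: n_sub * window].reshape(n_sub, window)

# Example: 240,000 points -> 800 subsamples of 300 points each.
subsamples = slice_signal(np.random.randn(240_000))
print(subsamples.shape)  # (800, 300)
```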

Feature transformation

For each subsample, 23 features in the time and frequency domains are calculated and used as input to the subsequent model.

Based on Table 1, the indexes are calculated for each subsample in four steps:

  (1)

    Nine time domain indexes are calculated as follows:

    $$ I = \left[ I_{1}, I_{2}, I_{3}, I_{4}, I_{5}, I_{6}, I_{7}, I_{8}, I_{9} \right] $$
    (1)
  (2)

    The WPD (wavelet packet decomposition) energies EWPD are calculated (decomposition level j = 3, wavelet Db20) to obtain the dataset WWPD:

    $$ W_{WPD} = \left[ E_{WPD}^{1}, E_{WPD}^{2}, E_{WPD}^{3}, E_{WPD}^{4}, E_{WPD}^{5}, E_{WPD}^{6}, E_{WPD}^{7}, E_{WPD}^{8} \right] $$
    (2)
  (3)

    The EEMD (ensemble empirical mode decomposition) energies are calculated to obtain the dataset WEEMD:

    $$ W_{EEMD} = \left[ E_{EEMD}^{1}, E_{EEMD}^{2}, E_{EEMD}^{3}, E_{EEMD}^{4}, E_{EEMD}^{5}, E_{EEMD}^{6} \right] $$
    (3)
  (4)

    I, WWPD, and WEEMD are combined into the dataset:

    $$ X = \left[ I, W_{WPD}, W_{EEMD} \right] $$
    (4)
Table 1 Indexes and the calculation formulas.

The feature transformation of the sliced dataset reduces the redundant information in the subsamples and effectively reduces the computational effort of the subsequent model. The 23 extracted time-domain and frequency-domain features adequately reflect the information contained in the samples and facilitate further processing by the subsequent model.
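To make the transformation concrete, the sketch below computes a common set of nine time-domain indexes as a stand-in for Table 1 (which is not reproduced here); the exact indexes in Table 1 may differ, and the WPD and EEMD energies would be appended analogously (e.g. via a wavelet packet and an EEMD implementation):

```python
import numpy as np
from scipy.stats import skew, kurtosis

def time_domain_indexes(x: np.ndarray) -> np.ndarray:
    """Nine illustrative time-domain indexes for one subsample."""
    abs_x = np.abs(x)
    rms = np.sqrt(np.mean(x ** 2))
    peak = abs_x.max()
    return np.array([
        x.mean(),             # mean value
        x.std(),              # standard deviation
        rms,                  # root mean square
        peak,                 # peak value
        skew(x),              # skewness
        kurtosis(x),          # kurtosis
        peak / rms,           # crest factor
        rms / abs_x.mean(),   # shape factor
        peak / abs_x.mean(),  # impulse factor
    ])

subsamples = np.random.randn(800, 300)  # placeholder; see the slicing sketch
I = np.vstack([time_domain_indexes(s) for s in subsamples])  # shape (800, 9)
```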

Construct graph

Since the subsamples produced in Section “Dataset process” are independent of each other, there is no interconnection between them. Traditional deep learning methods feed the subsamples directly into the model for training, which does not consider the correlations between subsamples. For this reason, we propose a method for constructing correlations between subsamples, called construct graph.

Construct graph mainly considers the similarity between subsamples: the higher the similarity, the greater the weight of the connection between them. It consists of (1) calculating the similarity between subsamples, (2) assigning weights, and (3) outputting the graph. The flow chart of construct graph is shown in Fig. 3.

Figure 3. Flow chart of construct graph.

Similarity

Let X = {X1, X2, X3, …, Xm} denote the processed dataset, where Xi = ⟨Xi1, Xi2, Xi3, …, Xid⟩ denotes a subsample in X, d denotes the dimension of X, and Xid denotes the value of subsample Xi in the dth dimension.

We use the normalized Euclidean distance as the measure of similarity between subsamples. Dist(xi, xj) represents the distance between subsamples xi and xj and is calculated as shown in the following equation:

$$ Dist\left( x_{i}, x_{j} \right) = (x_{i} - x_{j}) V^{-1} (x_{i} - x_{j})^{\top} $$
(5)

where V is the d-by-d diagonal matrix whose jth diagonal element is S(j)^2, with S a vector of scaling factors for each dimension (by default, the standard deviation of each dimension). The smaller the value of Dist(xi, xj), the higher the similarity between the two objects.
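This distance matches SciPy's standardized Euclidean metric, so the pairwise computation can be sketched as follows (squaring recovers the quadratic form of Eq. (5); the variable names are illustrative):

```python
import numpy as np
from scipy.spatial.distance import pdist, squareform

X = np.random.randn(800, 23)  # placeholder 23-dimensional feature matrix
# 'seuclidean' scales each dimension by its variance by default (the matrix V).
D = squareform(pdist(X, metric="seuclidean")) ** 2  # D[i, j] = Dist(x_i, x_j)
```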

Assigning weights

First, the top k subsamples with the highest similarity to subsample xi are selected; they form the neighbor set of xi, denoted Nk(xi). The weights between these k subsamples and xi are then calculated as shown in the following equation.

$$ W(X_{i}, X_{j}) = \begin{cases} \dfrac{Dist(X_{i}, X_{j})}{\sum\nolimits_{j = 1}^{k} Dist(X_{i}, X_{j})}, & X_{j} \in N_{k}(X_{i}) \\ 0, & X_{j} \notin N_{k}(X_{i}) \end{cases} $$
(6)

Output graph

By calculating the similarity and assigning weights in the first two steps, we can connect the subsamples to each other. If the weight between two subsamples is greater than 0, there is an edge between them; otherwise there is no connection. The connected subsamples form the graph, which we represent by the adjacency matrix A.

$$ A = \begin{bmatrix} 1 & W(X_{2}, X_{1}) & \cdots & \cdots & W(X_{m}, X_{1}) \\ W(X_{1}, X_{2}) & 1 & \cdots & \cdots & W(X_{m}, X_{2}) \\ \vdots & \vdots & \ddots & \cdots & \vdots \\ \cdots & \cdots & W(X_{i}, X_{j}) & \cdots & \cdots \\ W(X_{1}, X_{m}) & W(X_{2}, X_{m}) & \cdots & \cdots & 1 \end{bmatrix} $$

The diagonal elements of the adjacency matrix A are all 1, indicating that each subsample is connected to itself. In this way, the feature information of the subsample itself is prevented from being lost during the training of the subsequent model.
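A minimal sketch of the whole construction, assuming the distance matrix D from the previous sketch: for each subsample it keeps the k nearest neighbors, normalizes their distances into the weights of Eq. (6), and sets the diagonal to 1 for the self-connections described above (the value of k is a tunable parameter, not fixed by the paper at this point):

```python
import numpy as np

def build_adjacency(D: np.ndarray, k: int = 5) -> np.ndarray:
    """Build the weighted adjacency matrix A from pairwise distances D."""
    m = D.shape[0]
    A = np.zeros((m, m))
    for i in range(m):
        # the k most similar subsamples (smallest distance, self excluded)
        order = [j for j in np.argsort(D[i]) if j != i][:k]
        w = D[i, order]
        A[i, order] = w / w.sum()  # normalized weights, Eq. (6)
    np.fill_diagonal(A, 1.0)       # every subsample connects to itself
    return A

rng = np.random.default_rng(0)
D = np.abs(rng.standard_normal((800, 800)))
A = build_adjacency((D + D.T) / 2, k=5)  # symmetrized toy distances
```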

Graph neural network

Traditional neural network structures such as convolutional neural networks and recurrent neural networks receive data in Euclidean space as input and cannot handle data structures in non-Euclidean space, such as graphs. Therefore, we use graph neural networks to handle the graph data. A GNN is a framework for learning directly from graph-structured data using deep learning.

We use a recurrent neural structure to propagate neighbor information until a stable fixed point is reached, learning a representation of the target node that facilitates the subsequent fault detection task. After feature extraction by the graph neural network, each graph node contains not only its own information but also the feature information of its neighbors. Our forward model then takes the simple form:

$$ Z = f(X, A) = ReLU\left( A \, ReLU\left( A X W^{(0)} - b^{(0)} \right) W^{(1)} - b^{(1)} \right) $$
(7)

Here, \(W \in R^{D \times H}\) is an input-to-hidden weight matrix for a hidden layer with H feature maps, and \(b \in R^{H}\) is the corresponding bias vector. The GNN weights W(0), W(1) and biases b(0), b(1) are trained using gradient descent.
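A NumPy sketch of this two-layer forward pass follows; the A-X-W propagation order is the standard GCN-style form assumed in the reconstruction of Eq. (7), and the randomly initialized weights stand in for trained parameters:

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def gnn_forward(X, A, W0, b0, W1, b1):
    H = relu(A @ X @ W0 - b0)     # propagate neighbor features, then nonlinearity
    return relu(A @ H @ W1 - b1)  # second layer yields the embedding Z

m, d, h = 800, 23, 16
rng = np.random.default_rng(0)
X, A = rng.standard_normal((m, d)), np.eye(m)  # identity A = self-loops only (toy)
W0, b0 = rng.standard_normal((d, h)) * 0.1, np.zeros(h)
W1, b1 = rng.standard_normal((h, h)) * 0.1, np.zeros(h)
Z = gnn_forward(X, A, W0, b0, W1, b1)  # shape (m, h)
```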

As shown in Fig. 4, the GNN model is based on an information propagation mechanism: each node exchanges information with other nodes through continuous iterative updates until a stable state is reached. When the information flow over the whole graph settles, each node holds information about itself and its neighboring nodes.

Figure 4. The structure of GNN.


Ensemble

In this paper, we combine the detections of multiple outlier detection algorithms applied to the Z matrix, a process called ensemble. Each subsample of the Z matrix output by the GNN contains not only its own feature values but also those of its neighbors. Feeding the Z matrix into the outlier detection algorithms helps to maximize the separation of normal objects from outliers in the dataset.

We use three classical outlier detection algorithms to detect the Z matrix: LOF (Local Outlier Factor), KNN (K-Nearest Neighbor), and AE (Autoencoder). LOF is a classical density-based outlier detection algorithm, KNN is a distance-based algorithm, and AE is a neural network-based algorithm. Different types of outlier detection algorithms focus on detecting outliers with different distributions, so combining these three types maximizes the robustness of GNNBFD. All three are unsupervised outlier detection algorithms, and the first two require no training process. When training the AE, we set the learning rate to 0.0001 and the network depth to three hidden layers, and we use the Adam optimizer for backpropagation.

Each algorithm eventually outputs an outlier value for each subsample in the Z matrix. This value indicates the probability that the subsample is faulty: the higher the outlier value, the more likely the sample is a faulty sample, and vice versa.
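The sketch below illustrates the three base detectors on the embedding Z using scikit-learn; LOF and the kNN-distance score follow the standard definitions, while the autoencoder is approximated by an MLP trained to reconstruct its input, a simplification of the three-hidden-layer AE described above:

```python
import numpy as np
from sklearn.neighbors import LocalOutlierFactor, NearestNeighbors
from sklearn.neural_network import MLPRegressor

def base_detector_scores(Z: np.ndarray, k: int = 10) -> np.ndarray:
    """Return a (3, n_samples) matrix of outlier scores (higher = more outlying)."""
    lof = LocalOutlierFactor(n_neighbors=k).fit(Z)
    s_lof = -lof.negative_outlier_factor_

    dist, _ = NearestNeighbors(n_neighbors=k + 1).fit(Z).kneighbors(Z)
    s_knn = dist[:, 1:].mean(axis=1)  # mean kNN distance, self excluded

    ae = MLPRegressor(hidden_layer_sizes=(16, 8, 16), learning_rate_init=1e-4,
                      max_iter=500).fit(Z, Z)
    s_ae = ((ae.predict(Z) - Z) ** 2).mean(axis=1)  # reconstruction error

    return np.vstack([s_lof, s_knn, s_ae])
```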

Outlier score (OS)

We define the outlier value of the ith subsample as its Outlier Score (OS), which is calculated as shown in the following equation:

$$ OS_{i} = \frac{{\sum\limits_{j = 1}^{m} {OF_{j} } }}{m} $$
(8)

In the above equation, OFj denotes the outlier factor assigned to the ith subsample by the jth outlier detection algorithm, and OSi denotes the average outlier score of the ith subsample over the m outlier detection algorithms (here m = 3). Averaging in this way reduces the misclassification rate of the algorithm and improves the reliability and robustness of the final detection results.
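A sketch of this averaging step: each detector's scores are first min-max normalized so that no single detector dominates the average (the normalization is our assumption; the paper specifies only the averaging of Eq. (8)):

```python
import numpy as np

def outlier_score(scores: np.ndarray) -> np.ndarray:
    """Average an (m, n_samples) score matrix into one OS per sample, Eq. (8)."""
    lo = scores.min(axis=1, keepdims=True)
    hi = scores.max(axis=1, keepdims=True)
    normalized = (scores - lo) / (hi - lo + 1e-12)  # per-detector min-max
    return normalized.mean(axis=0)  # OS_i
```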


Experiments

In this section, we demonstrate the effectiveness of the proposed algorithm for bearing fault detection in four parts: (a) introduction to the experimental environment and dataset; (b) evaluation methods and comparison algorithms; (c) experimental results; and (d) effectiveness analysis.

In (a), we describe the experimental platform, the hardware and software configurations, and the dataset and its processing. In (b), we use four evaluation metrics to measure the performance of each algorithm comprehensively. In (c), we report the final detection results of the algorithms in detail and analyze the reasons behind them. In (d), we verify the effectiveness of the GNNBFD algorithm using k-fold cross-validation.

Experimental environment and dataset

The hardware environment for the experiments is an Intel(R) Core(TM) i7-7700 3.60 GHz CPU with 8 GB of RAM. On the software side, we implemented the model in Matlab 2020 under Windows 10 Professional.

The dataset is the publicly available Case Western Reserve University (CWRU) bearing dataset. The advantage of using a publicly available dataset is that other researchers can easily reproduce our experimental results. The bearing type is SKF 6205, mounted at the drive end; the sampling frequency is 12,000 Hz and the motor speed is 1797 rpm. We divided the collected experimental data into 3 groups, each containing 4 states: (1) 0.1778 mm inner race fault, 0.3556 mm inner race fault, 0.5334 mm inner race fault, and the normal baseline; (2) 0.1778 mm ball fault, 0.3556 mm ball fault, 0.5334 mm ball fault, and the normal baseline; (3) 0.1778 mm outer race fault, 0.3556 mm outer race fault, 0.5334 mm outer race fault, and the normal baseline. Figure 5 shows the normal bearing and the three faulty bearing states.

Figure 5. Normal and fault states of bearings: (a) normal, (b) inner race fault, (c) outer race fault, (d) ball fault.

Taking the first group (inner race faults) as an example, the normal signal contains 240,000 data points and the faulty signals contain 120,000 data points. We divide the normal and faulty signals into subsamples of 300 data points each. Among the divided fault samples, 20 samples are randomly selected as outliers and form the first dataset together with the normal objects. The other two groups are processed in the same way, yielding three datasets in total; their details are summarized in Table 2.

Table 2 Summary of datasets.

Evaluation methods and comparison algorithms

We use the receiver operating characteristic (ROC) curve and corresponding area under the curve (AUC), accuracy (ACC), detection rate (DR), and false alarm rate (FAR) to measure the detection performance. Higher AUC, ACC, and DR values and lower FAR indicate better performance.

The calculation method of ACC, DR, FAR can be obtained from the confusion matrix in Table 3.

$$ ACC = \frac{TP + TN}{{TP + TN + FP + FN}} $$
(9)
$$ DR = \frac{TP}{{TP + FN}} $$
(10)
$$ FAR = \frac{FP}{{TN + FP}} $$
(11)
Table 3 Confusion matrix.
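For reference, a short sketch computing these three metrics from binary labels (1 = fault, 0 = normal) and thresholded predictions; the counts correspond to the confusion matrix of Table 3:

```python
import numpy as np

def detection_metrics(y_true: np.ndarray, y_pred: np.ndarray):
    tp = np.sum((y_true == 1) & (y_pred == 1))
    tn = np.sum((y_true == 0) & (y_pred == 0))
    fp = np.sum((y_true == 0) & (y_pred == 1))
    fn = np.sum((y_true == 1) & (y_pred == 0))
    acc = (tp + tn) / (tp + tn + fp + fn)  # Eq. (9)
    dr = tp / (tp + fn)                    # Eq. (10), detection rate
    far = fp / (tn + fp)                   # Eq. (11), false alarm rate
    return acc, dr, far
```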

We compare GNNBFD with five representative outlier detection algorithms. These algorithms are common types in the outlier detection field and are used as comparison algorithms in most of the related literature. They fall into five categories: (i) neural network-based, SO-GAAL (Single-Objective Generative Adversarial Active Learning); (ii) graph-based, CutPC (a graph-based clustering method using noise cutting); (iii) local outlier factor-based, LOF; (iv) distance-based, KNN; (v) isolation-based, IForest. The parameter settings are summarized in Table 4.

Table 4 Parameter setting.

Experimental results

Experimental results on the real-world datasets are shown in Fig. 6. For each algorithm, we tried 10 different parameter settings and chose the best result among the 10 runs as its final performance.

Figure 6. Experimental results.

Observing the experimental results, several interesting points can be noted.

  (1)

    Among the four evaluation metrics, GNNBFD achieves the best results. Compared with the next-best IForest algorithm, GNNBFD improves the AUC value by 6.4% on average, which demonstrates the validity of the proposed algorithm.

  (2)

    On the Group 2 dataset, most algorithms show poor detection results. This is due to the weak fault vibration signal of the ball and the small deviation of the fault signal from the normal signal. The GNNBFD method, by contrast, learns broader neighborhood information about each object, so fault signals with a small degree of deviation deviate more strongly after the graph neural network embedding.

  (3)

    In the process of converting the vibration signals into a graph, connectivity relations are constructed between the objects. After training with the graph neural network, the low-dimensional embedding of each object contains more valuable information and its distribution pattern changes relative to the original data. Therefore, the base detectors are able to distinguish faulty and normal objects more accurately.

The experimental results prove that GNNBFD is effective and feasible.

Effectiveness analysis

K-fold cross-validation is a commonly used technique for evaluating the performance of machine learning models. It yields a more accurate estimate of a model's performance and can help detect overfitting, as it evaluates the model on a larger amount of data.

To analyze the effectiveness of the GNNBFD algorithm proposed in this paper, we performed K-fold cross-validation. Specifically, we ran fivefold cross-validation on each of the three datasets; each time, 150 normal objects and 30 faulty objects were selected from the dataset, and the performance of the proposed algorithm was measured by the AUC value. The experimental results are shown in Table 5.
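The protocol can be sketched as follows, with `gnnbfd_score` as a placeholder for the full pipeline (graph construction, GNN embedding, ensemble, and OS); stratified folds preserve the normal/fault ratio in each split:

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold
from sklearn.metrics import roc_auc_score

def cross_validate(X, y, gnnbfd_score, n_splits=5, seed=0):
    """Average AUC of an unsupervised scoring function over stratified folds."""
    skf = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    aucs = []
    for _, idx in skf.split(X, y):
        scores = gnnbfd_score(X[idx])  # outlier scores for this fold
        aucs.append(roc_auc_score(y[idx], scores))
    return float(np.mean(aucs)), aucs
```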

Table 5 K-fold cross validation.

To ensure that GNNBFD is blind to the samples when performing fault detection, we used random sampling and randomly shuffled the samples. Observing the experimental results in Table 5, the GNNBFD algorithm still achieves good detection results with a small number of samples. The average AUC values on the three datasets are 0.984, 0.975, and 0.991, which proves that the proposed algorithm can effectively detect bearing fault signals.

Conclusion

In this paper, a graph neural network-based bearing fault detection method is proposed to improve the accuracy of bearing fault detection. The graph neural network has a powerful feature mapping capability and can fit the feature values of a sample and its neighbors simultaneously. After the GNN mapping, normal samples and fault samples can be separated more easily by the base detector algorithms. Considering the higher requirements for algorithm robustness in real production environments, we use an ensemble technique to combine the detection results of the base detectors, making the GNNBFD algorithm more stable and efficient. Experiments on publicly available datasets show that the GNNBFD algorithm can successfully detect most of the fault samples in the dataset. In future work, we will mainly study how to improve the detection performance of GNNBFD with deeper network structures.