1. Introduction
Diesel engines have been widely used in energy, construction machinery, and military equipment. Vibration signals are transmitted dynamically and synchronously in real-time, Playing a pivotal role in real-time online monitoring of diesel engine health [
1,
2,
3]. It can effectively reduce the incidence of equipment failure, downtime, and management costs. Compared with traditional wired data transmission, edge computing wireless data transmission methods can significantly improve the real-time, flexibility, and ease of use of data, according to the Nyquist sampling theorem [
4]. To realize the collection of high-frequency vibration signals of the equipment, we will inevitably generate a large amount of data. However, big data is constrained by network bandwidth during wireless transmission. Whether it can support the problems of massive data, high concurrency, low latency, and low power consumption is yet to be determined.
Recently, it has become a research hotspot that researchers focus on. For example, Antonopoulos et al. [
5] embedded compression algorithms into hardware systems to improve the work efficiency of transmitting large amounts of data wirelessly. Ma et al. [
6] used a distributed video codec scheme to enhance the processing power of a single node for traditional data compression. Yi et al. [
7] proposed an adaptive data compression and transmission range extension scheme to improve the data collection rate of sink nodes. Hameed et al. [
8] used lossless compression technology and Huffman coding encryption technology to provide effective means for remote monitoring security and compressibility of electrocardiography (ECG) data. Therefore, before the data are wirelessly transmitted, real-time synchronous sampling and compression of the original vibration data is the best solution to solve the above problems.
Compressive sensing (CS) is a new technical theory that has emerged in recent years [
9]. Due to its outstanding performance in data compression and reconstruction, it has been widely used in the field of image and sound. Use the observation matrix to map the original vibration signal from the high-dimensional space to the low-dimensional space. Then, the original signal is recovered with a high probability from fewer observations through an optimization algorithm. Currently, commonly used compression and re-construction algorithms include greedy algorithm [
10], convex optimization algorithm [
11], Bayesian learning [
12], etc. For example, Liu et al. [
13] used a low-pass filtering method to optimize the electrographic signal and used basis pursuit (BP) algorithm to compress and reconstruct the electrocardiogram signal. Cheng et al. [
14] used an improved orthogonal matching pursuit (OMP) algorithm to improve seismic data’s reconstruction speed and compression effect. Sajjad et al. [
15] used a genetic algorithm to optimize the sparse signal and the regularized orthogonal matching pursuit (ROMP) algorithm to reconstruct the image signal. Generally, reciprocating mechanical vibration signals have sparse, non-sparse, and unique structural features. The traditional compression and reconstruction algorithm is used to recover sparse signals with high accuracy and versatility in the above research. However, this type of algorithm only considers its sparsity and is not necessarily suitable for reconstructing reciprocating mechanical vibration signals. Improving the recovery accuracy of structured non-sparse signals becomes crucial.
In the existing Bayesian algorithm, the block sparse Bayesian learning bound optimization (BSBL-BO) algorithm [
16] has the potential to solve the problem of structured non-sparse signal reconstruction. The algorithm effectively uses the intra-block correlation of vibration signals to restore structured non-sparse signals. Compared with other traditional compression and reconstruction algorithms, the BSBL-BO algorithm has the advantages of high signal recovery accuracy and good compression effect and has been widely used in electrocardiograms and radar. For example, Mahrous et al. [
17] proposed a space-time sparse Bayesian learning method. By optimizing the BSBL-BO algorithm, the compression and reconstruction of multi-channel electro-encephalogram (EEG) signals are realized. Li et al. [
18] used an enhanced narrow-band interference separation algorithm for radar to achieve compression and reconstruction of radar signals through the BSBL framework, proving the feasibility of the BSBL-BO algorithm for data compression. However, this algorithm has not been studied much in reciprocating mechanical vibration signals in previous studies. This paper carries out related research based on the BSBL-BO algorithm to fill the gap.
An essential prerequisite for CS is the sparsity in the original vibration signal. Sparsity plays a crucial role in the accuracy of the reconstruction of recovered data. Therefore, an efficient data dictionary is needed to improve the signal’s sparsity. Classical dictionaries include discrete cosine transform (DCT) [
19], discrete Fourier transform (DFT) [
20], and wavelet packet transform (discrete wavelet transform, DWT) [
21] are fixed dictionaries. The ideal sparse representation can only be obtained when the atomic features in this dictionary type are the same as the original vibration information. There is also a dictionary, commonly used K-singular value decomposition (K-SVD) [
22] and optimal directions (method of optimal directions, MOD) [
23]. The dictionary is dynamically updated through training to obtain the optimal sparse representation. Compared with the fixed dictionary, it has the advantage of solid adaptive ability. For example, Li et al. [
24] used the K-SVD algorithm to update the dictionary to improve the sparsity of image signals. Yang et al. [
25] used the K-SVD algorithm to enhance the sparse representation of medical images to obtain better compression and reconstruction accuracy.
Diesel engines often have various failures in their daily work. Among them, the loss of the diesel engine refers to the phenomenon of increased valve clearance, severe deformation of a valve seat ring, burning oil, and severe wear of piston rings during operation. As a result, the diesel engine cannot work normally, and there is a more significant safety hazard. To reduce the occurrence rate of diesel engine failures and improve stability and safety, researchers have carried out a great deal of research work and achieved fruitful research results. Gu et al. [
26] applied the multivariate empirical mode decomposition to the fault diagnosis of diesel engine misfire and achieved good fault classification results by using the SVM classifier. Chen et al.’s [
27] harmony search optimizer is used to set hyper-parameters of the variational stacked autoencoder. This method has been well applied in the fault detection of diesel engines. Wang et al. [
28] proposed the plan of particle swarm optimization probabilistic neural network (probabilistic neural network, PNN) and support vector machine. Effective diagnosis of common engine failures is achieved. In recent years, the application of compressed sensing theory to fault diagnosis has gradually attracted the attention of researchers, and some research results have been completed. Zhang et al. [
29] trained several over-complete dictionaries with a dictionary learning method. Thereby, redundant dictionaries corresponding to different fault categories are obtained. The matching tracking algorithm is used to determine. The error of the reconstructed signal under various dictionaries is compared to realize the diagnosis of the fault category. Tang et al. [
30] first obtained the compressed acquisition signal. Then, given the specified sparsity, the matching pursuit algorithm is used to directly obtain the first few fault characteristic frequencies with enormous energy. To realize the identification and diagnosis of fault signals, Du et al. [
31] used a dictionary constructed from Fourier transform matrices. The fault features are directly extracted in the compressed measurement domain to realize fault diagnosis of vibration signals.
Although compression technology has been widely used, there are still the following problems or deficiencies:
In the process of wireless transmission, due to the limitation of network bandwidth and low power consumption, massive vibration signals bring considerable challenges to data storage and wireless network transmission;
The problem of the reconstruction accuracy of the structured non-sparse signal of the reciprocating mechanical vibration signal cannot be satisfied by the traditional data compression technology;
Aiming at the compression and reconstruction effects of reciprocating mechanical vibration signals, there is a lack of an effective, comprehensive evaluation index for data compression effects;
There is a lack of relevant research on compressive sensing technology and fault diagnosis methods and their application in fault diagnosis of reciprocating machinery.
Using the BSBL-BO algorithm can effectively solve the problem of structured non-sparse signal reconstruction. At the same time, the sparsity of the signal can also be enhanced by the adaptive dynamic updating of the K-SVD dictionary. Combining the two methods can efficiently and accurately recover structured non-sparse signals. Therefore, this paper proposes a compression and reconstruction method based on the BSBL-BO algorithm and the K-SVD dictionary. In addition, this article also establishes an evaluation index for the effect of data compression. First, divide the original signal into blocks. Use the K-SVD dictionary to obtain optimal sparse decomposition to train the actual movement to improve the re-construction performance of the restored signal. Second, use the BSBL-BO algorithm to restore structured non-sparse signals. Compared with other reconstruction algorithms, it has the advantages of high accuracy and a good data compression effect. Finally, the proposed BSBL-KSVD algorithm is verified through a diesel engine valve clearance experiment and fault classification. The experimental results prove that the BSBL-KSVD algorithm proposed in this paper is practical and feasible, providing a reference basis for wireless data transmission of reciprocating mechanical vibration signals.
The main contributions of this paper are summarized as follows:
Using the BSBL-KSVD algorithm and exploiting the intra-block correlation of the vibration signal, we can recover the structured non-sparse signal efficiently. Compared with other traditional compression and reconstruction algorithms. We can effectively improve the reconstruction accuracy and compression effect;
A comprehensive evaluation index of compression effect suitable for reciprocating mechanical vibration signal is constructed, and it has a good engineering application prospect;
We apply compressed sensing technology to fault diagnosis. The wireless trans-mission efficiency of the vibration signal can be effectively improved to achieve a better diagnosis effect and has a better reference value.
The second section of this article describes the diesel engine compression reconstruction method model based on BSBL-KSVD; the third part is the comprehensive evaluation index of vibration data compression effect; the fourth part verifies the effectiveness of the compression reconstruction method through preset failure experiments. Finally, this research is summarized.
2. Model of Diesel Engine Compression Reconstruction Method Based on BSBL-KSVD
2.1. Compressed Sensing
In traditional data acquisition and transmission, the Nyquist sampling theorem is used. Usually, the sampling frequency is set to more than twice the highest frequency in the signal under test. Due to the high sampling frequency, a large amount of data is generated. This brings considerable challenges to the wireless data transmission, storage, and remote real-time dynamic monitoring of the operational status of the diesel engine. The emergence of CS theory breaks through the limitation of the traditional vibration signal sampling theorem. Combining the acquisition of vibration signals with the compression process, a small number of signals contains most of the valuable data. Assuming the original signal
and observation matrix
(
), then the signal
x is linearly projected on the matrix
as a compressed signal. Then, the compressed observation of the original signal
can be obtained [
32]:
Among them, v represents the unknown noise vector. The CS algorithm uses the compressed data y and the measurement matrix Φ to restore the original vibration signal x.
2.2. Block Sparse Bayesian Learning Reconstruction Algorithm
We were using the block structure characteristics of sparse signals. Based on the block sparse Bayesian learning framework, data compression can be realized. In actual engineering applications, the signal
x has a block structure feature, as shown in the following equation [
16]:
The model combined by Equations (1) and (2) is called a block sparse data compression model. We use the characteristics of intra-block correlation to improve the ability of compressed data recovery. Therefore, based on the model in the BSBL framework, it is assumed that the independent
xi between each block satisfies a multivariate Gaussian distribution [
16]:
Among them,
and
both represent unknown parameter variables.
represents a non-negative parameter variable that controls the block sparsity of the original signal
x.
represents a positive definite matrix used to obtain the related structure between elements in each block. Assuming that the noise vector obeys the Gaussian prior distribution
, use Bayesian principle to obtain the posterior probability of
x, as shown in the following equation [
16]:
Among them, , .
When the parameters
and
are solved, then the maximum posterior estimate of
x can be obtained as
. Next, use the second type of maximum likelihood estimation method to obtain this parameter, as shown in the following equation [
16]:
where Θ represents the parameters
,
.
2.3. K-SVD Adaptive Over-Complete Dictionary
The traditional fixed dictionary has a particular sparse representation when the signal is sparsely decomposed. Since the sparse representation of the limited dictionary is unknown, its suitability and flexibility are not strong enough. To further improve the sparsity, we need to use an adaptive dictionary learning method for optimization. Therefore, the K-SVD learning dictionary is used as the spare base to obtain a better sparse representation. The dictionary atom is dynamically updated through training until an adaptive over-complete dictionary is obtained. To ensure that the atomic scale in the dictionary is closer to the atomic scale in the original signal, the training process of dictionary D is expressed as [
33]:
In the above equation, represents the given training dictionary matrix, A represents a sparse matrix, and T represents the sparsity of the sparse representation vector to be solved.
Initialization D belongs to a super-complete dictionary, and there is a certain degree of redundancy. Suppose that when we update the
j-th column atom in dictionary D, we also let
Ei be the calculation error after removing the
i-th atom;
dj represents the
j-th column of dictionary D, and
ai represents the
i-th row of sparse matrix A. Then, the objective function is as follows [
33]:
When directly decomposing
Ei, the elements in the obtained
ai may not be sparse. Therefore, only the non-zero elements in
ai need to be updated, defined as the following equation:
represents the index collection of the index of the non-zero element in ai. The SVD decomposition method is used to update the atomic vector gradually, and the sparse representation coefficient matrix A in the dictionary D. Next, we generate a new dictionary through multiple iterative updates.
2.4. Basic Flow of BSBL-KSVD Algorithm
The algorithm flow of compression and reconstruction of diesel engine vibration signal based on BSBL-KSVD is shown in
Figure 1. The algorithm mainly includes dictionary training, data compression, and signal reconstruction.
The specific implementation steps are as follows:
Step 1. The signal is divided into blocks. Customize the collected original vibration signal x into i blocks and the size of the elements in each block;
Step 2. Dictionary training: Initialize the dictionary parameters, set the number of training samples, use the K-SVD algorithm to train the examples, and obtain an optimized dictionary Ψ;
Step 3. Data compression: The vibration signal of reciprocating machinery is more complicated than that of rotating machinery. To further improve the sparsity of the signal, the optimized dictionary Ψ can map the signal to the sparse transformation, and the original signal x = Ψθ can obtain the sparse transformation signal θ. The sensing matrix A = ΦΨ (that is, observation matrix × sparse matrix) compresses the sparse signal data and obtains the data compressed signal observation value y = Aθ;
Step 4. Signal transmission: The block-compressed signals y1,…,yi are successively transmitted through the data network;
Step 5. Signal reconstruction: After receiving the compressed signal block, using the BSBL-KSVD reconstruction algorithm proposed in this article through the sensor matrix A1, …, Ai and compressed signal y1, …, yi to reconstruct, we obtain the restored sparse signal θ1, …, θi. At the same time, we perform inverse sparse transformation to obtain reconstructed signal blocks x1, …, xi and connect the reconstructed signal blocks one by one and finally form a complete reconstructed signal x′.
The pseudo code of the algorithm (Algorithm 1) is as follows:
Algorithm 1 BSBL-KSVD algorithm pseudo code |
1. Input: x = [ x1, x2, …, xi], blkLen, N, M; |
2. Initialize dictionary parameters: param. L = 5, param. K = 70, param. numIteration = 20, param. Initialization Method = ‘Data Elements’; group Start Loc = 1:blkLen:N; |
3. K-SVD dictionary training: [Ψ, output] = KSVD(xi, param); the core is to use Equations (6)–(8) to generate a new dictionary Ψ through multiple iterative updates; |
4. Sparse transformation: xi = Ψθ; |
5. Sensor matrix: Ai = ΦΨ; |
6. Using the combination of Equations (1) and (2), the observation value of the data compression signal is obtained yi = Aiθ; |
7. Signal transmission: The block-compressed signals y1, …, yi are successively transmitted through the data network; |
8. For i = 1: size(xi,2)/N (signal reconstruction); |
9. θi = BSBL_BO (Ai, yi, groupStartLoc, 0, ‘prune_gamma’, −1, ‘max_iters’, 20); the core is to use Formula (3)–(5) to solve the reconstructed signal θi; |
10. Perform inverse sparse transformation to obtain reconstructed signal blocks: x1, …, xi; |
11. Connect the reconstructed signal blocks one by one to finally form a complete reconstructed signal: x′ = x1 + x2, + … + xi; |
12. End; |
13. Output: x′. |
3. Comprehensive Evaluation Index of Vibration Data Compression Effect
Reciprocating machinery vibration signal components are complex when compared to rotating machinery vibration signal components, noise pollution is severe, and a considerable amount of redundant data is created. There are numerous techniques in extant research to solve the data compression challenge. However, innumerable metrics are necessary to evaluate the data compression effect and performance benefits thoroughly. Although data compression technologies are widely utilized in the voice and image sectors, no standardized complete evaluation approach exists. As a result, while researching the vibration data compression method used in reciprocating equipment, it is vital to define a standard for evaluating the data compression effect. The following thorough assessment index of the data compression effect is produced by combining the structural properties of reciprocating equipment vibration data.
3.1. Data Compression Rate Evaluation Index
Data compression rate refers to the ratio of compressed data to the original data. It is a straightforward, intuitive, and easy-to-understand key indicator. Use CR (compressing ratio, CR) to represent the data compression ratio, and the range is set to (0, 1); then, the compression ratio is defined as follows [
34]:
N represents the original signal in the above equation, and M represents the compressed signal. The larger the CR value, the higher the data compression rate. When the data compression rate is higher, it does not mean that the data compression and reconstruction effect is better. It needs to be combined with the standard mean square error index for comprehensive evaluation.
3.2. Standard Mean Square Error Evaluation Index
The data compression rate is used to evaluate the ability of data compression. It shows that the loss rate of the original signal in the data compression process is very high. The accuracy of the reconstructed original signal is closely related to the compression rate. When the data compression rate is more significant, we cannot accurately restore the original signal reconstructed from the compressed signal. Therefore, based on data compression, MSE (mean square error, MSE) is used to represent the standard mean square error index, which the following equation can calculate [
35]:
Z represents the original signal in the equation above, and Z′ represents the reconstructed signal. The smaller the MSE value, the higher the accuracy of the data compression reconstructed signal. When the data compression rate is more significant, the MSE value is smaller, indicating better data compression and reconstruction effect.
3.3. Peak Signal-to-Noise Ratio Evaluation Index
The peak signal-to-noise ratio refers to the ratio of the original signal to the data com-pressed and reconstructed signal. In data compression, the loss of data information is reduced, and the quality of retaining the original data is improved as much as possible. PSNR (peak signal-to-noise ratio, PSNR) is used to express the peak signal-to-noise ratio [
35], which the following equation can calculate:
z represents the original signal in the equation above, z′ represents the reconstructed signal, and zmax represents the maximum component. The greater the PSNR value, the higher the accuracy of the data compression and reconstruction signal, the closer it is to the original signal. It shows that the data compression and reconstruction effect is better.
3.4. Pearson Correlation Coefficient Evaluation Index
In evaluating the effect of data compression and reconstruction of the signal and using the two indicators of MSE and PSNR, usually, the Pearson correlation coefficient can also be used to evaluate the degree of correlation between the reconstructed signal and the original signal. Use r to represent the Pearson correlation coefficient, and the range is set to (−1,1), which can be calculated by the following equation [
36]:
Z represents the original signal in the equation above, and Z′ represents the reconstructed signal. When the value of r is closer to 1, the similarity between the compressed and reconstructed signal and the original signal is higher and, conversely, the lower the similarity to the actual movement.
3.5. Comprehensive Evaluation Index in Time Domain
In fault prediction and health management, extracting characteristic parameters from vibration signals is crucial. Provide input conditions for further relevant analysis. For compressed data, the compression reconstruction algorithm should be able to recover from the compressed reconstructed signal similar to the original signal. Furthermore, in theory, it is identical to the actual feature parameters. Commonly used time-domain characteristic parameters mainly include mean value, root mean square value, variance and peak value, and other 12 indicators [
37]. Under the same compression ratio, the feature parameters extracted from the reconstructed signal from compressed data are closer to the feature parameters extracted from the original signal, indicating that the less loss in the data compression process, the better the data restoration effect.
To better reflect the compression effect of the reconstructed signal in the time domain signal, the time domain characteristic index
TTi is defined, which can be calculated by the following equation:
In the equation, Ti represents the time-domain feature value of the original signal, and represents the time-domain feature value of the reconstructed signal. The smaller the TTi value, the closer the time-domain characteristic index of the reconstructed signal and the original signal and the more accurate the data compression effect and restoration effect.
Similarly, in order to better evaluate the data compression effect of different compression algorithms, the comprehensive evaluation index
KPIt of time domain characteristics is defined, which can be calculated by the following equation:
In the equation, represents the weight coefficient, which satisfies > 0, and = 1. If there is no special case, the value is set to = 1/12, i = 1,2,3,…,12. The smaller the KPIt value is, the closer the reconstructed signal data recovery is to the time domain index of the original signal, and the more accurate the corresponding data compression effect is.
6. Conclusions
This paper proposes a method of compression and reconstruction of diesel engine vibration signal based on BSBL-KSVD, which is practical and feasible, and compared with other methods, there are advantages. To effectively verify the pros and cons of the BSBL-KSVD algorithm proposed in this study regarding data compression effects, use the CR indicator, MSE indicator, PSNR indicator, r indicator, and KPIs indicator for verification and, finally, compressed and reconstructed signals for fault diagnosis case analysis. The experimental results show that the compression effect of the BSBL-KSVD algorithm is optimal when the compression rate CR = 0.5. The recovered reconstructed signal is closer to the original signal, and good classification accuracy is obtained, which has a good engineering application prospect.
Although the proposed method has achieved good results, we can still improve it in the following aspects: First, this research did not focus on using the reconstructed signal to perform signal repair and noise reduction preprocessing in the follow-up. We will conduct detailed research using methods such as double sparse dictionary learning; second, it did not consider integrating the algorithm with the data acquisition hardware. In the subsequent investigation, embedding the algorithm into FPGA improves front-end data acquisition, transmission performance, and efficiency.