The six investigated RNN (LSTM, Attn.-LSTM, Bi-LSTM, GRU, Attn.-GRU, and Bi-GRU) models were trained with 40%, 60%, and 80% segments of the collected normal and abnormal vibration data in this study. Then, the prediction performance of each model was evaluated on the validation sets. In this section, the predicted vibration results are explained. To compare and analyze the simulation results in detail, the efficiency of each model was comparatively analyzed based on the waveforms of the vibrations, scatter plots, the coefficient of determinations, and simulation runtime.
4.2. Waveform of the Predicted Vibrations and Comparison of Scatter Plots
This subsection compares and analyzes the waveforms and scatter plots of the vibrations predicted from the normal and abnormal motor vibration data for the six investigated models.
Figure 9 and
Figure 10 show the waveforms of the vibrations predicted from the normal and abnormal vibration data, respectively, using the six investigated techniques. For the prediction results of both the normal and abnormal vibration data, the vibration prediction accuracy increased for all six RNN models as the training set increased from 40% to 80%.
In addition, according to the vibration prediction results in
Figure 9a and
Figure 10a, the vibration prediction accuracy of LSTM was the lowest among the six RNN models when the size of the training dataset was 40%. As shown in
Figure 9b and
Figure 10b, when the size of the training dataset was 60%, Attn.-LSTM and Attn.-GRU had the highest prediction accuracy, whereas LSTM and GRU exhibited the lowest prediction accuracy. Moreover, the results in
Figure 9c and
Figure 10c show that when the size of the training dataset was 80%, the RNN techniques with the attention mechanism and bidirectional method achieved vibration prediction accuracies that were very similar to the actual vibrations.
Figure 11 and
Figure 12 below show the scatter plots of the similarities between the actual vibration values and the vibration values predicted from the normal and abnormal vibration data, respectively, for the six investigated models.
According to
Figure 11 and
Figure 12, which show the scatter plots of the results predicted from the normal and abnormal vibration data, respectively, the prediction accuracy of all six RNN models increased with the increase in the training segment.
In particular, according to
Figure 11a and
Figure 12a, the accuracy of the vibrations predicted using LSTM was much lower than the accuracy of the vibrations predicted using the other five RNN models when the size of the training data was 40%. Additionally, according to
Figure 11c and
Figure 12c, the vibrations predicted from the normal vibrations were more accurate than the vibrations predicted from the abnormal vibrations for all six RNN models.
This subsection comparatively analyzed, through the waveforms and scatter plots, the vibration prediction accuracies of the six RNN models according to the increase in the segment of the training data. To comparatively analyze the accuracies of the vibrations predicted in more detail, the changes in the coefficients of determination of the six RNN models according to the increase in the segment of the training data are analyzed in
Section 4.3.
4.3. Comparative Analysis of the Accuracy of the Predicted Vibrations
This subsection comparatively analyzes the accuracy of the predicted values for each RNN model based on the changes in their coefficients of determination according to the increase in the segment of the training data. To compare the accuracies of the vibrations predicted for the six RNN techniques, the coefficient of determination is defined as follows:
where
,
,
, and
denote the total number of vibration data values, the actual vibration values, the predicted vibration values, and the average of the vibration value, respectively.
Figure 13 below shows the average value of the coefficient of determination according to the change in the size of the training data. Here, the value of the coefficient of determination was additionally calculated for the 50%, 70%, and 90% segments of the training data to analyze in more detail the changes in the prediction accuracies of the LSTM, Attn.-LSTM, Bi-LSTM, GRU, Attn.-GRU, and Bi-GRU models with the increase in the size of the training data.
The solid and dotted lines in
Figure 13 represent the average value of the coefficient of determination for the results predicted from normal and abnormal vibration data, respectively. According to
Figure 13, the average value of the coefficient of determination for the normal vibration data was higher than that of the abnormal vibration data in most segments of the training data for the RNN techniques. However, as an exception, the average value of the coefficient of determination for GRU was lower for the normal vibration data than for the abnormal vibration data when the size of the training data was 40%.
Table 4 below shows the average value of the coefficient of determination for each segment of the training data, as shown in
Figure 13.
According to
Table 4, the average value of the coefficient of determination for the vibrations predicted from all six RNN models increased with the increase in the size of the training data. In addition, the average value of the coefficient of determination for the normal predicted vibrations converged to one for all six RNN models. However, the average value of the coefficient of determination for the results predicted from abnormal vibration data converged to 0.9, which is less than one. This subsection compared the average values of the coefficient of determination according to the increase in the segment of the training data. Next, in
Section 4.4, the time taken to predict the vibrations is also considered to analyze the runtime efficiency of the investigated models.
4.4. Comparative Analysis of Runtime Efficiency
Section 4.2 and
Section 4.3 verified that the vibration prediction accuracies for the LSTM, Attn.-LSTM, Bi-LSTM, GRU, Attn.-GRU, and Bi-GRU models all increased with the increase in the size of the training data. Here, this subsection comparatively analyzes the efficiency of each RNN model by comparing the simulation runtimes and the coefficients of determination.
Table 5 shows the average simulation runtime of each model when the size of the training data was 40%, 60%, and 80%. According to
Table 5, the simulation runtime required to predict the vibrations increased with the increase in the size of the training data for all six RNN models. Therefore, the model with the highest coefficient of determination compared to the simulation runtime required to predict the vibrations was the most efficient technique for predicting vibrations.
Figure 14 below compares both the average simulation runtimes and average values of the coefficient of determination for the six RNN models to compare the efficiency of each model.
In
Figure 14, the circle, triangle, and square represent the average value of the coefficient of determination when the size of the training data was 40%, 60%, and 80%, respectively. In addition, the bar graph shows the simulation runtimes of the RNN models. According to
Figure 14a,b, the average values of the coefficient of determination of Attn.-LSTM and Attn.-GRU, with the attention mechanism, were higher than or similar to those of the other RNN models. Therefore, in this subsection, the simulation runtime required for the prediction and the rate of change for the coefficient of determination is analyzed when the attention mechanism and bidirectional technique were incorporated into the existing LSTM and GRU techniques.
Table 6 below compares the rate of change for the coefficient of determination and the simulation runtime when the attention mechanism and bidirectional technique were incorporated into the LSTM technique.
According to
Table 6, the accuracy compared to the rate of increase in the simulation runtime was higher in Attn.-LSTM, with the attention mechanism, than in Bi-LSTM, with the bidirectional technique, for both normal and abnormal vibration prediction results in all segments of the training data. For example, when the size of the training data was 40%, the average value of the coefficient of determination for Attn.-LSTM increased by about 446.6% compared to that for LSTM, whereas the simulation runtime increased by only 19.7%.
However, the average value of the coefficient of determination increased by about 299.6% when predicting vibrations using Bi-LSTM compared to when LSTM was used, while the simulation time increased by 55.1%, which was greater than the increase when Attn.-LSTM was used. When the segment of the training data was 40%, the rate of increase in the simulation runtime was lower for Attn.-LSTM than for Bi-LSTM, while the vibration prediction accuracy increased more for Attn.-LSTM. Hence, the comparative analysis verified that Attn.-LSTM was more efficient than Bi-LSTM.
Moreover, when the segment of training data was 60%, the increase in the average value of the coefficient of determination for Bi-LSTM was about 4.3% higher than that for Attn.-LSTM; however, the simulation runtime increased by about 36.7% more for Bi-LSTM than for Attn.-LSTM. Hence, Attn.-LSTM had better efficiency than Bi-LSTM. In particular, when the abnormal vibration data predicted from the segment of the training data was 80%, the average value of the coefficient of determination decreased slightly by about 1.1%, while the simulation runtime increased by about 41.6%. Therefore, the efficiency of Attn.-LSTM was superior to that of Bi-LSTM. Therefore, the rate of change in the simulation runtime and the average value of the coefficient of determination were compared between the Attn.-LSTM and Bi-LSTM techniques and analyzed. This comparative analysis confirmed that Attn.-LSTM had the best efficiency.
Table 7 below shows the change rate in the simulation runtime and the average value of the coefficient of determination when the attention mechanism and bidirectional techniques were incorporated into the GRU technique.
According to
Table 7, both Attn.-GRU and Bi-GRU have an increased simulation runtime compared to general GRUs in all segments of the training data. However, the increase in the average value of the coefficient of determination was greater for Attn.-GRU than for Bi-GRU. For example, when the size of the training data was 40%, the average value of the coefficient of determination for Attn.-GRU increased by about 23.2% compared to that for GRU, and the simulation runtime increased by only about 14.9%. Meanwhile, the average value of the coefficient of determination for Bi-GRU increased slightly by about 6.7% compared to that for GRU, and the average simulation runtime increased by about 47.3%.
Similarly, when the size of the training data was 60%, the average value of the coefficient of determination for Attn.-GRU increased by about 6% compared to that for GRU, and the simulation runtime increased by about 16.1%. Meanwhile, the average value of the coefficient of determination for Bi-GRU increased by about 2.7% compared to that for GRU, and the average simulation runtime increased by about 44.6%.
In addition, when the size of the training data was 80%, the average value of the coefficient of determination for Attn.-GRU increased by about 6.3% compared to that for GRU, whereas the average value of the coefficient of determination for Bi-GRU increased by about 3.8%. As for the average simulation runtime, the average simulation runtime for Bi-GRU increased by 40.1%, which was 28.1% higher than the increase in the average simulation runtime for Attn.-GRU.
Therefore, the comparison of the rate of change in the average simulation runtime and the average coefficient of determination confirmed that Attn.-GRU had the best vibration prediction efficiency.
Section 4.5 explains why the RNN techniques with the attention mechanism had the best efficiency.
4.5. Analysis of Vibration Prediction of Attention Mechanism
This subsection explains why the best performance was achieved when the attention mechanism was incorporated into the LSTM and GRU techniques.
In
Figure 15, the values in the blue boxes represent the input vibration values used for the prediction, and the values in the red boxes represent the predicted vibration values. In addition,
,
, and
in
Figure 15 denote the collected vibration data values, the input vibration values used for the prediction, and the predicted output vibration values, respectively.
As shown in
Figure 15, the number of input vibration values was set to 200, the same size as one cycle of the waveform of the vibration data. Then, a total of 800 vibration values (
) were predicted sequentially from these input vibration values. The 200 vibration values were used as the input to the encoder of the attention mechanism. The attention value, a vector representing the correlation with the predicted vibration values as in Equation (16), was calculated using these input values and used as the input to the output layer. Hence, the correlation with the to-be-predicted vibration values was passed to the output layer (
) of the neural network. In other words, Attn.-LSTM and Attn.-GRU calculated the correlation between the input vibration values and the to-be-predicted vibration values before predicting the vibrations. Conversely, the general LSTM and GRU, Bi-LSTM, and Bi-GRU predicted the vibration values without calculating the correlation between the input vibration values and the to-be-predicted vibration values. Therefore, Attn.-LSTM and Attn.-GRU have a higher prediction accuracy than other models.
On the one hand, Bi-LSTM and Bi-GRU collected information in both the backward and forward directions to increase the amount of information to be used for prediction. As a result, the vibration prediction accuracy was improved. However, the simulation runtime also increased rapidly. On the other hand, Attn.-LSTM and Attn.-GRU obtained information on the correlation between the input values and the to-be-predicted values only in the forward direction without unnecessary repetition. Hence, their vibration prediction accuracy is similar to or higher than that of Bi-LSTM and Bi-GRU, but their simulation runtime is shorter.