Contemporary diagnostic methods encompass time-domain, frequency-domain, machine learning, and deep learning techniques, with general procedures involving signal acquisition and processing, feature extraction, and fault classification [
2,
3,
4,
5,
6]. However, diagnostic precision is impacted by the non-linear and non-stationary characteristics of signals, which lead to the obtained signals failing to fully express the bearing’s state [
7]. Yan et al. employed multi-scale enveloping spectrum (MuSEnE) technology to visually ascertain bearing fault locations based on defect characteristic frequencies [
8]. Hoang, Duy-Tang et al. proposed a method for diagnosing bearing faults using a deep convolutional neural network. Through the direct use of vibration signals as the input, they eliminated the need for feature extraction and achieved a high accuracy and robustness in noisy environments with the CWRU dataset [
9]. Mohammadreza Ghorvei et al. proposed a novel deep subdomain adaptation graph convolutional neural network (DSAGCN), addressing unsupervised domain adaptation, data geometric structure, and subdomain relation issues in fault diagnosis [
10]. Qiang Wang et al. introduced an adaptive denoising convolutional neural network (ADCNN) to tackle the common challenges of noise interference and varying load conditions in rolling bearing fault diagnosis, thereby enhancing diagnostic accuracy and robustness to meet the challenges of complex industrial environments [
11]. Spyridon Plakias proposed a new deep learning framework for unsupervised fault detection. Through the integration of multiple types of autoencoders and employment of a soft voting mechanism, the framework effectively addressed the issue of identifying abnormal samples with only normal training data, significantly improving fault detection accuracy and robustness [
12]. Jinchuan Qian et al. presented an industrial monitoring method based on autoencoders (AEs), addressing the fault detection and diagnosis requirements of complex processes. Their approach advanced the intelligence of monitoring systems, enhancing both performance and efficiency [
13]. Jiazhen Zhu et al. proposed a load weighted denoising autoencoder (LWDAE), which highlighted useful information from both training and online data through weighted loading matrices. Through the modification of the loss function, the addition of regularization terms, and revisions to the calculation logic, the LWDAE resolved issues of mixed and neutralized process variables in the fault diagnosis process [
14]. Despite advances in bearing fault diagnostic methods, the non-linear and non-stationary nature of signals continues to pose challenges to diagnostic accuracy. Generative adversarial networks (GANs) have been introduced as an innovative approach to improve effectiveness. GANs learn complex data distributions through training generators and discriminators, thereby extracting deeper fault characteristics directly from raw signals without the need for manual feature extraction. Liang et al. merged generative adversarial networks (GANs) with convolutional neural networks (CNNs) to accomplish end-to-end feature extraction, minimizing information loss and enhancing diagnostic efficiency [
15]. Ren, Z. et al. proposed a multi-domain GAN with a self-reasoning training strategy to generate diverse, reliable samples. Separate domain models ensured training stability and similarity to real samples, addressing overfitting in small sample conditions and improving fault diagnosis accuracy [
16]. Zhou, K. introduced a semi-supervised DCGAN method to diagnose gear faults with limited training data, combining labeled and unlabeled data to enhance performance. The approach ensures model stability, prevents collapse, and effectively explains the correlations between known and unseen faults [
17]. Wang, R. et al. proposed the GFMGAN method to address data scarcity in fault diagnosis via expansion of the original dataset, which improved the reliability of vibration data and diagnostic accuracy [
18].
The above methods not only solve the problem of non-linear and non-stationary signals but also solve the problem of missing signals, ensuring that the fault characteristics of the original vibration signal are retained. The WGAN, an extension of the GAN [
19], was developed to address the limitations of both JS and KL divergence inherent in GANs and partially alleviate the vanishing gradient issue. Wang et al. employed data augmentation for under-represented classes and a WGAN to bolster small sample datasets, thereby enhancing the precision of fault diagnosis [
20]. Wu et al. used a WGAN to enhance the number of fault samples in planetary gearboxes. He et al. integrated a WGAN with an attention mechanism to augment the model’s multi-fault identification capabilities [
21]. Nonetheless, the WGAN discriminator, still grounded in GANs, exhibits incomplete fault feature extraction in convolutional layers, leading to suboptimal model training accuracy. This study proposes the implementation of the R-FCN as a substitute for the conventional discriminator network and the adoption of a two-dimensional convolutional layer for fault feature extraction as a means of refining the WGAN.
As the number of network layers escalates, optimization of network hyperparameters becomes increasingly challenging. The prevailing optimization techniques primarily encompass grid search (GS) [
22], random search (RS) [
23], genetic algorithms (GAs) [
24], and Bayesian optimization (BO) [
25]. Nv et al. employed GS optimization for maximum correlation kurtosis deconvolution and successfully extracted feeble initial-stage bearing fault vibration signals [
26]. Chen et al. harnessed response surface (RS) optimization for long short-term memory (LSTM) models, accurately pinpointing various bearing fault locations and damage levels [
27]. Xu et al. utilized a quantum genetic algorithm (QGA) for the global optimization of support vector machines (SVMs) to accurately identify and classify bearing faults [
28]. Although the aforementioned methodologies can optimize networks to a certain degree, they remain inadequate in obtaining network hyperparameters for intricate problems with unknown objective functions [
29]. Conversely, the Bayesian optimization algorithm (BOA) necessitates only a limited number of objective function evaluations to achieve optimal outcomes, streamlining network optimization.
In conclusion, existing methods have limitations when dealing with the non-linear and non-stationary characteristics of bearing fault signals, leading to a decrease in diagnostic accuracy. Furthermore, many current methods rely on a single dataset for validation and lack extensive data support to fully demonstrate their effectiveness and robustness. Although the traditional generative adversarial network (GAN) can partially address the issue of small sample sizes, deficiencies in fault feature extraction still impact the training accuracy and diagnostic efficacy of the model. Given this situation, this study proposes an enhanced WGAN fault diagnosis method by replacing the conventional discriminator network with R-FCN and utilizing two-dimensional convolutional layers for improved fault feature extraction. Additionally, the Bayesian optimization algorithm (BOA) is employed to optimize the enhanced WGAN by obtaining optimal network hyperparameters, while a semi-supervised learning loss function is formulated to optimize both the generator and discriminator to perform bearing fault diagnosis.