Classification of Power Quality Disturbances Using Resnet with Channel Attention Mechanism

Su Pan, Xingyang Nie, Xiaoyu Zhai, Biao Wang, Huilin Ge, Cheng He and Zhenping Ding Xingyang Nie, Su Pan, Biao Wang, Huilin Ge, and Cheng He are with the Ocean College, Jiangsu University of Science And Technology, Zhenjiang 212003, China. E-mail: starsun87@126.com, 1242425221@qq.com, wangbiao@just.edu.cn, ghl1989@just.edu.cn, hecyz@qq.com. Xiaoyu Zhai is with Nanjing Research Institute of Electronic Equipment, China Aerospace Science and Industrial Corporation, Nanjing 210000, China. E-mail: 573911383@qq.com. Zhenping Ding, Nanjing University of Science and Technology, Zijin College, Nanjing 210023, China. E-mail:dzpseu@139.com(Corresponding author: Xingyang Nie.)

Abstract

The detection and classification of power quality disturbances (PQDs) carries significant importance for power systems. In response to this imperative, numerous intelligent diagnostic methods have been developed. However, existing identification methods usually concentrate on single-type signals or on complex signals with two types, rendering them susceptible to noisy labels and environmental effects. This study proposes a novel method for the classification of PQDs, termed ST-GSResNet, which utilizes the S-Transform and an improved residual neural network (ResNet) with a channel attention mechanism. The ST-GSResNet approach initially uses the S-Transform to transform a time-series signal into a 2D time-frequency image for feature enhancement. Then, an improved ResNet model is introduced, which employs grouped convolution instead of the traditional convolution operation. This improvement aims to facilitate learning with a block-diagonal structured sparsity on the channel dimension, the highly-correlated filters are learned in a more structured way in the networks with filter groups. By reducing the number of parameters in the network in this significant manner, the model becomes less prone to overfitting. Furthermore, the SE module concentrates on primary components, which enhances the model’s robustness in recognition and immunity to noise. Experimental results demonstrate that, compared to existing deep learning models, our approach has advantages in computational efficiency and classification accuracy.

Index Terms:

Power Quality Disturbances, Deep learning, Improved ResNet, S-Transform.

I Introduction

Rapid industrialization continues to lead to an increase in greenhouse gas emissions, further intensifying global warming. The urgent task at hand is to implement a comprehensive global dual-carbon strategy to address this challenge. This strategy entails achieving carbon peaking and carbon neutrality, promoting the development and utilization of new energy sources, and enhancing energy efficiency and conservation. However, the integration of new intermittent renewable energy sources, electric vehicles, and energy storage devices into the grid can deteriorate power quality [1]; This poses substantial economic challenges and safety risks are posed for both power users and grid companies [2]. Therefore, a crucial step in addressing power quality issues involves accurately identifying power quality disturbances (PQDs).

Power quality is measured by deviations from defined standards in electrical parameters, including voltage, current, and electromagnetic fields, within the power system. PQDs encompass fluctuations in these electrical parameters, primarily induced by the operation of nonlinear loads, switching devices, and momentary faults. Such disturbances can negatively impact the reliability, production efficiency, and lifespan of equipment within the power system, as well as the safety of humans and property. The prevalence of renewable energy sources and power electronics devices in electricity grids has given rise to a ”dual high” power system, which aims to enhance grid flexibility and cleanliness; however, it may have a detrimental effect on power quality by introducing high-frequency harmonics, spikes, and dips [3]. In complex grid environments, PQDs seldom occur in isolation. Instead, often combine fundamental PQDs, resulting in mixed power quality interference, also known as composite or mixed PQDs.

Additionally, these disturbances often involve transient components. Analyzing and identifying mixed PQDs poses a significant challenge due to the intricate interplay between the characteristics of time and frequency domains. Effective management of PQDs is crucial from a practical standpoint [4].

To address this issue, various signal processing techniques, such as the short-time Fourier transform (STFT) [8], wavelet transform (WT) [6], Hilbert-Huang transform (HHT) [7], empirical mode decomposition (EMD) [8], and variational mode decomposition (VMD) [9], have been employed for parameter detection and feature analysis. Among these techniques, the S-Transform (ST) [10] combines the strengths of STFT, WT, and Fourier transform (FT) [11], while overcoming certain limitations of the wavelet transform. Consequently, the ST transform has widespread application in identifying PQDs.In [12], a PQD classification scheme based on Fully Convolutional Networks (FCN) and Bidirectional Gated Recursive Units (BiGRU) is proposed. In [13], the author proposed a method based on deep learning and 2D wavelet scale images for classifying PQDs. The two aforementioned deep learning models employ distinct classification strategies: one directly utilizes a 1D sampling sequence, while the other transforms it into a 2D image using visualization techniques. Compared to the former, the latter strategy offers the advantage of potentially presenting a more comprehensive representation of the PQD signals’ characteristics, such as spectral and nonlinear features, thereby enhancing the model’s classification performance. Furthermore, the method of converting one-dimensional sequences into two-dimensional images provides superior interpretability, allowing for an intuitive understanding of the PQD signal characteristics through observation of the generated images.

The paper outlines several significant contributions: (i) A new method, ST-GSResNet, for the identification of power quality disturbance signals is proposed, which combines the S-Transform with an improved ResNet. This method demonstrates robustness to noise and maintains satisfactory performance even under conditions of noisy labels and environments. (ii) This paper introduces a novel approach that uses multi-resolution analysis with the S-Transform to encode one-dimensional time series signals into two-dimensional images. (iii) In this paper, we introduce group convolution to replace the original convolution, and incorporate the Squeeze and Excite (SE) module into the network, which significantly improves the model’s identification and generalization ability. The proposed method aims to enhance the time-frequency characteristics of Power Quality Disturbance signals and effectively employ Deep Residual Networks (ResNet) to learn advanced features.

The remainder of this paper is structured as follows. Section 2 describes the background. Section 3 describes the research methodology and development. Section 4 shows the results and analysis. Finally, Section 5 concludes this paper.

II RELATED WORK

This section introduces the theoretical topics that are essential for understanding the progress of this work. The discussion begins with the PQDs and extends to other topics pertinent to this research.

II-A S-Transform(ST)

The S-Transform [14], initially proposed by the American mathematician R.L. Bellman in the 1950s, is a mathematical method employed for signal analysis. It processes both continuous-time and discrete-time signals. The principle of the S-Transform involves converting a signal from the time domain (or spatial domain) to the frequency domain. In the frequency domain, signal characteristics are represented by amplitude and phase, facilitating a more thorough understanding and processing of the signal’s frequency components. The S-Transform can be considered a spectral analysis method that combines the characteristics of the short-time Fourier transform and wavelets. It achieves multi-resolution signal analysis effectively, obtaining the accurate phase of each frequency component. By utilizing a window with width inversely proportional to frequency, the S-Transform can provide high time resolution for high-frequency components and high-frequency resolution for low-frequency signal components. Since most complex power quality events are non-stationary, the S-Transform effectively extracts features by employing an adaptable transformation with a Gaussian window.

The formula for the S-Transform is as follows:

S(a,b)=\int\left[f(t)g^{*}(t-b)e^{-2\pi iat}\right]dt

(1)

The output obtained after applying the S-Transform is denoted as -S(a, b), where a and b are two parameters utilized in the S-Transform. -f(t) corresponds to the original signal. Moreover, -g(t) represents a Gaussian window with a width that inversely varies with frequency within the S-Transform. Lastly, -a represents the frequency parameter. The formula for the S-Transform illustrates its process of analyzing signals in the frequency domain. Adjusting the values of parameters a and b enables customization of the analysis window’s frequency and time resolution to better match the characteristics of the signal.

II-B ResNet

ResNet [15] is employed as a crucial variant of CNN for classifying images with a magnitude map, which describes voltage perturbation signals. Compared to traditional CNNs, ResNet incorporates residual connections to link the inputs and outputs of different layers, aiming to address the common issues known as gradient vanishing and Gradient explosion that can arise during training in many deep neural networks. In a Residual Network, the inputs and outputs of each stage are referred to as feature maps, generated by a series of convolutional layers, ReLU layers, and pooling layers. The output of each block represents a feature extraction of the input image, enabling the residual network to further increase its depth further and achieve superior performance.

Refer to caption — Figure 1: shows an illustration of a typical ResNet’s architecture.

Fig. 1 illustrates a typical ResNet architecture. The architecture successfully facilitates message delivery between multiple networks by introducing the residual structure. This enhances the stability of the training process and effectively mitigates the issues of gradient vanishing and Gradient explosion.

III Methodology

In practical applications, models for identifying power quality disturbance signals face various challenges. Signals acquired from real-world scenarios often encounter substantial noise interference, which substantially impairs the capacity of neural networks to learn features. Convolutional kernels and pooling kernels act as common local feature extractors in neural networks. However, signal noise can significantly impact the networks’ ability to learn features. Consequently, the performance of numerous existing methods for identifying power quality disturbance signals diminishes notably in such scenarios.

To tackle the aforementioned challenges, this study presents a novel scheme for recognizing PQD signals, termed ST-GSResNet. As shown in Fig. 2, the proposed method consists of two key components: S-Transform for generating images and GSResNet for classifying images. The specific steps are outlined as follows:

1.

The original power quality disturbance signal is transformed using S-Transform to generate its corresponding image representation..
2.

To classify the generated images, the GSResNet is employed as the classifier.
3.

The model is trained and optimized to enhance recognition accuracy and overall performance.
4.

The practical application involves feeding the power quality disturbance signal into the model and employing ST-GSResNet for recognition.
5.

Based on the identification results, suitable measures are implemented to ensure the stability and reliability of power quality.

III-A Generation Two-dimensional time-frequency spectrogram

The generation of data signals constitutes a foundational step in this research, as a substantial amount of data is crucial for obtaining robust training results in deep learning-based models. The MATLAB tool was employed in this study for data signal generation. This section is in adherence to IEEE standards [16], The frequency for all studied PQDs models studied is fixed at 50 Hz, and parameters are randomly generated within specified ranges. For a comprehensive overview of signal models and standard parameters pertaining to single PQD voltages, please consult Table I.

TABLE I: SIGNAL MODEL OF SINGLE POWER QUALITY DISTURBANCE

Labels	PQD types	Numerical Model	Parameters
$\begin{aligned} \text{V1}\\ &\\ &\end{aligned}$	$\begin{aligned} \text{Harmonic}\\ &\\ &\end{aligned}$	$\begin{aligned} &V(t)=\sin(\omega t)+\alpha_{3}\sin(3\omega t+\varphi_{3})+% \alpha_{5}\sin(5\omega t+\varphi_{5})+\\ &\alpha_{7}\sin(7\omega t+\varphi_{7})\\ &\end{aligned}$	$\begin{aligned} &\alpha_{3}=0\sim 0.15,\alpha_{5}=0\sim 0.15,\\ &\alpha_{7}=0\sim 0.15,\varphi_{3}=0\sim 2\pi,\\ &\varphi_{5}=0\sim 2\pi,\varphi_{7}=0\sim 2\pi\end{aligned}$
V2	sag	$\begin{aligned} V(t)=\left(1-\alpha\left(u\left(t-t_{1}\right)-u\left(t-t_{2}% \right)\right)\right)\sin(\omega t)\end{aligned}$	$\begin{aligned} \alpha=0.1\sim 0.9,\quad t_{2}-t_{1}=4T\sim 9T\end{aligned}$
V3	swell	$\begin{aligned} V(t)=\left(1-\alpha\left(u\left(t-t_{1}\right)-u\left(t-t_{2}% \right)\right)\right)\sin(\omega t)\end{aligned}$	$\begin{aligned} \alpha=0.1\sim 0.9,\quad t_{2}-t_{1}=4T\sim 9T\end{aligned}$
V4	interrupt	$\begin{aligned} V(t)=\left(1-\alpha\left(u\left(t-t_{1}\right)-u\left(t-t_{2}% \right)\right)\right)\sin(\omega t)\end{aligned}$	$\begin{aligned} \alpha=0.9\sim 0.1,\quad t_{2}-t_{1}=4T\sim 9T\end{aligned}$
V5	flicker	$\begin{aligned} V(t)=\left(1+\alpha_{\mathrm{f}}\sin(\beta\omega t)\right)\sin% (\omega t)\end{aligned}$	$\begin{aligned} \alpha_{f}=0.3\sim 0.5,\beta=0.1\sim 0.4\end{aligned}$
$\begin{aligned} \text{V6}\\ &\\ &\end{aligned}$	$\begin{aligned} \text{Oscillatory transient (OT)}\\ &\\ &\end{aligned}$	$\begin{aligned} &V(t)=\sin(\omega t)+\alpha_{2}\mathrm{e}^{-\frac{\left(t-t_{3% }\right)}{\tau}}\sin\left\{\omega_{n}\left(t-t_{3}\right)\right\}\\ &\cdot\left\{u\left(t-t_{3}\right)-u\left(t-t_{4}\right)\right\}\\ &\end{aligned}$	$\begin{aligned} &\alpha_{2}=0.1\sim 0.8,\quad\tau=0.008\sim 0.04,\\ &t_{4}-t_{3}=0.05T\sim 3T,\\ &f_{n}=300\sim 900\mathrm{~{}Hz}\end{aligned}$
$\begin{aligned} \text{V7}\\ &\end{aligned}$	$\begin{aligned} \text{Impulsive transient (IT) }\\ &\end{aligned}$	$\begin{aligned} &V(t)=\sin(\omega t)+\alpha_{2}\mathrm{e}^{-\frac{\left(t-t_{3% }\right)}{\tau}}\left\{u\left(t-t_{3}\right)-u\left(t-t_{4}\right)\right\}\\ &\end{aligned}$	$\begin{aligned} &\alpha_{2}=1\sim 10,\tau=0.008\sim 0.04,\\ &t_{4}-t_{3}=0.05T\sim 3T\end{aligned}$

Utilizing the previously mentioned signal model, a MATLAB simulation script enables the generation of a substantial quantity of individual disturbance samples.Fig. 3 illustrates the typical waveforms.

This section selects prevalent PQDs along with their corresponding signal models and parameter explanations. These encompass 7 types of dual-disturbance power quality compound disturbances. The fundamental frequency for all the investigated models of PQ disturbances was set at 50 Hz, with parameters being randomly generated within the specified ranges. Table II delineates the signal models and standard parameters for the mixed PQDs.

TABLE II: ENVIRONMENT REQUIRED FOR THE EXPERIMENT

Labels	PQD types
V8	harmonics+sag
V9	harmonics+swell
V10	interruption + harmonics
V11	Impulsive transient + sag
V12	Impulsive transient + swell
V13	Impulsive transient + flicker
V14	Impulsive transient + harmonics
V15	harmonics + Oscillatory transient + sag
V16	harmonics + Oscillatory transient + swell
V17	flicker+ Impulsive transient + harmonics
V18	harmonics + Oscillatory transient + Impulsive transient +sag

Utilizing the previously mentioned mixed disturbance signal models, a MATLAB script can generate a substantial number of mixed disturbance samples. Figure 5 illustrates the representative waveforms. The typical waveform is then transformed into a time-frequency plot, as shown in Fig. 4.

After generating, validating, and collecting the data, the subsequent step is to create a scalogram that characterizes the voltage disturbances in the collected signals. Within the MATLAB environment, time-frequency conversion of the generated signal waveforms is performed using the S-Transform to produce their corresponding spectrograms.

By utilizing the coefficients generated by the S-Transform, MATLAB’s color mapping functions, such as the ”jet 264” style, can be employed to analyze these signals and observe the energy scales in the scalogram. This process involves applying the absolute values of the coefficients. Fig. 5 illustrates the time-frequency plot of the signal V10 generated by the S-Transform compared to the original waveform.

After generating all the images containing only scalar plots, they are saved in specific folders corresponding to each PQ disturbance category. The saved images are formatted to be 240x240 pixels and saved in the PNG format using the ”imwrite” command and corresponding supporting code. To achieve this, the ”imwrite” command and corresponding supporting code are used to save a large number of images. The generated dataset consists of 18 categories, each containing 1000 images. Subsequently, these images are divided into training and testing sets in a 7:3 ratio for model evaluation.

III-B Improved ResNet

The improved ResNet model will be used to classify the time-frequency spectrograms of the 18 classes of power quality disturbance signals, which were previously generated by utilizing the S-Transform.As revealed in Fig. 6, the model diagram presents the overall architecture of our proposed model, encompassing the general design on the left, the specific structure of each stage in the middle, and the bottleneck structure on the right. In this context, C, H, W, and S denote the number of channels, height, width, and stride, respectively.

1) Group convolution: The significance of convolutional neural networks in image processing is undeniable. Convolutional operations boast robust feature extraction capabilities and require fewer parameters compared to fully connected networks. Convolution inherently thrives when processing two-dimensional structured data, particularly images. The concept of grouped convolution was first presented in AlexNet[17] in 2012.Duringthattime, considering the memory and computational constraints of a single GPU, grouped convolution was implemented by dividing the feature map and convolution kernel into ’g’ groups along the channel direction. The results obtained from each convolution group are then concatenated to yield the final outcome.

Relative to the traditional convolution operation employed in the ResNet model, grouped convolution augments the network’s ability to model non-linearity and extract intricate features. Additionally, due to the parallelisability of convolution operations, grouped convolution bolsters the computational efficiency of the network. Consequently, this study substitutes the original convolution operation with grouped convolution, thereby further amplifying the capabilities of the original ResNet model.

The standard convolution operation is a fundamental neural network operation used to extract feature information from an input feature map. Illustrated in the figure, this operation involves element-wise multiplication of a set of learnable convolution kernels element-wise with the input feature map and summing the results to generate the output feature map. Grouped convolution involves dividing the input feature map into ”g” groups along the channel dimension and partitioning the convolution kernel into ”g” groups accordingly. Consequently, the size of the convolution kernel in each group is adjusted, resulting in the output feature map being divided into ”g” groups along the channel dimension as well. Fig. 7 show the comparison between ordinary convolution and grouped convolution.

2) Squeeze-and-Excitation: The core of the SE module [18] is the squeeze-and-excitation block. The SE module employs a Multi-Layer Perceptron (MLP) to model each channel, thereby concurrently producing a weight vector that signifies channel attention. Specifically, the feature maps are converted into an intermediate vector through a fully connected layer, which is then followed by the introduction of nonlinearity via an element-wise ReLU activation function. Lastly, the intermediate vector is transformed into a scalar value through another fully connected layer, and the output is interpreted as a vector of channel weights through the application of a sigmoid function.

The complete SE module is composed of the combination of Squeeze and Excitation components. The insertion of the SE block into the network’s convolutional layers aids in the representation of feature correlations and differentiations, thereby leading to the automatic adjustment of channel significance. The integration of the SE block resulted in a significant enhancement in the model’s image classification performance. The SE module was designed based on a straightforward yet effective concept, allowing it to be integrated into various CNN architectures. Fig. 8 illustrates the architecture of the SE module when integrated into ResNet.

Fundamentally, the SE module plays a pivotal role in enhancing ResNet’s performance. The integration of the SE module allows networks to flexibly select and weight features, subsequently enhancing the model’s capacity for representation and classification. In the context of image classification, the SE module effectively steers the network towards the most pertinent features, thereby facilitating learning and accurately discerning complex cases. Moreover, due to its highly flexible and scalable attributes, the SE module can be seamlessly integrated into existing ResNet network architectures. The SE module can enhance network performance by incorporating a limited number of computational operations without introducing additional parameters. The incorporation of the SE module enables the achievement of improved model performance with a minimal increase in computational burden.

3) Nonlinearities: In the seminal work of [19], an innovative nonlinearity termed as h-swish was introduced, showcasing significant effectiveness when flawlessly integrated as a substitute for ReLU. This integration significantly enhances the precision of neural networks. The nonlinearity is explicitly defined as:

\text{h-swish}(x)=x\times\text{ReLU}\left(\frac{x+3}{6}\right)

(2)

In previous studies, a nonlinearity termed swish was presented, which, when utilized as a direct replacement for ReLU, offers numerous advantages. The nonlinearity is defined as:

\text{Swish}(x)=x\cdot\sigma(x)

(3)

\sigma(x)=\frac{1}{1+e^{-x}}

(4)

The benefits of using the sigmoid function in neural network layers include its unbounded nature, smoothness, and non-monotonic properties, which together enhance the network’s expressive power. However, its nonlinearity, while improving accuracy, significantly increases computational demands, particularly on mobile devices. To balance model accuracy and computational efficiency, we employ the h-swish activation function.In the swish function, the input value, x, is scaled by the sigmoid function and then multiplied by x. This setup allows the function to approximate linearity for large x values and display a strong non-linear behavior for small x values.In contrast, the h-swish function normalizes the input value, x, mapping it to the [0, 1] interval, before multiplying it by the original input, x. This approach results in a piecewise linear approximation, making the function simpler and more computationally efficient for both small and large x values.

IV Experimental

IV-A Experimental Setup

1) Experimental Environment and Training Strategy for Model: During training, we employ the Nadam optimization algorithm with a first-order momentum of 0.9, a second-order momentum of 0.999, and a weight decay parameter of 1e-7. We set the initial learning rate to 0.0001 utilizing a cosine annealing learning rate strategy. This allows the learning rate to fluctuate during training according to a cosine function, while maintaining an overall downward trend. The default batch size is set to 16, and training spans 100 epochs, taking approximately 2 to 3 hours.All experiments were conducted on the dataset we created. The detailed training configuration is shown in Table III.

TABLE III: ENVIRONMENT REQUIRED FOR THE EXPERIMENT

Laboratory Setting	Configuration Information
CPU	Intel Core i5-13490F
GPU	Nvidia GeForce RTX 4060ti GPU 16G
CUDA	11.6
Running System	Ubuntu 22.04
Programming Language	Python 3.8
Deep Learning Framework	PyTorch 1.13.1

2) Evaluation metrics: Evaluation metrics are crucial for assessing the effectiveness of diagnostic algorithms, rendering them essential in data analysis. In the context of intelligent fault diagnosis, the accuracy rate serves as a widely adopted evaluation metric. It measures the proportion of accurate predictions in the total sample and thus indicates the classifier’s performance. Higher accuracy rates typically indicate enhanced classifier performance. The definitions of indicators are provided in Eq. (5).

\text{Accuracy}=\frac{TP+TN}{TP+TN+FP+FN}

(5)

The terms TP, TN, FP, and FN correspond to true positives, true negatives, false positives, and false negatives, respectively, within the context of class i. In the context of our study, represents the output of the network model, while denotes the true label. Accuracy is an intuitive metric that represents the proportion of samples correctly predicted by a model, typically expressed as a percentage. This characteristic renders accuracy a highly intuitive evaluation metric that effectively communicates the model’s overall performance. While accuracy is an intuitive and essential metric, relying solely on it to evaluate the performance of an algorithmic model requires more scientific rigor and comprehensiveness. Therefore, this paper additionally employs a confusion matrix to comprehensively assess the classifier’s performance. The confusion matrix offers more information than a single accuracy measure and can depict the confusion between different categories, including the proportions of correctly and incorrectly classified cases. Utilizing the confusion matrix, we can compute the number of correct and incorrect judgments made by the model on the samples.

IV-B Robustness to noise

The performance of ST-GSResNet was evaluated across varying signal-to-noise ratios (SNR), namely at 20 dB, 30 dB, and 40 dB. The dataset comprised 18,000 samples, with each PQDs category comprising 1,000 samples. Seven hundred samples were employed for model training, while the remaining 300 samples were allocated for testing. Table IV illustrates that the classification accuracy decreases as signal-to-noise ratios (SNRs) decrease. At 20 dB, assigning categories C14 and C17 results in a classification accuracy drops to 88.3% and 85.3%, respectively. However, ST-GSesNet achieved an exceptional overall accuracy of 96.8%.

Additionally, as the signal-to-noise ratio increased to 30 dB, the accuracy rose to 98.6%. The results indicate that the CNN model incorporates advanced details despite utilizing only the most basic S-Transform in dataset generation. Our approach, ST-GSesNet, aims to enhance the model’s recognition ability and exhibits remarkable tolerance to environmental noise. This can be attributed to the application of grouped convolution and incorporating the SE module in the model design. Grouped convolution enhances the model’s expressive capability by introducing additional nonlinear transformations. Simultaneously, the SE module dynamically learns the significance of each channel, improving the model’s performance by prioritizing essential feature channels and reducing the impact mitigating the influence of irrelevant information.

Following this, the confusion matrices originating from Table 4 are presented in Fig. 9, which illustrate the accuracy of our proposed method, particularly against label noises. To highlight the robustness of the feature extraction capability inherent in ST-GSResNet, we visualize the feature distributions of the test samples via t-SNE, as depicted in Fig. 10. As can be seen from the top plot of the comparison in Fig. 10, the classification boundary becomes indistinguishable when time–frequency images, transformed by ST, are compressed to a 2D representation. However, as shown on the bottom side of Fig 10, after undergoing training through GSResNet, the features in the final layer become clearly defined. Even at an SNRdB level of 20, the boundaries between categories remain clearly distinguishable. This demonstrates ST-GSResNet’s capacity to extract features that significantly contribute to the excellent classification of power quality disturbance signals.

TABLE IV: THE CLASSIFICATION ACCURACIES OBTAINED UNDER DIFFERENT NOISY ENVIRONMENTS

Classes	Signal-to-Noise Ratio
	20dB	30dB	40dB	No noise
V1	98.0%	96.7%	100%	100%
V2	95.0%	98.3%	100%	100%
V3	99.7%	100%	100%	100%
V4	99.3%	100%	100%	100%
V5	99.3%	100%	100%	100%
V6	99.3%	98.7%	97.7%	99.0%
V7	93.3%	99.7%	100%	100%
V8	100%	100%	100%	100%
V9	100%	100%	100%	100%
V10	97.3%	92.7%	91.3%	100%
V11	98.3%	100%	100%	100%
V12	99.7%	100%	100%	100%
V13	95.0%	99.7%	100%	100%
V14	88.3%	96.7%	99.7%	100%
V15	99.3%	99.3%	100%	100%
V16	93.3%	100%	100%	100%
V17	85.3%	96.3%	98.3%	99.0%
V18	94.0%	99.3%	99.7%	99.3%
Overall	96.5%	98.8%	99.3%	99.9%

IV-C Comparison with existing methods

TABLE V: COMPARISON WITH OTHER EXISTING METHODS

Method	PQD No.s	Features No.s	Accuracy(%)
			20dB	30dB	40dB
CNN from Scratch [20]	6	-	96.7	97.3	-
ST and PNN [21]	9	4	-	98.6	99.1
ST and NSGA-II [22]	15	26	96.4	97.3	99.4
DWT and PNN [23]	16	9	93.6	95.2	98.6
HHT+WBELM [24]	15	36	91.5	-	95.6
DBN+ELM [25]	21	12	95.8	98.2	98.7
SWT+EfficientNetB0 [26]	15	Auto	92.8	99.0	99.2
ST+ResNet50	18	Auto	90.4	94.3	96.7
ST+GSResNet	18	Auto	96.5	98.8	99.3

This section presents a comparative analysis of the proposed method compared with other schemes in the field of PQDs detection and classification. Table V summarizes the results of this comparative study, demonstrating that our proposed method surpasses other methods in both accuracy and the number of studied PQDs categories. Our proposed method utilizes the S-Transform (ST) for feature extraction and an enhanced ResNet-based technique for classification. A comparison of our proposed method with several recent studies reveals its superior accuracy and its ability to handle a greater number and complexity of PQDs categories. The method classifies signals with signal-to-noise ratios of 40 dB, 30 dB, and 20 dB, achieving accuracies of 99.3%, 98.8%, and 96.5%, respectively. Overall, our proposed PQD detection and classification method surpasses other methods in terms of accuracy.

Prior investigations resulted in fewer classes of PQDs with a lower detection accuracy compared to our proposed method. Utkarsh Singh et al. [21] introduced an S-Transform (ST) and NSGA-II-based Randomized Binary Decision Tree (RBDT) classifier to classify 15 classes of PQDs. However, at 30 dB and 20 dB noise conditions, the accuracies are 97.3% and 96.4%, respectively [22], which are lower than those achieved by our proposed method. In 2017, Khokhar, S. et al [23] . introduced a Discrete Wavelet Transform (DWT) and Probabilistic Neural Network-Artificial Bee Colony (PNN-ABC) based Randomized Binary Decision Tree (RBDT) classifier to classify 16 classes of PQDs. The accuracies were 98.6%, 95.2%, and 93.6% for 40 dB, 30 dB, and 20 dB noise, respectively [21]. Compared to these two schemes, our approach demonstrates higher accuracy for 40 dB and 20 dB noise, incorporating more categories and more complex signal models using the same S-Transform-generated time-frequency maps. Sahani and Dash introduced a classifier based on Hilbert-Huang Transform (HHT) and Wavelet-Based Extreme Learning Machine (WBELM) for categorization with 40 dB and 20 dB noise. The accuracies were 95.6% and 91.5%, respectively [24]. Swarnkar et al. designed a multivariate PQ interference identification algorithm using a mixture of S-Transform (ST), Hilbert Transform (HT), and Randomized Binary Decision Tree (RBDT). The accuracies were 98.7%, 98.2%, and 95.8% for 40 dB, 30 dB, and 20 dB noise, respectively [25]. These two schemes exhibit lower accuracy compared to our proposed method. Y. S. Upendra Vishwanath proposed a PQD classification scheme that combines synchrosqueezed wavelet transform (SWT) and EfficientNetB0 [26]. We selected the results of the dataset from their paper, which also incorporates White Noise, for comparison. From the comparative analysis, it was observed that although our scheme’s performance closely mirrors that of Vishwanath’s scheme in the presence of 30dB and 40dB of noise, our scheme’s performance improves by nearly 4 percentage points in the presence of 20dB noise. This indicates that our scheme exhibits a stronger resistance to noise.

Our proposed method outperforms all other methods, even when tested in a noisy environment and dealing with more complex PQDs classes. Overall, these results demonstrate the high effectiveness of our proposed method, surpassing other existing PQDs detection and classification methods.

V Conclusion

In this study, we introduce the ST-GSResNet scheme for PQDs recognition and classification, with a focus on evaluating the model’s performance. The experimental dataset comprises time-frequency images based on the S-Transform. Employing an improved ResNet model, we utilize grouped convolution to reduce the number of parameters, model complexity, and computational cost. Additionally, grouped convolution introduces additional nonlinear transformations, enriching feature representations to improve model performance and generalization.

Furthermore, we introduce the SE module innovatively to enhance learning and focus on crucial parts of the input data, thereby improving recognition robustness and noise resistance. Compared to alternative deep learning methods, our approach demonstrates advantages such as reduced training time, increased accuracy, and fewer parameters. Future efforts will focus on gathering PQD datasets from various domestic and international regions for comprehensive classification and recognition assessments, further validating the method’s practicality and effectiveness. Secondly, we employ knowledge distillation to reduce the model’s size further, enabling its operation on smaller embedded devices to meet portability requirements in practical settings.

With these enhancements and future investigations, the ST-GSResNet method will achieve more substantial progress in electrical energy and deep learning. We look forward to further exploring and applying the method to provide more reliable and efficient solutions for PQDs identification and classification in power systems.

References

[1] Y. Wang et al., “IEEE ICHQP2018 and development directions of power quality,” Electr. Power Autom. Equip., vol. 39, pp. 197–203, 2019.
[2] H. Lin, “Perfecting power quality indices and prospect,” Proceedings of the CSEE, vol. 34, no. 29, pp. 5073–5079, 2014.
[3] Y. Liu, T. Jin, M. A. Mohamed, Q. Wang, “A novel three-step classification approach based on time-dependent spectral features for complex power quality disturbances,” IEEE Trans. Instrum. Meas., vol. 70, pp. 1–14, 2021, doi:https://doi.org/10.1109/TIM.2021.3050187
[4] Y. Xu, S. Fan, S. Xie, M. Lu, “Power quality detection and classification in high permeability active distribution network based on IEWT-MFDE,” CSEE J. Power Energy Syst., vol. 8, no. 6, pp. 1–13, 2022, doi: https://doi.org/10.17775/CSEEJPES.2020.00110
[5] T. Zhong, S. Zhang, G. Cai, Y. Li, B. Yang, Y. Chen, “Power quality disturbance recognition based on multiresolution S-transform and decision tree,” IEEE Access, vol. 7, pp. 88380–88392, 2019, doi: https://doi.org/10.1109/ACCESS.2019.2924918
[6] S. Chen, Z. Li, G. Pan, F. Xu, “Power quality disturbance recognition using empirical wavelet transform and feature selection,” Electr. (Basel), vol. 11, no. 2, pp. 1–20, 2022, doi: https://doi.org/10.3390/electronics11020174
[7] P. Li, J. Gao, D. Xu, C. Wang, X. Yang, “Hilbert-Huang transform with adaptive waveform matching extension and its application in power quality disturbance detection for microgrid,” J. Modern Power Syst. Clean Energy, vol. 4, no. 1, pp. 19–27, 2016, doi: https://doi.org/10.1007/s40565-016-0188-5
[8] T. Zhong, S. Zhang, G. Cai, Y. Li, B. Yang, Y. Chen, “Power quality disturbance recognition based on multiresolution S-transform and decision tree,” IEEE Access, vol. 7, pp. 88380–88392, 2019, doi: https://doi.org/10.1109/ACCESS.2019.2924918
[9] C. Zhao, K. Li, Y. Li, L. Wang, Y. Luo, X. Xu, X. Ding, Q. Meng, “Novel method based on variational mode decomposition and a random discriminative projection extreme learning machine for multiple power quality disturbance recognition,” IEEE Trans. Ind. Inf., vol. 15, no. 5, pp. 2915–2926, 2019, doi: https://doi.org/10.1109/TII.2018.2871253
[10] J. Li, Y. Yang, H. Lin, Z. Teng, F. Zhang, Y. Xu, “A voltage sag detection method based on modified S transform with digital prolate Spheroidal window,” IEEE Trans. Power Delivery, vol. 36, no. 2, pp. 997–1006, 2021, doi: https://doi.org/10.1109/TPWRD.2020.2999693
[11] S. P. Larissa, V. A. Mateus, A. R. David, D. F. Danton, N. H. Belis´ario, R. M. S. Leandro, A. D. Carlos, “Compression method of power quality disturbances based on independent component analysis and fast Fourier transform,” Electr. Power Syst. Res., vol. 187, pp. 1–7, 2020, doi: https://doi.org/10.1016/j.epsr.2020.106428
[12] Y. Liu, D. Yuan, H. Fan, T. Jin and M. A. Mohamed, “A Multidimensional Feature-Driven Ensemble Model for Accurate Classification of Complex Power Quality Disturbance,” IEEE Transactions on Instrumentation and Measurement, vol. 72, pp. 1–13, 2023, Art no. 1501613, doi: https://doi.org/10.1109/TIM.2023.3265756
[13] H. I. Uckol, S. Ilhan, “Identification of corona discharges based on wavelet scalogram images with deep convolutional neural networks,” Electric Power Systems Research, vol. 224, 2023, doi: https://doi.org/10.1016/j.epsr.2023.109712
[14] R. G. Stockwell, “Localization of the complex spectrum: The S transform,” IEEE Transactions on Signal Processing, vol. 44, no. 4, pp. 998–1001, 1996.
[15] K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 770–778, doi: https://doi.org/10.1109/CVPR.2016.90
[16] “IEEE Recommended Practice for Monitoring Electric Power Quality,” in IEEE Std 1159-2019 (Revision of IEEE Std 1159-2009), no. vol., pp. 1–98, Aug. 13, 2019, doi: https://doi.org/10.1109/IEEESTD.2019.8796486
[17] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification with deep convolutional neural networks,” in Advances in Neural Information Processing Systems 25, 2012.
[18] J. Hu, L. Shen, and G. Sun, “Squeeze-and-excitation networks,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018.
[19] B. Koonce and B. Koonce, “MobileNetV3,” in Convolutional Neural Networks with Swift for Tensorflow: Image Recognition and Dataset Categorization, pp. 125–144, 2021.
[20] R. S. Salles and P. F. Ribeiro, “The use of deep learning and 2-D wavelet scalograms for power quality disturbances classification,” Electric Power Systems Research, vol. 214, pp. 108834, 2023, doi: https://doi.org/10.1016/j.epsr.2022.108834
[21] H. H. Wang, P. Wang, and T. Liu, “Power Quality Disturbance Classification Using the S-transform and probabilistic neural network,” Energies, vol. 10, no. 1, pp. 1–19, 2017.
[22] U. Singh and S. N. Singh, “Optimal feature selection via nsga-ii for power quality disturbances classification,” IEEE Trans. Ind. Inf., vol. 14, no. 7, pp. 2994–3002, 2018.
[23] S. Khokhar, A. A. M. Zin, A. P. Memon, and A. S. Mokhtar, “A new optimal feature selection algorithm for classification of power quality disturbances using discrete wavelet transform and probabilistic neural network,” Measurement, vol. 95, pp. 246–259, 2017.
[24] M. Sahani and P. K. Dash, “Automatic power quality events recognition based on Hilbert Huang transform and weighted bidirectional extreme learning machine,” IEEE Trans. Ind. Informat., vol. 14, no. 9, pp. 3849–3858, Sep. 2018.
[25] N. K. Swarnkar, O. P. Mahela, and M. Lalwani, “Multivariable signal processing algorithm for identification of power quality disturbances,” Electr. Power Syst. Res., vol. 221, Aug. 2023, Art. no. 109480.
[26] Y. S. U. Vishwanath, S. Esakkirajan, B. Keerthiveena, and R. B. Pachori, “A Generalized Classification Framework for Power Quality Disturbances Based on Synchrosqueezed Wavelet Transform and Convolutional Neural Networks,” IEEE Transactions on Instrumentation and Measurement, vol. 72, pp. 1-13, 2023, Art. no. 2525313, doi: https://doi.org/10.1109/IEEESTD.2019.8796486.