Classification of Power Quality Disturbances Using Resnet with Channel Attention Mechanism

Su Pan, Xingyang Nie, Xiaoyu Zhai, Biao Wang, Huilin Ge, Cheng He, and Zhenping Ding

Xingyang Nie, Su Pan, Biao Wang, Huilin Ge, and Cheng He are with the Ocean College, Jiangsu University of Science and Technology, Zhenjiang 212003, China. E-mail: starsun87@126.com, 1242425221@qq.com, wangbiao@just.edu.cn, ghl1989@just.edu.cn, hecyz@qq.com.
Xiaoyu Zhai is with the Nanjing Research Institute of Electronic Equipment, China Aerospace Science and Industrial Corporation, Nanjing 210000, China. E-mail: 573911383@qq.com.
Zhenping Ding is with Nanjing University of Science and Technology, Zijin College, Nanjing 210023, China. E-mail: dzpseu@139.com.
(Corresponding author: Xingyang Nie.)

Abstract

The detection and classification of power quality disturbances (PQDs) are of significant importance for power systems, and numerous intelligent diagnostic methods have been developed for this purpose. However, existing identification methods usually concentrate on single-type signals or on complex signals with two types, rendering them susceptible to noisy labels and environmental effects. This study proposes a novel method for the classification of PQDs, termed ST-GSResNet, which utilizes the S-Transform and an improved residual neural network (ResNet) with a channel attention mechanism. The ST-GSResNet approach first uses the S-Transform to transform a time-series signal into a 2D time-frequency image for feature enhancement. Then, an improved ResNet model is introduced, which employs grouped convolution instead of the traditional convolution operation. This improvement facilitates learning with a block-diagonal structured sparsity on the channel dimension, so that highly correlated filters are learned in a more structured way in networks with filter groups. Because grouping significantly reduces the number of parameters in the network, the model becomes less prone to overfitting. Furthermore, the squeeze-and-excitation (SE) module concentrates on primary components, which enhances the model’s robustness in recognition and immunity to noise. Experimental results demonstrate that, compared to existing deep learning models, our approach has advantages in computational efficiency and classification accuracy.

Index Terms:
Power Quality Disturbances, Deep learning, Improved ResNet, S-Transform.

I Introduction

Rapid industrialization continues to lead to an increase in greenhouse gas emissions, further intensifying global warming. The urgent task at hand is to implement a comprehensive global dual-carbon strategy to address this challenge. This strategy entails achieving carbon peaking and carbon neutrality, promoting the development and utilization of new energy sources, and enhancing energy efficiency and conservation. However, the integration of new intermittent renewable energy sources, electric vehicles, and energy storage devices into the grid can deteriorate power quality [1]. This poses substantial economic challenges and safety risks for both power users and grid companies [2]. Therefore, a crucial step in addressing power quality issues involves accurately identifying power quality disturbances (PQDs).

Power quality is measured by deviations from defined standards in electrical parameters, including voltage, current, and electromagnetic fields, within the power system. PQDs encompass fluctuations in these electrical parameters, primarily induced by the operation of nonlinear loads, switching devices, and momentary faults. Such disturbances can negatively impact the reliability, production efficiency, and lifespan of equipment within the power system, as well as the safety of humans and property. The prevalence of renewable energy sources and power electronics devices in electricity grids has given rise to a "dual high" power system, which aims to enhance grid flexibility and cleanliness; however, it may have a detrimental effect on power quality by introducing high-frequency harmonics, spikes, and dips [3]. In complex grid environments, PQDs seldom occur in isolation. Instead, fundamental PQDs often combine, resulting in mixed power quality interference, also known as composite or mixed PQDs.

Additionally, these disturbances often involve transient components. Analyzing and identifying mixed PQDs poses a significant challenge due to the intricate interplay between the characteristics of time and frequency domains. Effective management of PQDs is crucial from a practical standpoint [4].

To address this issue, various signal processing techniques, such as the short-time Fourier transform (STFT) [8], wavelet transform (WT) [6], Hilbert-Huang transform (HHT) [7], empirical mode decomposition (EMD) [8], and variational mode decomposition (VMD) [9], have been employed for parameter detection and feature analysis. Among these techniques, the S-Transform (ST) [10] combines the strengths of STFT, WT, and Fourier transform (FT) [11], while overcoming certain limitations of the wavelet transform. Consequently, the ST has found widespread application in identifying PQDs.

In [12], a PQD classification scheme based on Fully Convolutional Networks (FCN) and Bidirectional Gated Recurrent Units (BiGRU) is proposed. In [13], the authors proposed a method based on deep learning and 2D wavelet scale images for classifying PQDs. These two deep learning models employ distinct classification strategies: the former directly utilizes a 1D sampling sequence, while the latter transforms it into a 2D image using visualization techniques. Compared to the former, the latter strategy can present a more comprehensive representation of the PQD signals’ characteristics, such as spectral and nonlinear features, thereby enhancing the model’s classification performance. Furthermore, converting one-dimensional sequences into two-dimensional images provides superior interpretability, since the PQD signal characteristics can be understood intuitively by observing the generated images.

This paper makes the following contributions: (i) A new method, ST-GSResNet, is proposed for the identification of power quality disturbance signals, which combines the S-Transform with an improved ResNet. This method is robust to noise and maintains satisfactory performance even with noisy labels and noisy environments. (ii) Multi-resolution analysis with the S-Transform is used to encode one-dimensional time series signals into two-dimensional images. (iii) Group convolution replaces the original convolution, and the Squeeze-and-Excitation (SE) module is incorporated into the network, which significantly improves the model’s identification and generalization ability. The proposed method aims to enhance the time-frequency characteristics of power quality disturbance signals and to employ deep residual networks (ResNet) effectively to learn advanced features.

The remainder of this paper is structured as follows. Section II describes the background. Section III describes the research methodology and development. Section IV shows the results and analysis. Finally, Section V concludes this paper.

II RELATED WORK

This section introduces the theoretical topics that are essential for understanding this work. The discussion begins with the S-Transform and then turns to ResNet.

II-A S-Transform (ST)

The S-Transform [14], proposed by Stockwell, Mansinha, and Lowe in 1996, is a mathematical method employed for signal analysis. It processes both continuous-time and discrete-time signals. The principle of the S-Transform involves converting a signal from the time domain (or spatial domain) to a joint time-frequency domain. In this representation, signal characteristics are described by amplitude and phase, facilitating a more thorough understanding and processing of the signal’s frequency components. The S-Transform can be considered a spectral analysis method that combines the characteristics of the short-time Fourier transform and wavelets. It achieves multi-resolution signal analysis effectively while retaining the accurate phase of each frequency component. By utilizing a window whose width is inversely proportional to frequency, the S-Transform provides high time resolution for high-frequency components and high frequency resolution for low-frequency components. Since most complex power quality events are non-stationary, the S-Transform effectively extracts features by employing an adaptable transformation with a Gaussian window.

The formula for the S-Transform is as follows:

$$S(a,b)=\int\left[f(t)\,g^{*}(t-b)\,e^{-2\pi i a t}\right]dt \tag{1}$$

Here, S(a, b) denotes the output of the S-Transform, and a and b are its two parameters. f(t) is the original signal, and g(t) is a Gaussian window whose width varies inversely with frequency; g* denotes its complex conjugate. The parameter a is the frequency, and b is the time shift that positions the window along the signal. The formula shows how the S-Transform analyzes a signal jointly in time and frequency: varying a and b changes the frequency and the time instant under analysis, and because the window width depends on a, the time and frequency resolution adapt to the characteristics of the signal.
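As a rough illustration of Eq. (1), the following Python sketch computes a discrete S-Transform through the FFT. It follows the frequency-domain formulation that is commonly used for the discrete case and is not the authors' MATLAB implementation; the function name `s_transform` and the test signal are assumptions introduced only for this example.

```python
import numpy as np


def s_transform(x):
    """Discrete S-Transform sketch (frequency-domain formulation).

    Returns a complex array of shape (N//2 + 1, N): rows are frequency
    bins, columns are time samples.
    """
    x = np.asarray(x, dtype=float)
    n_samples = x.size
    spectrum = np.fft.fft(x)
    # Integer frequency offsets in FFT order (0, 1, ..., -2, -1).
    m = np.fft.fftfreq(n_samples) * n_samples
    n_freqs = n_samples // 2 + 1
    result = np.zeros((n_freqs, n_samples), dtype=complex)
    # The zero-frequency row is defined as the signal mean.
    result[0, :] = x.mean()
    for n in range(1, n_freqs):
        # Fourier transform of a Gaussian window whose time width is ~1/n.
        gauss = np.exp(-2.0 * np.pi**2 * m**2 / n**2)
        # Shift the spectrum by n bins, weight it, and return to time.
        result[n, :] = np.fft.ifft(np.roll(spectrum, -n) * gauss)
    return result


# Hypothetical test signal: 8 cycles of a unit sinusoid in 128 samples.
signal = np.sin(2 * np.pi * 8 * np.arange(128) / 128)
tf_map = np.abs(s_transform(signal))
```

Taking the magnitude of the result, as in `tf_map`, gives the kind of time-frequency amplitude map that is later rendered as an image.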

II-B ResNet

ResNet [15], an important variant of the convolutional neural network (CNN), is employed to classify the time-frequency magnitude images that describe voltage disturbance signals. Compared to traditional CNNs, ResNet incorporates residual connections that link the inputs and outputs of different layers, aiming to address the vanishing-gradient and exploding-gradient problems that can arise when training many deep neural networks. In a residual network, the inputs and outputs of each stage are referred to as feature maps, generated by a series of convolutional layers, ReLU layers, and pooling layers. The output of each block represents a feature extraction of the input image, enabling the residual network to increase its depth further and achieve superior performance.

Figure 1: Illustration of a typical ResNet architecture.

Fig. 1 illustrates a typical ResNet architecture. By introducing the residual structure, the architecture lets information pass directly between layers. This enhances the stability of the training process and effectively mitigates the vanishing-gradient and exploding-gradient problems.
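The residual structure can be sketched in a few lines. The example below is a deliberately simplified assumption: it uses two dense layers in place of the convolutional layers of a real ResNet block, and the names `residual_block`, `w1`, and `w2` are hypothetical. It only shows the defining step, adding the block input back onto the block output.

```python
import numpy as np


def relu(x):
    return np.maximum(x, 0.0)


def residual_block(x, w1, w2):
    """Compute y = ReLU(F(x) + x), where F is two linear maps with a ReLU between."""
    out = relu(x @ w1)      # first transformation and activation
    out = out @ w2          # second transformation
    return relu(out + x)    # skip connection: add the input back


rng = np.random.default_rng(0)
x = rng.normal(size=(4, 16))               # 4 samples, 16 features
w1 = rng.normal(scale=0.1, size=(16, 16))
w2 = rng.normal(scale=0.1, size=(16, 16))
y = residual_block(x, w1, w2)
```

If the weights are zero, the block reduces to `relu(x)`, which is why stacking such blocks does not prevent the signal (or its gradient) from reaching earlier layers.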

III Methodology

In practical applications, models for identifying power quality disturbance signals face various challenges. Signals acquired from real-world scenarios often encounter substantial noise interference, which substantially impairs the capacity of neural networks to learn features. Convolutional kernels and pooling kernels act as common local feature extractors in neural networks. However, signal noise can significantly impact the networks’ ability to learn features. Consequently, the performance of numerous existing methods for identifying power quality disturbance signals diminishes notably in such scenarios.

Figure 2: The overall framework of ST-GSResNet.

To tackle the aforementioned challenges, this study presents a novel scheme for recognizing PQD signals, termed ST-GSResNet. As shown in Fig. 2, the proposed method consists of two key components: S-Transform for generating images and GSResNet for classifying images. The specific steps are outlined as follows:

  1. The original power quality disturbance signal is transformed using the S-Transform to generate its corresponding image representation.

  2. To classify the generated images, the GSResNet is employed as the classifier.

  3. The model is trained and optimized to enhance recognition accuracy and overall performance.

  4. The practical application involves feeding the power quality disturbance signal into the model and employing ST-GSResNet for recognition.

  5. Based on the identification results, suitable measures are implemented to ensure the stability and reliability of power quality.

III-A Generation of Two-Dimensional Time-Frequency Spectrograms

The generation of data signals constitutes a foundational step in this research, as a substantial amount of data is crucial for obtaining robust training results in deep learning-based models. MATLAB was employed in this study for data signal generation. The signal models adhere to IEEE standards [16]. The fundamental frequency of all PQD models studied is fixed at 50 Hz, and parameters are randomly generated within specified ranges. Table I gives a comprehensive overview of the signal models and standard parameters for single PQD voltages.

TABLE I: SIGNAL MODEL OF SINGLE POWER QUALITY DISTURBANCE
| Labels | PQD types | Numerical Model | Parameters |
| --- | --- | --- | --- |
| V1 | Harmonic | $V(t)=\sin(\omega t)+\alpha_{3}\sin(3\omega t+\varphi_{3})+\alpha_{5}\sin(5\omega t+\varphi_{5})+\alpha_{7}\sin(7\omega t+\varphi_{7})$ | $\alpha_{3}=0\sim 0.15$, $\alpha_{5}=0\sim 0.15$, $\alpha_{7}=0\sim 0.15$, $\varphi_{3}=0\sim 2\pi$, $\varphi_{5}=0\sim 2\pi$, $\varphi_{7}=0\sim 2\pi$ |
| V2 | Sag | $V(t)=\left(1-\alpha\left(u(t-t_{1})-u(t-t_{2})\right)\right)\sin(\omega t)$ | $\alpha=0.1\sim 0.9$, $t_{2}-t_{1}=4T\sim 9T$ |
| V3 | Swell | $V(t)=\left(1+\alpha\left(u(t-t_{1})-u(t-t_{2})\right)\right)\sin(\omega t)$ | $\alpha=0.1\sim 0.9$, $t_{2}-t_{1}=4T\sim 9T$ |
| V4 | Interruption | $V(t)=\left(1-\alpha\left(u(t-t_{1})-u(t-t_{2})\right)\right)\sin(\omega t)$ | $\alpha=0.9\sim 0.1$, $t_{2}-t_{1}=4T\sim 9T$ |
| V5 | Flicker | $V(t)=\left(1+\alpha_{f}\sin(\beta\omega t)\right)\sin(\omega t)$ | $\alpha_{f}=0.3\sim 0.5$, $\beta=0.1\sim 0.4$ |
| V6 | Oscillatory transient (OT) | $V(t)=\sin(\omega t)+\alpha_{2}e^{-\frac{t-t_{3}}{\tau}}\sin\{\omega_{n}(t-t_{3})\}\cdot\{u(t-t_{3})-u(t-t_{4})\}$ | $\alpha_{2}=0.1\sim 0.8$, $\tau=0.008\sim 0.04$, $t_{4}-t_{3}=0.05T\sim 3T$, $f_{n}=300\sim 900~\mathrm{Hz}$ |
| V7 | Impulsive transient (IT) | $V(t)=\sin(\omega t)+\alpha_{2}e^{-\frac{t-t_{3}}{\tau}}\{u(t-t_{3})-u(t-t_{4})\}$ | $\alpha_{2}=1\sim 10$, $\tau=0.008\sim 0.04$, $t_{4}-t_{3}=0.05T\sim 3T$ |

Utilizing the previously mentioned signal model, a MATLAB simulation script enables the generation of a substantial quantity of individual disturbance samples. Fig. 3 illustrates the typical waveforms.
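The following Python sketch shows how two of the single-disturbance models in Table I (harmonic and sag) could be sampled with random parameters. It is an illustration only, not the authors' MATLAB script: the sampling rate of 64 samples per cycle, the record length of 10 cycles, and all function names are assumptions, while the 50 Hz fundamental and the parameter ranges come from the text.

```python
import numpy as np

F0 = 50.0                    # fundamental frequency stated in the text (Hz)
T = 1.0 / F0                 # one fundamental period (s)
SAMPLES_PER_CYCLE = 64       # assumption: not specified in the text
N_CYCLES = 10                # assumption: record length in cycles
FS = SAMPLES_PER_CYCLE * F0  # resulting sampling rate (Hz)


def time_axis():
    return np.arange(N_CYCLES * SAMPLES_PER_CYCLE) / FS


def unit_step(t):
    """u(t): 1 where t >= 0, otherwise 0."""
    return (t >= 0).astype(float)


def harmonic(t, alphas, phis):
    """V1: fundamental plus 3rd, 5th and 7th harmonics."""
    w = 2 * np.pi * F0
    v = np.sin(w * t)
    for order, a, p in zip((3, 5, 7), alphas, phis):
        v = v + a * np.sin(order * w * t + p)
    return v


def sag(t, alpha, t1, t2):
    """V2: amplitude reduced by alpha between t1 and t2."""
    window = unit_step(t - t1) - unit_step(t - t2)
    return (1.0 - alpha * window) * np.sin(2 * np.pi * F0 * t)


def random_sag(rng):
    """Draw sag parameters uniformly from the ranges listed in Table I."""
    t = time_axis()
    alpha = rng.uniform(0.1, 0.9)
    duration = rng.uniform(4 * T, 9 * T)
    t1 = rng.uniform(0.0, N_CYCLES * T - duration)
    return t, sag(t, alpha, t1, t1 + duration)


rng = np.random.default_rng(0)
t, v_sag = random_sag(rng)
```

Calling `random_sag` repeatedly with the same generator yields as many distinct samples as needed; the other models in Table I can be added in the same pattern.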

Figure 3: Typical waves of 7 single PQDs.

This section selects prevalent mixed PQDs along with their corresponding signal models and parameter explanations. These encompass 11 types of compound disturbances: seven dual, three triple, and one quadruple. The fundamental frequency for all the investigated models of PQ disturbances was set at 50 Hz, with parameters being randomly generated within the specified ranges. Table II lists the mixed PQD types.

TABLE II: TYPES OF MIXED POWER QUALITY DISTURBANCES
| Labels | PQD types |
| --- | --- |
| V8 | Harmonics + sag |
| V9 | Harmonics + swell |
| V10 | Interruption + harmonics |
| V11 | Impulsive transient + sag |
| V12 | Impulsive transient + swell |
| V13 | Impulsive transient + flicker |
| V14 | Impulsive transient + harmonics |
| V15 | Harmonics + oscillatory transient + sag |
| V16 | Harmonics + oscillatory transient + swell |
| V17 | Flicker + impulsive transient + harmonics |
| V18 | Harmonics + oscillatory transient + impulsive transient + sag |

Utilizing the previously mentioned mixed disturbance signal models, a MATLAB script can generate a substantial number of mixed disturbance samples. Fig. 4 illustrates the representative waveforms. Each waveform is then transformed into a time-frequency plot, as shown in Fig. 5.

Figure 4: Typical waves of 11 mixed PQDs.

After generating, validating, and collecting the data, the subsequent step is to create a scalogram that characterizes the voltage disturbances in the collected signals. Within the MATLAB environment, time-frequency conversion of the generated signal waveforms is performed using the S-Transform to produce their corresponding spectrograms.

The absolute values of the coefficients generated by the S-Transform are rendered with one of MATLAB’s color mapping functions, such as the "jet 264" style, so that the energy distribution of each signal can be observed in the scalogram. Fig. 5 illustrates the time-frequency plot of the signal V10 generated by the S-Transform compared to the original waveform.

Figure 5: The time-frequency plot of the signal V10, generated by the S-Transform, is compared to the original waveform.

After generation, the images, which contain only the scalogram plots, are saved in specific folders corresponding to each PQ disturbance category. The images are sized to 240×240 pixels and written in PNG format using MATLAB’s "imwrite" command and corresponding supporting code. The generated dataset consists of 18 categories, each containing 1000 images. Subsequently, these images are divided into training and testing sets in a 7:3 ratio for model evaluation.
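The 7:3 division can be sketched as follows. The text does not say whether the split is made per class or over the pooled images; this example assumes a per-class (stratified) split, and the folder layout, file names, and the helper `split_by_class` are hypothetical.

```python
import random


def split_by_class(paths_by_class, train_fraction=0.7, seed=0):
    """Shuffle each class separately and cut it at train_fraction."""
    rng = random.Random(seed)
    train, test = [], []
    for label, paths in sorted(paths_by_class.items()):
        shuffled = list(paths)
        rng.shuffle(shuffled)
        cut = round(len(shuffled) * train_fraction)
        train += [(path, label) for path in shuffled[:cut]]
        test += [(path, label) for path in shuffled[cut:]]
    return train, test


# Hypothetical layout: one folder per class V1..V18 with 1000 PNG files each.
dataset = {
    f"V{k}": [f"V{k}/img_{i:04d}.png" for i in range(1000)]
    for k in range(1, 19)
}
train_set, test_set = split_by_class(dataset)
```

Under these assumptions each class contributes 700 training and 300 testing images, for 12,600 and 5,400 images in total.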

III-B Improved ResNet

The improved ResNet model is used to classify the time-frequency spectrograms of the 18 classes of power quality disturbance signals, which were previously generated with the S-Transform. Fig. 6 presents the overall architecture of the proposed model, with the general design on the left, the specific structure of each stage in the middle, and the bottleneck structure on the right. In this context, C, H, W, and S denote the number of channels, height, width, and stride, respectively.

Figure 6: Structure of the proposed GSResNet model.

1) Group convolution: Convolutional neural networks play a central role in image processing. Convolutional operations have robust feature extraction capabilities and require fewer parameters than fully connected networks, and they are particularly well suited to two-dimensional structured data such as images. The concept of grouped convolution was first presented in AlexNet [17] in 2012. At that time, because of the memory and computational constraints of a single GPU, grouped convolution was implemented by dividing the feature map and convolution kernel into "g" groups along the channel direction. The results obtained from each convolution group are then concatenated to yield the final outcome.

Relative to the traditional convolution operation employed in the ResNet model, grouped convolution augments the network’s ability to model non-linearity and extract intricate features. Additionally, due to the parallelisability of convolution operations, grouped convolution bolsters the computational efficiency of the network. Consequently, this study substitutes the original convolution operation with grouped convolution, thereby further amplifying the capabilities of the original ResNet model.

The standard convolution operation is a fundamental neural network operation used to extract feature information from an input feature map. As illustrated in Fig. 7(a), this operation multiplies a set of learnable convolution kernels element-wise with the input feature map and sums the results to generate the output feature map. Grouped convolution divides the input feature map into "g" groups along the channel dimension and partitions the convolution kernels into "g" groups accordingly. Consequently, the depth of the convolution kernels in each group is reduced, and the output feature map is likewise divided into "g" groups along the channel dimension. Fig. 7 shows the comparison between ordinary convolution and grouped convolution.

Figure 7: A grouped convolution with two filter groups. In each filter group in panel (b), every filter has half the depth of the filters in the standard 2D convolution shown in panel (a). (a) Convolution. (b) Convolution with filter groups.
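The parameter saving of grouped convolution follows from simple counting: each filter only spans the channels of its own group, so the number of weights is divided by g. The short Python sketch below illustrates this; the helper `conv_params` and the channel sizes used in the example are assumptions chosen for illustration and are not taken from the GSResNet configuration.

```python
def conv_params(c_in, c_out, kernel_size, groups=1, bias=False):
    """Number of learnable parameters of a 2D convolution layer."""
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    # Each of the c_out filters spans only c_in / groups input channels.
    weights = (c_in // groups) * c_out * kernel_size * kernel_size
    return weights + (c_out if bias else 0)


standard = conv_params(256, 256, 3)             # ordinary convolution
two_groups = conv_params(256, 256, 3, groups=2)
many_groups = conv_params(256, 256, 3, groups=32)
```

For these illustrative sizes the counts are 589,824 weights for the ordinary convolution, 294,912 with two groups, and 18,432 with 32 groups.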

2) Squeeze-and-Excitation: The core of the SE module [18] is the squeeze-and-excitation block. In the squeeze step, global average pooling condenses each channel’s feature map into a single value, producing a channel descriptor vector. In the excitation step, a small Multi-Layer Perceptron (MLP) models the dependencies among channels and produces a weight vector that signifies channel attention. Specifically, the descriptor is converted into a lower-dimensional intermediate vector through a fully connected layer, followed by an element-wise ReLU activation that introduces nonlinearity. A second fully connected layer then restores the channel dimension, and a sigmoid function turns the output into a vector of channel weights between 0 and 1, which rescales the corresponding channels of the feature maps.

The complete SE module is composed of the combination of Squeeze and Excitation components. The insertion of the SE block into the network’s convolutional layers aids in the representation of feature correlations and differentiations, thereby leading to the automatic adjustment of channel significance. The integration of the SE block resulted in a significant enhancement in the model’s image classification performance. The SE module was designed based on a straightforward yet effective concept, allowing it to be integrated into various CNN architectures. Fig. 8 illustrates the architecture of the SE module when integrated into ResNet.

Figure 8: The architecture of the SE module added to ResNet.
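A minimal NumPy sketch of the squeeze-and-excitation computation is given below. It illustrates the general SE design rather than the authors' exact layer: the reduction ratio of 4, the random weights, and the names `se_block`, `w1`, `b1`, `w2`, and `b2` are assumptions made for this example.

```python
import numpy as np


def se_block(x, w1, b1, w2, b2):
    """Apply channel attention to a feature map x of shape (C, H, W)."""
    # Squeeze: global average pooling gives one value per channel.
    z = x.mean(axis=(1, 2))                              # shape (C,)
    # Excitation: FC -> ReLU -> FC -> sigmoid.
    hidden = np.maximum(w1 @ z + b1, 0.0)                # shape (C // r,)
    scale = 1.0 / (1.0 + np.exp(-(w2 @ hidden + b2)))    # shape (C,)
    # Rescale every channel by its attention weight.
    return x * scale[:, None, None], scale


rng = np.random.default_rng(0)
channels, reduction = 8, 4                # reduction ratio r is an assumption
x = rng.normal(size=(channels, 4, 4))
w1 = rng.normal(size=(channels // reduction, channels))
b1 = np.zeros(channels // reduction)
w2 = rng.normal(size=(channels, channels // reduction))
b2 = np.zeros(channels)
y, weights = se_block(x, w1, b1, w2, b2)
```

The two fully connected layers are the only learnable part of the block, so their size (about 2C²/r weights) is the extra cost of adding channel attention.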

Fundamentally, the SE module plays a pivotal role in enhancing ResNet’s performance. The integration of the SE module allows networks to flexibly select and weight features, subsequently enhancing the model’s capacity for representation and classification. In the context of image classification, the SE module effectively steers the network towards the most pertinent features, thereby facilitating learning and helping it discern complex cases accurately. Moreover, because it is flexible and scalable, the SE module can be readily integrated into existing ResNet architectures. It adds only a small number of parameters and computational operations, so improved model performance is achieved with a minimal increase in computational burden.

3) Nonlinearities: In [19], a nonlinearity termed h-swish was introduced, which proved effective as a drop-in substitute for ReLU and improves the accuracy of neural networks. The nonlinearity is defined as:

$$\text{h-swish}(x)=x\cdot\frac{\text{ReLU6}(x+3)}{6} \tag{2}$$

In previous studies, a nonlinearity termed swish was presented, which, when utilized as a direct replacement for ReLU, offers numerous advantages. The nonlinearity is defined as:

$$\text{Swish}(x)=x\cdot\sigma(x) \tag{3}$$
$$\sigma(x)=\frac{1}{1+e^{-x}} \tag{4}$$

The swish function is unbounded above, smooth, and non-monotonic, properties that together enhance the network’s expressive power. However, evaluating the sigmoid it contains significantly increases computational demands, particularly on mobile devices. To balance model accuracy and computational efficiency, we employ the h-swish activation function. In the swish function, the input x is passed through the sigmoid, and the result is multiplied by x. The function is therefore approximately linear for large positive x and strongly nonlinear near zero. In contrast, the h-swish function replaces the sigmoid with a piecewise linear gate that maps the input to the [0, 1] interval before multiplying it by the original input x. The result is a piecewise approximation of swish that is simpler and cheaper to compute for both small and large x values.
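The relationship between the two activations can be checked numerically. The sketch below assumes the MobileNetV3 form of h-swish, in which the gate is clipped to [0, 1] via ReLU6; the function names and the sample points are illustrative choices.

```python
import math

def relu6(x):
    """ReLU clipped at 6: min(max(x, 0), 6)."""
    return min(max(x, 0.0), 6.0)

def swish(x):
    """Swish: x multiplied by the sigmoid of x."""
    return x / (1.0 + math.exp(-x))

def h_swish(x):
    """Hard swish: x multiplied by a piecewise linear gate in [0, 1]."""
    return x * relu6(x + 3.0) / 6.0

# Compare the two activations at a few sample points.
for x in (-4.0, -1.0, 0.0, 1.0, 4.0):
    print(f"x={x:+.1f}  swish={swish(x):+.4f}  h-swish={h_swish(x):+.4f}")
```

For inputs at or below -3 the gate is 0 and h-swish outputs zero, while for inputs at or above 3 the gate is 1 and h-swish returns the input unchanged. Only the middle region needs a multiplication and an addition, with no exponential.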

IV Experiments

IV-A Experimental Setup

1) Experimental Environment and Training Strategy for Model: During training, we employ the Nadam optimization algorithm with a first-order momentum coefficient of 0.9, a second-order momentum coefficient of 0.999, and a weight decay parameter of 1e-7. We set the initial learning rate to 0.0001 and use a cosine annealing learning rate strategy. This allows the learning rate to vary during training according to a cosine function while maintaining an overall downward trend. The default batch size is 16, and training spans 100 epochs, taking approximately 2 to 3 hours. All experiments were conducted on the dataset we created. The experimental environment is listed in Table III.
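As an illustration of the learning rate strategy, the sketch below evaluates a cosine annealing schedule with the initial learning rate and epoch count stated above. It assumes a single cosine cycle without restarts and a minimum learning rate of zero; neither is specified in the text, and the function name `cosine_annealing_lr` is hypothetical.

```python
import math

def cosine_annealing_lr(epoch, total_epochs=100, base_lr=1e-4, min_lr=0.0):
    """Learning rate at a given epoch for one cosine cycle (no restarts assumed)."""
    cos_term = 0.5 * (1.0 + math.cos(math.pi * epoch / total_epochs))
    return min_lr + (base_lr - min_lr) * cos_term

# Learning rate at every epoch boundary from 0 to 100.
schedule = [cosine_annealing_lr(epoch) for epoch in range(101)]
for epoch in (0, 25, 50, 75, 100):
    print(f"epoch {epoch:3d}: lr = {schedule[epoch]:.2e}")
```

In PyTorch, `torch.optim.NAdam` and `torch.optim.lr_scheduler.CosineAnnealingLR` provide the corresponding optimizer and scheduler, although the exact arguments used in this work are not given here.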

TABLE III: ENVIRONMENT REQUIRED FOR THE EXPERIMENT

| Laboratory Setting | Configuration Information |
|---|---|
| CPU | Intel Core i5-13490F |
| GPU | NVIDIA GeForce RTX 4060 Ti (16 GB) |
| CUDA | 11.6 |
| Operating System | Ubuntu 22.04 |
| Programming Language | Python 3.8 |
| Deep Learning Framework | PyTorch 1.13.1 |

2) Evaluation metrics: Evaluation metrics are essential for assessing the effectiveness of diagnostic algorithms. In intelligent fault diagnosis, accuracy is a widely adopted evaluation metric. It measures the proportion of correct predictions among all samples and thus indicates the classifier’s performance; a higher accuracy typically indicates a better classifier. Accuracy is defined in Eq. (5).

$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \tag{5}$$

The terms TP, TN, FP, and FN denote true positives, true negatives, false positives, and false negatives, respectively, for class i, obtained by comparing the output of the network model with the true label. Accuracy is an intuitive metric: it is the proportion of samples correctly predicted by the model, typically expressed as a percentage, and it communicates the model’s overall performance effectively. However, relying on accuracy alone to evaluate a model lacks rigor and comprehensiveness. Therefore, this paper additionally employs a confusion matrix to assess the classifier’s performance. The confusion matrix offers more information than a single accuracy figure: it shows the confusion between categories, including the proportions of correctly and incorrectly classified cases, and from it we can count the correct and incorrect judgments made by the model on the samples.
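The two metrics can be computed together, since overall accuracy is the trace of the confusion matrix divided by the total number of samples. The following sketch uses a small made-up three-class example; the labels, predictions, and function names are illustrative and are not data from this study.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Rows index the true class, columns index the predicted class."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for true_label, predicted_label in zip(y_true, y_pred):
        matrix[true_label][predicted_label] += 1
    return matrix

def overall_accuracy(matrix):
    """Correct predictions (the diagonal) divided by all predictions."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total

def per_class_accuracy(matrix):
    """Fraction of each true class that was predicted correctly."""
    return [
        row[i] / sum(row) if sum(row) else 0.0
        for i, row in enumerate(matrix)
    ]

# Made-up labels for three classes, used only to exercise the functions.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(y_true, y_pred, num_classes=3)
print(cm)
print(overall_accuracy(cm))
print(per_class_accuracy(cm))
```

The off-diagonal entries show which classes are confused with which, information that the single accuracy value hides.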

Figure 9: GSResNet test results: (a) noiseless, (b) SNR = 40 dB, (c) SNR = 30 dB, (d) SNR = 20 dB.
Figure 10: Panels (a)-(h). The top row shows the t-SNE visualization of the original time-frequency images, while the bottom row shows the t-SNE visualization after model classification. The sequence from left to right corresponds to an SNR ranging from noiseless to 40 dB.

IV-B Robustness to noise

The performance of ST-GSResNet was evaluated at signal-to-noise ratios (SNRs) of 20 dB, 30 dB, and 40 dB, as well as without noise. The dataset comprised 18,000 samples, with each of the 18 PQD categories comprising 1,000 samples. For each category, 700 samples were used for model training and the remaining 300 for testing. Table IV shows that the classification accuracy decreases as the SNR decreases. At 20 dB, the classification accuracies for categories C14 and C17 (rows V14 and V17 in Table IV) drop to 88.3% and 85.3%, respectively. Nevertheless, ST-GSResNet still achieved an overall accuracy of 96.5% at 20 dB.
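The noise levels in this evaluation are specified by SNR in decibels. As a sketch of how a disturbance waveform can be corrupted to a target SNR, the code below adds zero-mean Gaussian white noise whose power is set from the signal power. The Gaussian white noise model, the 50 Hz test sine, the 3.2 kHz sampling rate, and the function names are assumptions made for illustration and are not details taken from the dataset generation in this work.

```python
import math
import random

def add_awgn(signal, snr_db, rng=None):
    """Return the signal plus white Gaussian noise at the requested SNR in dB."""
    if rng is None:
        rng = random.Random(0)  # fixed seed so the example is reproducible
    signal_power = sum(s * s for s in signal) / len(signal)
    # SNR_dB = 10 * log10(P_signal / P_noise), solved for the noise power.
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    sigma = math.sqrt(noise_power)
    return [s + rng.gauss(0.0, sigma) for s in signal]

def measured_snr_db(clean, noisy):
    """Empirical SNR in dB computed from a clean/noisy pair."""
    signal_power = sum(c * c for c in clean)
    noise_power = sum((n - c) ** 2 for c, n in zip(clean, noisy))
    return 10.0 * math.log10(signal_power / noise_power)

# Hypothetical test waveform: ten cycles of a 50 Hz sine sampled at 3.2 kHz.
fs, f0 = 3200, 50
clean = [math.sin(2.0 * math.pi * f0 * n / fs) for n in range(640)]
for snr in (40, 30, 20):
    noisy = add_awgn(clean, snr)
    print(f"target {snr} dB, measured {measured_snr_db(clean, noisy):.2f} dB")
```

Each 10 dB reduction in SNR multiplies the noise power by ten, which is why the 20 dB case is markedly harder than the 40 dB case.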

Additionally, as the signal-to-noise ratio increased to 30 dB, the overall accuracy rose to 98.8%. The results indicate that the CNN model captures fine-grained details even though only the basic S-Transform is used to generate the dataset. Our approach, ST-GSResNet, aims to enhance the model’s recognition ability and exhibits remarkable tolerance to environmental noise. This can be attributed to the use of grouped convolution and the SE module in the model design. Grouped convolution enhances the model’s expressive capability by introducing additional nonlinear transformations. Simultaneously, the SE module dynamically learns the significance of each channel, improving the model’s performance by prioritizing essential feature channels and mitigating the influence of irrelevant information.

The confusion matrices corresponding to Table IV are presented in Fig. 9, which illustrate the accuracy of our proposed method, particularly under noisy conditions. To highlight the robustness of the feature extraction capability of ST-GSResNet, we visualize the feature distributions of the test samples via t-SNE, as depicted in Fig. 10. As can be seen from the top row of Fig. 10, the classification boundaries are indistinguishable when the time-frequency images produced by the ST are compressed directly to a 2D representation. However, as shown in the bottom row of Fig. 10, after training with GSResNet, the features in the final layer become clearly separated. Even at an SNR of 20 dB, the boundaries between categories remain clearly distinguishable. This demonstrates ST-GSResNet’s capacity to extract features that contribute significantly to the accurate classification of power quality disturbance signals.

TABLE IV: THE CLASSIFICATION ACCURACIES OBTAINED UNDER DIFFERENT NOISY ENVIRONMENTS

| Classes | 20 dB | 30 dB | 40 dB | No noise |
|---|---|---|---|---|
| V1 | 98.0% | 96.7% | 100% | 100% |
| V2 | 95.0% | 98.3% | 100% | 100% |
| V3 | 99.7% | 100% | 100% | 100% |
| V4 | 99.3% | 100% | 100% | 100% |
| V5 | 99.3% | 100% | 100% | 100% |
| V6 | 99.3% | 98.7% | 97.7% | 99.0% |
| V7 | 93.3% | 99.7% | 100% | 100% |
| V8 | 100% | 100% | 100% | 100% |
| V9 | 100% | 100% | 100% | 100% |
| V10 | 97.3% | 92.7% | 91.3% | 100% |
| V11 | 98.3% | 100% | 100% | 100% |
| V12 | 99.7% | 100% | 100% | 100% |
| V13 | 95.0% | 99.7% | 100% | 100% |
| V14 | 88.3% | 96.7% | 99.7% | 100% |
| V15 | 99.3% | 99.3% | 100% | 100% |
| V16 | 93.3% | 100% | 100% | 100% |
| V17 | 85.3% | 96.3% | 98.3% | 99.0% |
| V18 | 94.0% | 99.3% | 99.7% | 99.3% |
| Overall | 96.5% | 98.8% | 99.3% | 99.9% |

IV-C Comparison with existing methods

TABLE V: COMPARISON WITH OTHER EXISTING METHODS

| Method | No. of PQDs | No. of Features | Accuracy at 20 dB (%) | Accuracy at 30 dB (%) | Accuracy at 40 dB (%) |
|---|---|---|---|---|---|
| CNN from Scratch [20] | 6 | - | 96.7 | 97.3 | - |
| ST and PNN [21] | 9 | 4 | - | 98.6 | 99.1 |
| ST and NSGA-II [22] | 15 | 26 | 96.4 | 97.3 | 99.4 |
| DWT and PNN [23] | 16 | 9 | 93.6 | 95.2 | 98.6 |
| HHT+WBELM [24] | 15 | 36 | 91.5 | - | 95.6 |
| DBN+ELM [25] | 21 | 12 | 95.8 | 98.2 | 98.7 |
| SWT+EfficientNetB0 [26] | 15 | Auto | 92.8 | 99.0 | 99.2 |
| ST+ResNet50 | 18 | Auto | 90.4 | 94.3 | 96.7 |
| ST+GSResNet | 18 | Auto | 96.5 | 98.8 | 99.3 |

This section compares the proposed method with other schemes for PQD detection and classification. Table V summarizes the comparison. Our proposed method uses the S-Transform (ST) for feature extraction and an enhanced ResNet-based model for classification. It classifies signals with signal-to-noise ratios of 40 dB, 30 dB, and 20 dB with accuracies of 99.3%, 98.8%, and 96.5%, respectively, across 18 PQD categories. Compared with several recent studies, it achieves higher or comparable accuracy at each noise level while handling more, and more complex, PQD categories than most of them.

Most prior investigations considered fewer classes of PQDs and reported lower detection accuracy than our proposed method. Singh and Singh [22] introduced an S-Transform (ST) and NSGA-II-based Randomized Binary Decision Tree (RBDT) classifier to classify 15 classes of PQDs. At 30 dB and 20 dB noise conditions, the accuracies are 97.3% and 96.4%, respectively [22], which are lower than those achieved by our proposed method. In 2017, Khokhar et al. [23] introduced a Discrete Wavelet Transform (DWT) and Probabilistic Neural Network-Artificial Bee Colony (PNN-ABC) based Randomized Binary Decision Tree (RBDT) classifier to classify 16 classes of PQDs. The accuracies were 98.6%, 95.2%, and 93.6% for 40 dB, 30 dB, and 20 dB noise, respectively [23]. Compared to these two schemes, our approach demonstrates higher accuracy at 30 dB and 20 dB noise, while incorporating more categories and more complex signal models from S-Transform-generated time-frequency maps. Sahani and Dash introduced a classifier based on the Hilbert-Huang Transform (HHT) and Weighted Bidirectional Extreme Learning Machine (WBELM) for categorization with 40 dB and 20 dB noise. The accuracies were 95.6% and 91.5%, respectively [24]. Swarnkar et al. designed a multivariable PQ disturbance identification algorithm using a mixture of S-Transform (ST), Hilbert Transform (HT), and Randomized Binary Decision Tree (RBDT). The accuracies were 98.7%, 98.2%, and 95.8% for 40 dB, 30 dB, and 20 dB noise, respectively [25]. These two schemes exhibit lower accuracy than our proposed method. Vishwanath et al. proposed a PQD classification scheme that combines the synchrosqueezed wavelet transform (SWT) and EfficientNetB0 [26]. For comparison, we selected the results from their paper on the dataset that also incorporates white noise. Although our scheme’s performance closely mirrors that of Vishwanath et al. at 30 dB and 40 dB noise, it is nearly 4 percentage points higher at 20 dB noise (96.5% versus 92.8%). This indicates that our scheme exhibits stronger resistance to noise.

Our proposed method outperforms or matches the compared methods, even when tested in noisy environments and on more complex PQD classes. Overall, these results demonstrate the effectiveness of the proposed method for PQD detection and classification.

V Conclusion

In this study, we introduce the ST-GSResNet scheme for PQD recognition and classification and evaluate its performance. The experimental dataset comprises time-frequency images based on the S-Transform. In the improved ResNet model, we use grouped convolution to reduce the number of parameters, model complexity, and computational cost. Grouped convolution also introduces additional nonlinear transformations, enriching feature representations to improve model performance and generalization.

Furthermore, we introduce the SE module to enhance learning and focus on crucial parts of the input data, thereby improving recognition robustness and noise resistance. Compared to alternative deep learning methods, our approach demonstrates advantages such as reduced training time, increased accuracy, and fewer parameters. Future work will proceed in two directions. First, we will gather PQD datasets from various domestic and international regions for comprehensive classification and recognition assessments, further validating the method’s practicality and effectiveness. Second, we plan to employ knowledge distillation to reduce the model’s size further, enabling its operation on smaller embedded devices to meet portability requirements in practical settings.

With these enhancements and future investigations, we expect the ST-GSResNet method to make further progress at the intersection of electrical power engineering and deep learning. We look forward to further exploring and applying the method to provide more reliable and efficient solutions for PQD identification and classification in power systems.

References

  • [1] Y. Wang et al., “IEEE ICHQP2018 and development directions of power quality,” Electr. Power Autom. Equip., vol. 39, pp. 197–203, 2019.
  • [2] H. Lin, “Perfecting power quality indices and prospect,” Proceedings of the CSEE, vol. 34, no. 29, pp. 5073–5079, 2014.
  • [3] Y. Liu, T. Jin, M. A. Mohamed, Q. Wang, “A novel three-step classification approach based on time-dependent spectral features for complex power quality disturbances,” IEEE Trans. Instrum. Meas., vol. 70, pp. 1–14, 2021, doi: https://doi.org/10.1109/TIM.2021.3050187
  • [4] Y. Xu, S. Fan, S. Xie, M. Lu, “Power quality detection and classification in high permeability active distribution network based on IEWT-MFDE,” CSEE J. Power Energy Syst., vol. 8, no. 6, pp. 1–13, 2022, doi: https://doi.org/10.17775/CSEEJPES.2020.00110
  • [5] T. Zhong, S. Zhang, G. Cai, Y. Li, B. Yang, Y. Chen, “Power quality disturbance recognition based on multiresolution S-transform and decision tree,” IEEE Access, vol. 7, pp. 88380–88392, 2019, doi: https://doi.org/10.1109/ACCESS.2019.2924918
  • [6] S. Chen, Z. Li, G. Pan, F. Xu, “Power quality disturbance recognition using empirical wavelet transform and feature selection,” Electr. (Basel), vol. 11, no. 2, pp. 1–20, 2022, doi: https://doi.org/10.3390/electronics11020174
  • [7] P. Li, J. Gao, D. Xu, C. Wang, X. Yang, “Hilbert-Huang transform with adaptive waveform matching extension and its application in power quality disturbance detection for microgrid,” J. Modern Power Syst. Clean Energy, vol. 4, no. 1, pp. 19–27, 2016, doi: https://doi.org/10.1007/s40565-016-0188-5
  • [8] T. Zhong, S. Zhang, G. Cai, Y. Li, B. Yang, Y. Chen, “Power quality disturbance recognition based on multiresolution S-transform and decision tree,” IEEE Access, vol. 7, pp. 88380–88392, 2019, doi: https://doi.org/10.1109/ACCESS.2019.2924918
  • [9] C. Zhao, K. Li, Y. Li, L. Wang, Y. Luo, X. Xu, X. Ding, Q. Meng, “Novel method based on variational mode decomposition and a random discriminative projection extreme learning machine for multiple power quality disturbance recognition,” IEEE Trans. Ind. Inf., vol. 15, no. 5, pp. 2915–2926, 2019, doi: https://doi.org/10.1109/TII.2018.2871253
  • [10] J. Li, Y. Yang, H. Lin, Z. Teng, F. Zhang, Y. Xu, “A voltage sag detection method based on modified S transform with digital prolate Spheroidal window,” IEEE Trans. Power Delivery, vol. 36, no. 2, pp. 997–1006, 2021, doi: https://doi.org/10.1109/TPWRD.2020.2999693
  • [11] S. P. Larissa, V. A. Mateus, A. R. David, D. F. Danton, N. H. Belisário, R. M. S. Leandro, A. D. Carlos, “Compression method of power quality disturbances based on independent component analysis and fast Fourier transform,” Electr. Power Syst. Res., vol. 187, pp. 1–7, 2020, doi: https://doi.org/10.1016/j.epsr.2020.106428
  • [12] Y. Liu, D. Yuan, H. Fan, T. Jin and M. A. Mohamed, “A Multidimensional Feature-Driven Ensemble Model for Accurate Classification of Complex Power Quality Disturbance,” IEEE Transactions on Instrumentation and Measurement, vol. 72, pp. 1–13, 2023, Art no. 1501613, doi: https://doi.org/10.1109/TIM.2023.3265756
  • [13] H. I. Uckol, S. Ilhan, “Identification of corona discharges based on wavelet scalogram images with deep convolutional neural networks,” Electric Power Systems Research, vol. 224, 2023, doi: https://doi.org/10.1016/j.epsr.2023.109712
  • [14] R. G. Stockwell, “Localization of the complex spectrum: The S transform,” IEEE Transactions on Signal Processing, vol. 44, no. 4, pp. 998–1001, 1996.
  • [15] K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 770–778, doi: https://doi.org/10.1109/CVPR.2016.90
  • [16] “IEEE Recommended Practice for Monitoring Electric Power Quality,” in IEEE Std 1159-2019 (Revision of IEEE Std 1159-2009), pp. 1–98, Aug. 13, 2019, doi: https://doi.org/10.1109/IEEESTD.2019.8796486
  • [17] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification with deep convolutional neural networks,” in Advances in Neural Information Processing Systems 25, 2012.
  • [18] J. Hu, L. Shen, and G. Sun, “Squeeze-and-excitation networks,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018.
  • [19] B. Koonce, “MobileNetV3,” in Convolutional Neural Networks with Swift for Tensorflow: Image Recognition and Dataset Categorization, pp. 125–144, 2021.
  • [20] R. S. Salles and P. F. Ribeiro, “The use of deep learning and 2-D wavelet scalograms for power quality disturbances classification,” Electric Power Systems Research, vol. 214, pp. 108834, 2023, doi: https://doi.org/10.1016/j.epsr.2022.108834
  • [21] H. H. Wang, P. Wang, and T. Liu, “Power Quality Disturbance Classification Using the S-transform and probabilistic neural network,” Energies, vol. 10, no. 1, pp. 1–19, 2017.
  • [22] U. Singh and S. N. Singh, “Optimal feature selection via nsga-ii for power quality disturbances classification,” IEEE Trans. Ind. Inf., vol. 14, no. 7, pp. 2994–3002, 2018.
  • [23] S. Khokhar, A. A. M. Zin, A. P. Memon, and A. S. Mokhtar, “A new optimal feature selection algorithm for classification of power quality disturbances using discrete wavelet transform and probabilistic neural network,” Measurement, vol. 95, pp. 246–259, 2017.
  • [24] M. Sahani and P. K. Dash, “Automatic power quality events recognition based on Hilbert Huang transform and weighted bidirectional extreme learning machine,” IEEE Trans. Ind. Informat., vol. 14, no. 9, pp. 3849–3858, Sep. 2018.
  • [25] N. K. Swarnkar, O. P. Mahela, and M. Lalwani, “Multivariable signal processing algorithm for identification of power quality disturbances,” Electr. Power Syst. Res., vol. 221, Aug. 2023, Art. no. 109480.
  • [26] Y. S. U. Vishwanath, S. Esakkirajan, B. Keerthiveena, and R. B. Pachori, “A Generalized Classification Framework for Power Quality Disturbances Based on Synchrosqueezed Wavelet Transform and Convolutional Neural Networks,” IEEE Transactions on Instrumentation and Measurement, vol. 72, pp. 1–13, 2023, Art. no. 2525313.