New Insights into Gas-in-Oil-Based Fault Diagnosis of Power Transformers
Abstract
:1. Introduction
- We introduce two simple, dedicated evaluation metrics that, combined, unveil the actual performance of DGA-based learning algorithms on imbalanced datasets;
- We show that the MG-score turns out to be a good proxy for the proposed dedicated metrics;
- We propose a new set of features based on the classical non-code ratios, outperforming this one;
- We run and analyze a series of empirical experiments to provide clear guidance on the choice of learning models, feature sets, and oversampling techniques for DGA-based fault diagnosis with imbalanced datasets.
2. Related Works
3. Dataset
4. Learning Framework
5. Feature Engineering
- Gas concentrations: This set of features presents the gas concentrations as an input to ML-based DGA. This is a naive choice as it does not require any kind of preprocessing. These features represent an initial scenario in this paper. Hence, the features consist solely of the concentrations of the seven gases in ppm: , , , , , , and .
- Logarithm of gas concentrations: Gas concentrations can vary over a wide range of values. Some samples present ppm values around unity, while others are in the vicinity of thousands of ppm. These variations tend to affect the convergence of some learning models. To reduce this issue, researchers apply the logarithmic transformation to gas concentrations [14]. This procedure results in a set with seven features ( to ) as , .
- Proposed set of features: A redesign of the feature set is proposed based on the nine ratios of the non-code ratios (, ⋯, ). This redesign properly uses the numerators and denominators of these ratios to encompass all the information of the non-code features. In addition, the logarithm is applied to each equation, resulting in a set of eight features (, , , , , , , and ). It is worth noting that applying the logarithmic function reduces the range of values, which is especially beneficial when the data span several orders of magnitude. This characteristic enhances the separability between classes.
6. Performance Metrics
7. Experimental Results and Discussions
7.1. Gas Concentrations
7.2. Logarithm of Gas Concentrations
7.3. Non-Code Ratios
7.4. Logarithm of Non-Code Ratios
7.5. General Analysis
8. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
AI | Artificial Intelligence |
ADASYN | Adaptive Synthetic |
ANN | Artificial Neural Network |
ASMOTE | Adaptive Synthetic Minority Oversampling Technique |
AUC | Area Under The Curve |
BPNN | Backpropagation Neural Network |
bi-MOPSO | binary Multi-Objective Particle Swarm Optimization |
CART | Classification and Regression Trees |
CNN | Convolutional Neural Network |
CURE-SMOTE | Clustering Using SMOTE Representatives |
DT | Decision Tree |
DS | Dempster–Shafer |
DGA | Dissolved Gas Analysis |
ELM-RBF | Extreme Learning Machine–Radial Basis Function |
FSVM | Fuzzy Support Vector Machine |
FCM-FSVM | Fuzzy c-means Clustering-based FSVM |
KFCM-FSVM | Kernel Fuzzy c-means Clustering-based FCM-FSVM |
GA-ANN | Genetic Algorithm and Artificial Neural Network |
GS | Grid Search |
GWO | Grey Wolf Optimization |
k-NN | k-Nearest Neighbors |
LDA | Linear Discriminant Analys |
LSTM | Long Short-Term Memory |
ML | Machine Learning |
MCC | Mattheus Correlation Coefficient |
MWSMOTE | Majority Weighted Minority Oversampling Technique |
NB | Naive Bayes |
NN | Neural Network |
OPF | Optimum-Path Forest |
OPF-US | Optimum-Path Forest-based approach for Undersampling |
RBF | Radial Basis Function |
RF | Random Forest |
SEP | Self-Paced Ensemble |
SOMO | Self-Organizing Map Oversampling |
SMOTE | Synthetic Minority Oversampling Technique |
SVM | Support Vector Machine |
References
- Boczar, T.; Borucki, S.; Zmarzly, D. Application possibilities of artificial neural networks for recognizing partial discharges measured by the acoustic emission method. IEEE Trans. Dielectr. Electr. Insul. 2009, 16, 214–223. [Google Scholar] [CrossRef]
- Hong, K.; Huang, H.; Fu, Y.; Zhou, J. A vibration measurement system for health monitoring of power transformers. Measurement 2016, 93, 135–147. [Google Scholar] [CrossRef]
- Ullah, I.; Yang, F.; Khan, R.; Liu, L.; Yang, H.; Gao, B.; Sun, K. Predictive maintenance of power substation equipment by infrared thermography using a machine-learning approach. Energies 2017, 10, 1987. [Google Scholar] [CrossRef]
- Ganyun, L.V.; Haozhong, C.; Haibao, Z.; Lixin, D. Fault diagnosis of power transformer based on multi-layer SVM classifier. Electr. Power Syst. Res. 2005, 74, 1–7. [Google Scholar] [CrossRef]
- Golarz, J. Understanding dissolved gas analysis (DGA) techniques and interpretations. In Proceedings of the 2016 IEEE/PES Transmission and Distribution Conference and Exposition (T&D), Dallas, TX, USA, 3–5 May 2016; IEEE: New York, NY, USA, 2016; pp. 1–5. [Google Scholar]
- Tra, V.; Duong, B.; Kim, J. Improving diagnostic performance of a power transformer using an adaptive over-sampling method for imbalanced data. IEEE Trans. Dielectr. Electr. Insul. 2019, 26, 1325–1333. [Google Scholar] [CrossRef]
- Zhang, Y.; Chen, H.C.; Du, Y.; Chen, M.; Liang, J.; Li, J.; Fan, X.; Yao, X. Power transformer fault diagnosis considering data imbalance and data set fusion. High Volt. 2021, 6, 543–554. [Google Scholar] [CrossRef]
- Dornenburg, E.; Strittmatter, W. Monitoring oil-cooled transformers by gas-analysis. Brown Boveri Rev. 1974, 61, 238–247. [Google Scholar]
- Rogers, R.R. IEEE and IEC codes to interpret incipient faults in transformers, using gas in oil analysis. IEEE Trans. Electr. Insul. 1978, EI-13, 349–354. [Google Scholar] [CrossRef]
- Duval, M.; Depalba, A. Interpretation of gas-in-oil analysis using new IEC publication 60599 and IEC TC 10 databases. IEEE Electr. Insul. Mag. 2001, 17, 31–41. [Google Scholar] [CrossRef]
- Kim, S.W.; Kim, S.J.; Seo, H.D.; Jung, J.R.; Yang, H.J.; Duval, M. New methods of DGA diagnosis using IEC TC 10 and related databases Part 1: Application of gas-ratio combinations. IEEE Trans. Dielectr. Electr. Insul. 2013, 20, 685–690. [Google Scholar]
- Dai, J.; Song, H.; Jiang, X. Dissolved gas analysis of insulating oil for power transformer fault diagnosis with deep belief network. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 2828–2835. [Google Scholar] [CrossRef]
- Irungu, G.K.; Akumu, A.O.; Munda, J.L. Fault diagnostics in oil filled electrical equipment: Review of duval triangle and possibility of alternatives. In Proceedings of the 2016 IEEE Electrical Insulation Conference (EIC), Montreal, QC, Canada, 19–22 June 2016; pp. 174–177. [Google Scholar]
- Mirowski, P.; Lecun, Y. Statistical machine learning and dissolved gas analysis: A review. IEEE Trans. Power Deliv. 2012, 27, 1791–1799. [Google Scholar] [CrossRef]
- Senoussaoui, M.E.A.; Brahami, M.; Fofana, I. Combining and comparing various machine-learning algorithms to improve dissolved gas analysis interpretation. IET Gener. Transm. Distrib. 2018, 12, 3673–3679. [Google Scholar] [CrossRef]
- Zhang, X.; Zhang, G.; Paul, P.; Zhang, J.; Wu, T.; Fan, S.; Xiong, X. Dissolved Gas Analysis for Transformer Fault Based on Learning Spiking Neural P System with Belief AdaBoost. Int. J. Unconv. Comput. 2021, 16, 239–258. [Google Scholar]
- Ibbahim, S.I.; Ghoneim, S.S.M.; Taha, I.B.M. DGALab: An extensible software implementation for DGA. IET Gener. Transm. Distrib. 2018, 12, 4117–4124. [Google Scholar] [CrossRef]
- Aciu, A.M.; Nicola, C.I.; Nicola, M.; Nitu, M.C. Complementary analysis for DGA based on duval methods and furan compounds using artificial neural networks. Energies 2021, 14, 588. [Google Scholar] [CrossRef]
- Miranda, V.; Castro, A.R.G. Improving the IEC table for transformer failure diagnosis with knowledge extraction from neural networks. IEEE Trans. Power Deliv. 2005, 20, 2509–2516. [Google Scholar] [CrossRef]
- Duval, M. A review of faults detectable by gas-in-oil analysis in transformers. IEEE Electr. Insul. Mag. 2002, 18, 8–17. [Google Scholar] [CrossRef]
- Wang, M.H. A novel extension method for transformer fault diagnosis. IEEE Trans. Power Deliv. 2003, 18, 164–169. [Google Scholar] [CrossRef]
- Gouda, O.E.; Saleh, S.M.; El-Hoshy, S.H. Power transformer incipient faults diagnosis based on dissolved gas analysis. Indones. J. Electr. Eng. Comput. Sci. 2016, 1, 10–16. [Google Scholar]
- Li, Y.; Zhang, X. Improving k nearest neighbor with exemplar generalization for imbalanced classification. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining, Shenzhen, China, 24–27 May 2011; Springer: Berlin/Heidelberg, Germany, 2011; pp. 321–332. [Google Scholar]
- Laszlo, Z.; Torok, L.; Kovacs, G. Improving the performance of the k Rare Class Nearest Neighbor classifier by the ranking of point patterns. In Proceedings of the International Symposium on Foundations of Information and Knowledge Systems, Budapest, Hungary, 14–18 May 2018; Springer: Cham, Switzerland, 2018; pp. 265–283. [Google Scholar]
- Kukar, M.; Kononenko, I. Cost-sensitive learning with neural networks. ECAI 1998, 15, 88–94. [Google Scholar]
- Lomax, S.; Vadera, S. A survey of cost-sensitive decision tree induction algorithms. ACM Comput. Surv. (CSUR) 2013, 45, 1–35. [Google Scholar] [CrossRef]
- Fernández, A.; Garcia, S.; Herrera, F.; Chawla, N.V. SMOTE for learning from imbalanced data: Progress and challenges, marking the 15-year anniversary. J. Artif. Intell. Res. 2018, 61, 863–905. [Google Scholar] [CrossRef]
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
- Ma, L.; Fan, S. CURE-SMOTE algorithm and hybrid algorithm for feature selection and parameter optimization based on random forests. BMC Bioinform. 2017, 18, 169. [Google Scholar] [CrossRef] [PubMed]
- Guha, S.; Rastogi, R.; Shim, K. CURE: An efficient clustering algorithm for large databases. ACM Sigmod Rec. 1998, 27, 73–84. [Google Scholar] [CrossRef]
- Brownlee, J. Imbalanced Classification with Python: Better Metrics, Balance Skewed Classes, Cost-Sensitive Learning; Machine Learning Mastery: San Juan, Puerto Rico, 2020; pp. 48–56. [Google Scholar]
- Ma, H.; Ekanayake, C.; Saha, T.K. Power transformer fault diagnosis under measurement originated uncertainties. IEEE Trans. Dielectr. Electr. Insul. 2012, 19, 1982–1990. [Google Scholar] [CrossRef]
- Ashkezari, A.D.; Ma, H.; Saha, T.K.; Ekanayake, C. Application of fuzzy support vector machine for determining the health index of the insulation system of in-service power transformers. IEEE Trans. Dielectr. Electr. Insul. 2013, 20, 965–973. [Google Scholar] [CrossRef]
- Cui, Y.; Ma, H.; Saha, T. Improvement of power transformer insulation diagnosis using oil characteristics data preprocessed by SMOTEBoost technique. IEEE Trans. Dielectr. Electr. Insul. 2014, 21, 2363–2373. [Google Scholar] [CrossRef]
- Peimankar, A.; Weddell, S.J.; Jalal, T.; Lapthorn, A.C. Ensemble classifier selection using multi-objective PSO for fault diagnosis of power transformers. In Proceedings of the 2016 IEEE Congress on Evolutionary Computation (CEC), Vancouver, BC, Canada, 24–29 July 2016; pp. 3622–3629. [Google Scholar]
- Peimankar, A.; Weddell, S.J.; Jalal, T.; Lapthorn, A.C. Evolutionary multi-objective fault diagnosis of power transformers. Swarm Evol. Comput. 2017, 36, 62–75. [Google Scholar] [CrossRef]
- Liao, W.; Wang, H.; Zhang, J.; Guo, C.; Yao, J.; Jin, Y. An Oil-Immersed Transformer Fault Diagnosis Method Based on Data Preprocessing and Gradient Boosting. In Proceedings of the 2019 IEEE Power and Energy Society General Meeting (PESGM), Atlanta, GA, USA, 4–8 August 2019; pp. 1–5. [Google Scholar]
- Wu, X.; He, Y.; Duan, J. A deep parallel diagnostic method for transformer dissolved gas analysis. Appl. Sci. 2020, 10, 1329. [Google Scholar] [CrossRef]
- Dhini, A.; Faqih, A.; Kusumoputro, B.; Surjandari, I.; Kusiak, A. Data-driven fault diagnosis of power transformers using dissolved gas analysis (DGA). Int. J. Technol. 2020, 11, 388–399. [Google Scholar] [CrossRef]
- Lopes, S.M.A.; Flauzino, R.A.; Altafim, R.A.C. Incipient fault diagnosis in power transformers by data-driven models with over-sampled dataset. Electr. Power Syst. Res. 2021, 201, 107519. [Google Scholar] [CrossRef]
- Jian, T.; Huijuan, H.; Gehao, S.; Xiuchen, J. Transformer Fault Diagnosis Model with Unbalanced Samples Based on SMOTE Algorithm and Focal Loss. In Proceedings of the 2021 4th International Conference on Energy, Electrical and Power Engineering (CEEPE), Chongqing, China, 23–25 April 2021; pp. 693–697. [Google Scholar]
- Dwiputranto, D.T.H.; Setiawan, N.A.; Adji, T.B. DGA-Based Early Transformer Fault Detection using GA-Optimized ANN. In Proceedings of the 2021 International Conference on Technology and Policy in Energy and Electric Power (ICT-PEP), Jakarta, Indonesia, 29–30 September 2021; pp. 342–347. [Google Scholar]
- Passos, L.A.; Jodas, D.S.; Ribeiro, L.C.F.; Akio, M.; Souza, A.N.; Papa, J.P. Handling imbalanced datasets through Optimum-Path Forest. Knowl.-Based Syst. 2022, 242, 108445. [Google Scholar] [CrossRef]
- Jia Yong, H.; Mohd Yousof, M.F.; Abd Rahman, R.; Talib, M.A.; Azis, N. Classification of Fault and Stray Gassing in Transformer by Using Duval Pentagon and Machine Learning Algorithms. Arab. J. Sci. Eng. 2022, 47, 14355–14364. [Google Scholar] [CrossRef]
- Li, X.; Li, Y.; Xu, Y.; Li, R.; Zhang, G. Fault Diagnostics of Oil-immersed Power Transformer via SMOTE and GWO-SVM. In Proceedings of the 2022 4th Asia Energy and Electrical Engineering Symposium (AEEES), Chengdu, China, 25–28 March 2022; pp. 935–939. [Google Scholar]
- IEEE Guide for the Interpretation of Gases Generated in Mineral Oil-Immersed Transformers. In IEEE Std C57.104-2019 (Revision of IEEE Std C57.104-2008); IEEE: New York, NY, USA, 2019; pp. 1–98.
- Ghoneim, S.S.M.; Taha, I.B.M. A new approach of DGA interpretation technique for transformer fault diagnosis. Int. J. Electr. Power Energy Syst. 2016, 81, 265–274. [Google Scholar] [CrossRef]
- Japkowicz, N. Assessment metrics for imbalanced learning. In Imbalanced Learning: Foundations, Algorithms, and Applications; Wiley: Hoboken, NJ, USA, 2013; pp. 187–206. [Google Scholar]
- Tanha, J.; Abdi, Y.; Samadi, N.; Razzaghi, N.; Asadpour, M. Boosting methods for multi-class imbalanced data classification: An experimental review. J. Big Data 2020, 7, 70. [Google Scholar] [CrossRef]
- Lewis, D.D.; Gale, W.A. A sequential algorithm for training text classifiers. In Proceedings of the SIGIR’94, Dublin, Ireland, 3–6 July 1994; Springer: London, UK, 1994; pp. 3–12. [Google Scholar]
- Géron, A. Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems; O’Reilly Media: Sebastopol, CA, USA, 2019. [Google Scholar]
- Kubat, M.; Matwin, S. Addressing the curse of imbalanced training sets: One-sided selection. Icml 1997, 97, 179. [Google Scholar]
- Ynamin, S.; Kamel, M.S.; Wang, Y. Boosting for learning multiple classes with imbalanced class distribution. In Proceedings of the Sixth International Conference on Data Mining (ICDM’06), Hong Kong, China, 18–22 December 2006; pp. 592–602. [Google Scholar]
Ref. | Metrics | Learning Model | Oversampling Algorithm | Classification | Features |
---|---|---|---|---|---|
[14] | AUC, Acc | k-NN, DT, SVM, low-dimensional scaling, NN | Random resampling | Binary | Log of concentrations |
[6] | Average Acc | k-NN, SVM, NN | ASMOTE | Septenary | Non-coded ratios |
[7] | Minority recall, G-score | k-NN, DT, SVM | SMOTE, BorSMOTE, SafeSMOTE ADASYN, MWOTE, CGMOS, MAHAKIL | Binary | Log of concentrations |
[43] | F1-score | Optimum-Path Forest (OPF) | OPF variations for oversampling, SMOTE, Borderline SMOTE, AHC, ADASYN, MWSMOTE, SOMO, k-means SMOTE | Binary | Concentrations |
[38] | Mean Acc over five experiments | k-NN, DT, SVM, NN, CNN, LSTM, fuzzy c-means, deep parallel | ADASYN | Senary | Concentrations |
[40] | Acc, precision, recall, FP rate | Deep NN | SMOTE, Borderline-SMOTE | Senary | Concentrations |
[39] | Acc over ten experiments | SVM, NN, Extreme Learning Machine-RBF | SMOTE, Borderline-SMOTE | Multiple Binary | Concentrations |
[34] | Acc | k-NN, SVM, DT, NN | SMOTE | Quaternary | Gas concentrations, transformer condition, water content, acidity, 2-furfuraldehyde, and others |
[32] | Acc, G-score, sensitivity, specificity over ten experiments | Fuzzy SVM variants | Random oversampling | Quinary | Concentrations |
[35] | Acc, AUC | SVM, Fuzzy k-NN, NN, Naive Bayes, RF | ADASYN | Quaternary | Selected ratios and concentra- tions |
[44] | Acc | Ensemble Learning | SMOTETomek | Ternary | Concentrations |
[45] | Acc | SVM, Grey Wolf Opti- mization-SVM | SMOTE | Senary | Normalized concentration |
[36] | Acc, F1-score, Matthews correlation coefficient | Ensemble Learning | ADASYN | Quaternary | Feature selection from fourteen candidates of ratios and concen- trations |
[33] | Acc | Fuzzy SVM | SMOTE | Quinary | Gas concentrations, water content, dielectric dissipation factor, and others |
[41] | Acc, precision, recall, F1-score | kNN, RF, NN | SMOTE | Nonary | Concentrations |
[37] | Acc | k-NN, RF, NN, Linear Discriminant Analysis | SMOTE | Septenary | Feature selection from fourteen candidates of ratios and concen- trations |
[42] | Acc, precision, recall | Genetic Algorithm + NN | SMOTE | Senary | Eighteen DGA ratios |
Original | Merged | |
---|---|---|
Categories | New categories | Number of samples |
D | 289 | |
T | 234 | |
PD | PD | 43 |
N | N | 62 |
Total | 551 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Laburú, F.M.; Cabral, T.W.; Gomes, F.V.; de Lima, E.R.; Filho, J.C.S.S.; Meloni, L.G.P. New Insights into Gas-in-Oil-Based Fault Diagnosis of Power Transformers. Energies 2024, 17, 2889. https://doi.org/10.3390/en17122889
Laburú FM, Cabral TW, Gomes FV, de Lima ER, Filho JCSS, Meloni LGP. New Insights into Gas-in-Oil-Based Fault Diagnosis of Power Transformers. Energies. 2024; 17(12):2889. https://doi.org/10.3390/en17122889
Chicago/Turabian StyleLaburú, Felipe M., Thales W. Cabral, Felippe V. Gomes, Eduardo R. de Lima, José C. S. S. Filho, and Luís G. P. Meloni. 2024. "New Insights into Gas-in-Oil-Based Fault Diagnosis of Power Transformers" Energies 17, no. 12: 2889. https://doi.org/10.3390/en17122889