Abstract
This work proposes the short-term load forecasting (STLF) using a combination of wavelet transform (WT) and bidirectional gated recurrent unit (BGRU). Selection of the best wavelet basis using the Shannon entropy cost function is introduced in this paper. Since entropy is a measure of the average amount of information, Shannon's entropy has been used to select nodes from the wavelet tree that have more information. The best high- and low-frequency features selected by the Shannon entropy are applied to the BGRU for STLF. In addition, a new time coding approach called the cyclical encoding is designed that appropriately models the periods and time patterns in the electrical load time series. The proposed best-tree wavelet packet transform bidirectional gated recurrent unit (BT-WPT-BGRU) method shows superior performance compared to the wavelet transform and neuro-evolutionary algorithm (WT-NEA), wavelet and collaborative representation transforms (WACRT), convolutional and recurrent neural network (CARNN), WT–BGRU, full wavelet packet transform BGRU (FWPT-BGRU), BT-WPT bidirectional LSTM (BT-WPT-BLSTM) and BT-WPT-BGRU (with one-hot encoding). The BT-WPT-BGRU model performs 71.7%, 58.8%, 58.2%, 17.6%, 12.5%, 12.5% and 6.6% better than WT-NEA, WACRT, CARNN, WT-BGRU, FWPT-BGRU, BT-WPT-BGRU (with one-hot encoding) and BT-WPT-BLSTM in terms of the MAPE metric in ISONE dataset, respectively.
Similar content being viewed by others
Availability of data and materials
No new data are used in this paper. The datasets used for the experiments are benchmark datasets.
Abbreviations
- FFNN:
-
Feed forward neural network
- CNN:
-
Convolutional neural network
- RNN:
-
Recurrent neural network
- LSTM:
-
Long short-term memory
- GRU:
-
Gated recurrent unit
- STLF:
-
Short-term load forecasting
- MTLF:
-
Medium-term load forecasting
- LTLF:
-
Long-term load forecasting
- WT:
-
Wavelet transform
- DWT:
-
Discrete wavelet transform
- FWPT:
-
Full wavelet packet transform
- BT-WPT:
-
Best tree- wavelet packet transform
- AI:
-
Artificial intelligence
- SVR:
-
Support vector regression
- ReLU:
-
Rectified linear unit
- SELU:
-
Scaled exponential linear unit
- MAE:
-
Mean absolute error
- MAPE:
-
Mean absolute percentage error
- RMSE:
-
Root-mean-square error
- C t :
-
Memory cell in time step t in GRU
- h t :
-
Hidden state vector in time step t
- f t :
-
Output of forget gate in LSTM unit
- i t :
-
Output of input gate in LSTM unit
- o t :
-
Output of output gate in LSTM unit
- g t :
-
Output of output node in LSTM unit
- U t :
-
Output of update gate in GRU unit
- R t :
-
Output of reset gate in GRU unit
- F norm :
-
Normalized feature
- Tempnorm :
-
Normalized temperature
- x k :
-
Actual hourly load in kth sample
- \(\mathop x\limits^{\prime }_{k}\) :
-
Predicted hourly load in kth sample
- M :
-
Total number of test samples
- CEM :
-
Cyclical encoding vector
- n M :
-
Number of months of the year
References
Hou H, Liu C, Wang Q, Wu X, Tang J, Shi Y, Xie C (2022) Review of load forecasting based on artificial intelligence methodologies, models, and challenges. Electric Power Syst Res 210:108067
Keshvari R, Imani M, Parsa Moghaddam M (2022) A clustering-based short-term load forecasting using independent component analysis and multi-scale decomposition transform. J Supercomput 78:7908–7935
Papalexopoulos AD, Hesterberg TC (1990) A regression-based approach to short-term system load forecasting. IEEE Trans Power Syst 5(4):1535–1547
Jeong D, Park C, Ko YM (2021) Short-term electric load forecasting for buildings using logistic mixture vector autoregressive model with curve registration. Appl Energy 282(Part B):116249
Imani M (2022) Fuzzy-based weighting long short-term memory network for demand forecasting. J Supercomput
Taylor J (2003) Short-term electricity demand forecasting using double seasonal exponential smoothing. J Oper Res Soc 54:799–805
Taylor JW (2008) An evaluation of methods for very short-term load forecasting using minute-by-minute British data. Int J Forecast 24(4):645–658
Bracale A, Caramia P, De Falco P, Hong T (2020) A multivariate approach to probabilistic industrial load forecasting. Electric Power Syst Res 187:106430
Cao X, Dong S, Wu Z, Jing Y (2015) a data-driven hybrid optimization model for short-term residential load forecasting. In: IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing, Liverpool, pp 283–287
Din GMU, Marnerides AK (2017) Short term power load forecasting using Deep Neural Networks. In: 2017 International Conference on Computing, Networking and Communications (ICNC), Santa Clara, CA , pp 594–598
Hoori AO, Motai Y (2018) Multicolumn RBF Network. IEEE Trans Neural Netw Learn Syst 29(4):766–778
Yu H, Reiner PD, Xie T, Bartczak T, Wilamowski BM (2014) An incremental design of radial basis function networks. IEEE Trans Neural Netw Learn Syst 25(10):1793–1803
Amarasinghe K, Marino DL, Manic M (2017) Deep neural networks for energy load forecasting. In: 2017 IEEE 26th international symposium on industrial electronics (ISIE), Edinburgh, pp.1483–1488
Khuntia SR, Rueda JL, van der Meijden MAMM (2016) Forecasting the load of electrical power systems in mid and long-term horizons: a review. IET Gener Transm Distrib 10(16):3971–3977
Li S, Goel L, Wang P (2016) An ensemble approach for short-term load forecasting by extreme learning machine. Appl Energy 170:22–29
Kamruzzaman M, Bhusal N, Benidris M (2022) A convolutional neural network-based approach to composite power system reliability evaluation. Int J Electr Power Energy Syst 135:107468
Shi H, Xu M, Li R (2018) Deep learning for household load forecasting-a novel pooling deep RNN. IEEE Trans Smart Grid 9(5):5271–5280
Imani M (2019) Long short-term memory network and support vector regression for electrical load forecasting. In: 2019 International Conference on Power Generation Systems and Renewable Energy Technologies (PGSRET), Istanbul, Turkey, pp 1–6
Imani M, Ghassemian H (2019) Sequence to image transform based convolutional neural network for load forecasting. In: 2019 27th Iranian Conference on Electrical Egineering (ICEE), Yazd, Iran, pp 1362–1366
Shafiei Chafi Z, Afrakhte H (2021) Short-term load forecasting using neural network and particle swarm optimization (PSO) algorithm. Mathematical Problems in Engineering, vol. 2021
Kong Z, Zhang C, Lv H, Xiong F, Fu Z (2020) Multimodal feature extraction and fusion deep neural networks for short-term load forecasting. IEEE Access 8:185373–185383
Alhussein M, Aurangzeb K, Haider SI (2020) Hybrid CNN-LSTM model for short-term individual household load forecasting. IEEE Access 8:180544–180557
Gao Y, Fang Y, Dong H, Kong Y (2020) A multifactorial framework for short-term load forecasting system as well as the jinan’s case study. IEEE Access 8:203086–203096
Guo W, Che L, Shahidehpour M, Wan X (2021) Machine-learning based methods in short-term load forecasting. Electric J 34(1), Article ID 106884
Aly HHH (2020) A proposed intelligent short-term load forecasting hybrid models of ANN, WNN and KF based on clustering techniques for smart grid. Electr Power Syst Res 182, Article ID 106191
Saroha S, Zurek-Mortka M, Szymanski JR, Shekher V, Singla P (2021) Forecasting of market clearing volume using wavelet packet-based neural networks with tracking signals. Energies 14(19):6065
Singla P, Duhan M, Saroha S (2022) A hybrid solar irradiance forecasting using full wavelet packet decomposition and bi-directional long short-term memory (BiLSTM). Arab J Sci Eng 47:14185–14211
Chen Y, Luh PB, Guan C, Zhao Y, Michel LD, Coolbeth MA, Friedland PB, Rourke SJ (2010) Short-term load forecasting: similar day-based wavelet neural network. IEEE Trans Power Syst 25:322–330
Vautrin D, Artusi X, Lucas M-F, Farina D (2009) A novel criterion of wavelet packet best basis selection for signal classification with application to brain-computer interfaces. IEEE Trans Biomed Eng 56:2734–2738
Saito N, Coifman RR (1997) Local discriminant bases, In: Proceedings of the SPIE wavelet applications in signal and image processing.
Mallat SG (1989) A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans Pattern Anal Mach Intell 11:674–693
Kováˇc S, Conok GM, Halenár I, Važan P (2021) Comparison of heat demand prediction using wavelet analysis and neural network for a district heating network. Energies 14:1545
Coifman RR, Wickerhauser MV (1992) Entropy-based algorithms for best basis selection. IEEE Trans Inf Theory 38:713–718
Hochreiter S, Bengio Y, Frasconi P, Schmidhuber J (2001) Gradient flow in recurrent nets: the difficulty of learning long-term dependencies
Graves A (2013) Generating sequences with recurrent neural networks." arXiv preprint arXiv:1308.0850
Chung J, Gulcehre C, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling, arXiv preprint arXiv:1412.3555
Klambauer G, Unterthiner T, Mayr A, Hochreiter S (2017) Self- normalizing neural networks, Adv Neural Inf Processi Syst, 972–981
Deihimi A, Showkati H (2012) Application of echo state networks in short-term electric load forecasting. Energy 39:327–340
Reis AJR, da Silva APA (2005) Feature extraction via multiresolution analysis for short-term load forecasting. IEEE Trans Power Syst 20:189–198
Amjady N, Keynia F (2009) Short-term load forecasting of power systems by combination of wavelet transform and neuro-evolutionary algorithm. Energy 34:46–57
Ceperic E, Ceperic V, Baric A (2013) A strategy for short-term load forecasting by support vector regression machines. IEEE Trans Power Syst 28:4356–4364
Hu Z, Bao Y, Xiong T (2014) Comprehensive learning particle swarm optimization based memetic algorithm for model selection in short-term load forecasting using support vector regression. Appl Soft Comput 25:15–25
Imani M, Ghassemian H (2019) Residential load forecasting using wavelet and collaborative representation transforms. Appl Energy, 253
Eskandari H, Imani M, Moghaddam MP (2021) Convolutional and recurrent neural network based model for short-term load forecasting. Electr Power Syst Res 195, Article ID 107173
Guan C, Luh PB, Michel LD, Wang Y, Friedland PB (2013) Very short-term load forecasting: wavelet neural networks with data pre-filtering. IEEE Trans Power Syst 28:30–41
Shamsollahi P, Cheung KW, Chen Q, Germain EH (2001) A neural network based very short term load forecaster for the interim ISO New England electricity market system. In: 22nd IEEE PES International Conference on Power Industry. Computer Applications, pp 217–222
Li S, Wang P, Goel L (2015) Short-term load forecasting by wavelet transform and evolutionary extreme learning machine. Electric Power Syst Res 122:96–103
Funding
There is no funding to declare.
Author information
Authors and Affiliations
Contributions
H.E. contributed to methodology; software; validation; and writing. M.I. contributed to conceptualization; editing; and supervision. M.P.M. contributed to review and supervision.
Corresponding author
Ethics declarations
Conflict of interests
The authors declare no competing interests.
Ethics statement
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Eskandari, H., Imani, M. & Parsa Moghaddam, M. Best-tree wavelet packet transform bidirectional GRU for short-term load forecasting. J Supercomput 79, 13545–13577 (2023). https://doi.org/10.1007/s11227-023-05193-4
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-023-05193-4