Abstract
Effectively detecting anomalies for multivariate time series is of great importance for the modern industrial system. Recently, reconstruction-based deep learning methods have been widely used in time series anomaly detection. However, the rich local and global characteristics of time series may not be well captured by methods that compress and reconstruct time series through a single-scale neural network. In addition, under the influence of a complex environment, small fluctuations occur during the normal operation of the system, which will also bring challenges for those methods to reconstruct exact values. In this paper, we propose an unsupervised multivariate time series anomaly detection method based on a probabilistic autoencoder with multi-scale feature extraction (PAMFE). The multiple parallel dilated convolutions with different dilation factors and feature fusion module enable PAMFE to capture overall and detailed information of time series to identify various types of anomalies better. Furthermore, considering the normal fluctuation of data, we reconstruct the expected distribution of input and calculate the anomaly score based on the probability that the input belongs to the distribution. Extensive experiments on four publicly real-world datasets demonstrate that PAMFE outperforms state-of-the-art methods in F1-Score. Moreover, we investigate the contributions of the major components of PAMFE, and the experimental results show that they all contribute to performance improvement.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Liu H, Zheng C, Li D et al (2021) EDMF: efficient deep matrix factorization with review feature learning for industrial recommender system. IEEE Trans Indust Inform 18(7):4361–4371. https://doi.org/10.1109/TII.2021.3128240
Liu H, Zheng C, Li D et al (2022) Multi-perspective social recommendation method with graph representation learning. Neurocomputing 468:469–481. https://doi.org/10.1016/j.neucom.2021.10.050https://doi.org/10.1016/j.neucom.2021.10.050
Li Z, Liu H, Zhang Z et al (2021) Learning knowledge graph embedding with heterogeneous relation attention networks. IEEE Trans Neural Netw Learn Syst, https://doi.org/10.1109/TNNLS.2021.3055147https://doi.org/10.1109/TNNLS.2021.3055147
Chen Z, Chen D, Zhang X et al (2022) Learning graph structures with transformer for multivariate time-series anomaly detection in IoT. IEEE Internet Things J 9(12):9179–9189. https://doi.org/10.1109/JIOT.2021.3100509
Cai Z, Zheng X (2018) A private and efficient mechanism for data uploading in smart cyber-physical systems. IEEE Trans Netw Sci Eng 7(2):766–775. https://doi.org/10.1109/TNSE.2018.2830307
Blázquez-García A, Conde A, Mori U et al (2021) A review on outlier/anomaly detection in time series data. ACM Comput Surveys (CSUR) 54(3):1–33. https://doi.org/10.1145/3444690
Xu H, Chen W, Zhao N et al (2018) Unsupervised anomaly detection via variational auto-encoder for seasonal kpis in web applications. In: Proceedings of the 2018 world wide web conference, pp 187–196, https://doi.org/10.1145/3178876.3185996
Zhou B, Liu S, Hooi B et al (2019) Beatgan: anomalous rhythm detection using adversarially generated time series. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence, pp 4433–4439, https://doi.org/10.24963/ijcai.2019/616
Pang G, Shen C, Cao L et al (2021) Deep learning for anomaly detection: a review. ACM Computing Surveys (CSUR) 54(2):1–38. https://doi.org/10.1145/3439950
Park D, Hoshi Y, Kemp C C (2018) A multimodal anomaly detector for robot-assisted feeding using an lstm-based variational autoencoder. IEEE Robot Autom Lett 3(3):1544–1551. https://doi.org/10.1109/LRA.2018.2801475
Audibert J, Michiardi P, Guyard F et al (2020) Usad: unsupervised anomaly detection on multivariate time series. In: Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining, pp 3395–3404. https://doi.org/10.1145/3394486.3403392
Liang H, Song L, Wang J et al (2021) Robust unsupervised anomaly detection via multi-time scale dcgans with forgetting mechanism for industrial multivariate time series. Neurocomputing 423:444–462. https://doi.org/10.1016/j.neucom.2020.10.084
Jiang J, Yasakethu L (2013) Anomaly detection via one class svm for protection of scada systems. In: 2013 international conference on cyber-enabled distributed computing and knowledge discovery. IEEE, pp 82–88, https://doi.org/10.1109/CyberC.2013.22
Liu FT, Ting KM, Zhou ZH (2008) Isolation forest. In: 2008 eighth IEEE international conference on data mining. IEEE, pp 413–422, https://doi.org/10.1109/ICDM.2008.17
Liu H, Liu T, Zhang Z et al (2022a) ARHPE: asymmetric relation-aware representation learning for head pose estimation in industrial human–computer interaction. https://doi.org/10.1109/TII.2022.3143605, vol 18, pp 7107–7117
Liu H, Liu T, Chen Y et al (2022b) EHPE: skeleton cues-based gaussian coordinate encoding for efficient human pose estimation. IEEE Trans Multi:1–12. https://doi.org/10.1109/TMM.2022.3197364https://doi.org/10.1109/TMM.2022.3197364
Wu J, Zeng W, Yan F (2018) Hierarchical temporal memory method for time-series-based anomaly detection. Neurocomputing 273:535–546. https://doi.org/10.1016/j.neucom.2017.08.026
Hundman K, Constantinou V, Laporte C et al (2018) Detecting spacecraft anomalies using lstms and nonparametric dynamic thresholding. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pp 387–395, https://doi.org/10.1145/3219819.3219845
Lai G, Chang WC, Yang Y et al (2018) Modeling long-and short-term temporal patterns with deep neural networks. In: The 41st international ACM SIGIR conference on research & development in information retrieval, pp 95–104, https://doi.org/10.1145/3209978.3210006
Malhotra P, Ramakrishnan A, Anand G (2016) Lstm-based encoder-decoder for multi-sensor anomaly detection. In: Anomaly detection workshop at 33rd ICML
Zong B, Song Q, Min MR et al (2018) Deep autoencoding gaussian mixture model for unsupervised anomaly detection. In: International conference on learning representations, pp 1–19
Thill M, Konen W, Wang H et al (2021) Temporal convolutional autoencoder for unsupervised anomaly detection in time series. Appl Soft Comput 112:107,751. https://doi.org/10.1016/j.asoc.2021.107751https://doi.org/10.1016/j.asoc.2021.107751
oodfellow I, Pouget-Abadie J, Mirza M et al (2014) Generative adversarial nets. In: Advances in neural information processing systems, pp 2672–2680
Tuli S, Casale G, Jennings N R (2022) TranAD: deep transformer networks for Anomaly Detection in multivariate time series data. Proc VLDB Endow 15(6):1201—1214. https://doi.org/10.14778/3514061.3514067https://doi.org/10.14778/3514061.3514067
Su Y, Zhao Y, Niu C et al (2019) Robust anomaly detection for multivariate time series through stochastic recurrent neural network. In: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pp 2828–2837, https://doi.org/10.1145/3292500.3330672
Malhotra P, Vig L, Shroff G et al (2015) Long short term memory networks for anomaly detection in time series. In: European symposium on artificial neural networks, pp 89–94
Vaswani A, Shazeer N, Parmar N et al (2017) Attention is all you need. Adv Neural Inform Process Syst:30
Li T, Comer ML, Delp EJ et al (2020) Anomaly scoring for prediction-based anomaly detection in time series. In: 2020 IEEE aerospace conference. IEEE, pp 1–7, https://doi.org/10.1109/AERO47225.2020.9172442
Li X, Luan Y, Chen L (2022) Semi-supervised GAN with similarity constraint for mode diversity. Appl Intell, pp 1–14, https://doi.org/10.1007/s10489-022-03771-2
He Q, Zheng Y, Zhang C et al (2020) Mtad-tf: multivariate time series anomaly detection using the combination of temporal pattern and feature pattern. Complexity 2020:1–9. https://doi.org/10.1155/2020/8846608
Wang X, Han J, Li B et al (2021) Automatic ICD-10 coding based on multi-head attention mechanism and gated residual network. In: 2021 IEEE international conference on bioinformatics and biomedicine (BIBM), pp 536–543, https://doi.org/10.1109/BIBM52615.2021.9669625
Vincent P, Larochelle H, Lajoie I et al (2010) Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J Mach Learn Res 11(12):3371–3408
Tao Q, Liu F, Li Y et al (2019) Air pollution forecasting using a deep learning model based on 1d convnets and bidirectional gru. IEEE Access 7:76,690–76,698. https://doi.org/10.1109/ACCESS.2019.2921578
Siffer A, Fouque PA, Termier A et al (2017) Anomaly detection in streams with extreme value theory. In: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1067–1075, https://doi.org/10.1145/3097983.3098144
Pot A, Porkert J, Keijzer M (2019) The bidirectional in bilingual: cognitive, social and linguistic effects of and on third-age language learning. Behavior Sci 9(9):98. https://doi.org/10.3390/bs9090098
Sun X, Gao Y, Sutcliffe R et al (2019) Word representation learning based on bidirectional grus with drop loss for sentiment classification. IEEE Trans Syst, Man, Cybern: Syst 51(7):4532–4542. https://doi.org/10.1109/TSMC.2019.2940097
Liu G (2019) Bidirectional lstm with attention mechanism and convolutional layer for text classification. Neurocomputing 337:325–338. https://doi.org/10.1016/j.neucom.2019.01.078
Schuster M, Paliwal K K (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45(11):2673–2681. https://doi.org/10.1109/78.650093
Sun Y, Xue B, Zhang M et al (2020) Automatically designing CNN architectures using the genetic algorithm for image classification. IEEE Trans Cybern 50(9):3840–3854. https://doi.org/10.1109/TCYB.2020.2983860https://doi.org/10.1109/TCYB.2020.2983860
Munir M, Siddiqui S A, Dengel A et al (2018) Deepant: a deep learning approach for unsupervised anomaly detection in time series. IEEE Access 7:1991–2005. https://doi.org/10.1109/ACCESS.2018.2886457https://doi.org/10.1109/ACCESS.2018.2886457
Ratul M A R, Mozaffari M H, Lee W S et al (2020) Skin lesions classification using deep learning based on dilated convolution. BioRxiv p 860700, https://doi.org/10.1101/860700
Murphy KP (2022) Probabilistic machine learning: an introduction. MIT Press, USA
Rootzén H, Segers J, Wadsworth J L (2018) Multivariate generalized Pareto distributions: parametrizations, representations, and properties. J Multivariate Anal 165:117–131. https://doi.org/10.1016/j.jmva.2017.12.003https://doi.org/10.1016/j.jmva.2017.12.003
Mathur AP, Tippenhauer NO (2016) Swat: a water treatment testbed for research and training on ics security. In: 2016 international workshop on cyber-physical systems for smart water networks. IEEE, pp 31–36, https://doi.org/10.1109/CySWater.2016.7469060https://doi.org/10.1109/CySWater.2016.7469060
Ahmed CM, Palleti VR, Mathur AP (2017) Wadi: a water distribution testbed for research in the design of secure cyber physical systems. In: Proceedings of the 3rd international workshop on cyber-physical systems for smart water networks, pp 25–28, https://doi.org/10.1145/3055366.3055375
Abdulaal A, Liu Z, Lancewicki T (2021) Practical approach to asynchronous multivariate time series anomaly detection and localization. In: Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining, pp 2485–2494, https://doi.org/10.1145/3447548.3467174
Abadi M, Agarwal A, Barham P et al (2015) TensorFlow: large-scale machine learning on heterogeneous systems. Available at: http://tensorflow.org
Chollet F et al (2015) Keras. Available at: https://github.com/fchollet/keras
Nothaft FA, Massie M, Danford T et al (2015) Rethinking data-intensive science using scalable analytics systems. In: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, pp 631–646, https://doi.org/10.1145/2723372.2742787https://doi.org/10.1145/2723372.2742787
Fang W, Zhang F, Sheng V S et al (2018) A method for improving CNN-based image recognition using DCGAN. Comput, Materials Continua 57(1):167–178. https://doi.org/10.32604/cmc.2018.02356https://doi.org/10.32604/cmc.2018.02356
Acknowledgements
The authors would like to thank their colleagues from the machine learning group for discussions on this paper.
Author information
Authors and Affiliations
Contributions
Guangyao Zhang: Methodology, Software, Writing - Original Draft, Writing - Review & Editing. Xin Gao: Conceptualization, Methodology, Supervision, Writing - Original Draft, Writing - Review & Editing. Lei Wang: Conceptualization, Resources. Bing Xue: Software, Validation. Shiyuan Fu: Software, Validation. Jiahao Yu: Software, Writing - Review & Editing. Zijian Huang: Writing - Review & Editing. Xu Huang: Writing - Review & Editing.
Corresponding author
Ethics declarations
Competing Interests
The authors have no competing interests to declare that are relevant to the content of this article.
Additional information
Data Availabilty
The datasets supporting the results of this article are SMD, SWaT, WADI and PSM public datasets, and the authors confirm that the datasets are indicated in the reference list.
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, G., Gao, X., Wang, L. et al. Probabilistic autoencoder with multi-scale feature extraction for multivariate time series anomaly detection. Appl Intell 53, 15855–15872 (2023). https://doi.org/10.1007/s10489-022-04324-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-04324-3