Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3583780.3615167acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Unsupervised Anomaly Detection & Diagnosis: A Stein Variational Gradient Descent Approach

Published: 21 October 2023 Publication History

Abstract

Detecting and diagnosing anomalies in observational data plays a crucial role in various real-world applications, such as e-commerce applet maintenance. Unsupervised machine learning techniques are typically employed for anomaly detection and diagnosis due to their convenience and independence from labeled data. Density estimation (DE), as one of the most widely used unsupervised machine learning techniques for anomaly detection, can be categorized into kernel density estimation (KDE)-based methods and normalizing flow (NF)-based methods. While KDE-based methods offer fast computation speed, they often ignore the complex manifold structure present in observational data. On the other hand, NF-based methods address the manifold issue but suffer from longer computation times. In this study, we propose a novel DE-based anomaly detection & diagnosis method using Stein Variational Gradient Descent (SVGD), aiming to leverage the strengths of KDE and NF approaches. Firstly, we rigorously derive the DE capability of SVGD through mathematical analysis. Subsequently, we demonstrate the ability of the SVGD method to perform anomaly diagnosis based on input feature attribution. Finally, to validate the effectiveness of our approach, we conduct experiments using synthetic, benchmark, and industrial datasets. The results demonstrate the superior performance and practical applicability of our proposed method.

Supplementary Material

MP4 File (SteinVGD.mp4)
Presentation video for paper with title ``Unsupervised Anomaly Detection & Diagnosis: A Stein Variational Gradient Descent Approach"

References

[1]
Ahmed Abdulaal, Zhuanghua Liu, and Tomer Lancewicki. 2021. Practical approach to asynchronous multivariate time series anomaly detection and localization. In Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining. 2485--2494.
[2]
Alexander Alexandrov, Konstantinos Benidis, Michael Bohlke-Schneider, Valentin Flunkert, Jan Gasthaus, Tim Januschowski, Danielle C. Maddix, Syama Rangapuram, David Salinas, Jasper Schulz, Lorenzo Stella, Ali Caner Türkmen, and Yuyang Wang. 2020. GluonTS: Probabilistic and Neural Time Series Modeling in Python. Journal of Machine Learning Research, Vol. 21, 116 (2020), 1--6. http://jmlr.org/papers/v21/19--820.html
[3]
Mislav Balunovic, Anian Ruoss, and Martin Vechev. 2021. Fair Normalizing Flows. In International Conference on Learning Representations.
[4]
Songqiao Han, Xiyang Hu, Hailiang Huang, Mingqi Jiang, and Yue Zhao. 2022. ADBench: Anomaly Detection Benchmark. In Neural Information Processing Systems (NeurIPS).
[5]
Kyle Hundman, Valentino Constantinou, Christopher Laporte, Ian Colwell, and Tom Soderstrom. 2018. Detecting spacecraft anomalies using lstms and nonparametric dynamic thresholding. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining. 387--395.
[6]
Longyuan Li, Junchi Yan, Qingsong Wen, Yaohui Jin, and Xiaokang Yang. 2023. Learning Robust Deep State Space for Unsupervised Anomaly Detection in Contaminated Time-Series. IEEE Transactions on Knowledge and Data Engineering, Vol. 35, 6 (2023), 6058--6072. https://doi.org/10.1109/TKDE.2022.3171562
[7]
Shiyang Li, Xiaoyong Jin, Yao Xuan, Xiyou Zhou, Wenhu Chen, Yu-Xiang Wang, and Xifeng Yan. 2019. Enhancing the locality and breaking the memory bottleneck of transformer on time series forecasting. Advances in neural information processing systems, Vol. 32 (2019).
[8]
Yingzhen Li and Richard E Turner. 2017. Gradient estimators for implicit models. arXiv preprint arXiv:1705.07107 (2017).
[9]
Qiang Liu. 2017. Stein variational gradient descent as gradient flow. Advances in neural information processing systems, Vol. 30 (2017).
[10]
Qiang Liu and Dilin Wang. 2016. Stein Variational Gradient Descent: A General Purpose Bayesian Inference Algorithm. In Proceedings of the 30th International Conference on Neural Information Processing Systems (Barcelona, Spain) (NIPS'16). Curran Associates Inc., Red Hook, NY, USA, 2378--2386.
[11]
George Papamakarios, Eric Nalisnick, Danilo Jimenez Rezende, Shakir Mohamed, and Balaji Lakshminarayanan. 2021a. Normalizing Flows for Probabilistic Modeling and Inference. Journal of Machine Learning Research, Vol. 22, 57 (2021), 1--64. http://jmlr.org/papers/v22/19--1028.html
[12]
George Papamakarios, Eric Nalisnick, Danilo Jimenez Rezende, Shakir Mohamed, and Balaji Lakshminarayanan. 2021b. Normalizing flows for probabilistic modeling and inference. The Journal of Machine Learning Research, Vol. 22, 1 (2021), 2617--2680.
[13]
Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, et al. 2011. Scikit-learn: Machine learning in Python. Journal of machine learning research, Vol. 12, Oct (2011), 2825--2830.
[14]
Zheyang Shen, Markus Heinonen, and Samuel Kaski. 2021. De-randomizing MCMC dynamics with the diffusion Stein operator. Advances in Neural Information Processing Systems, Vol. 34 (2021), 17507--17517.
[15]
Ya Su, Youjian Zhao, Chenhao Niu, Rong Liu, Wei Sun, and Dan Pei. 2019. Robust anomaly detection for multivariate time series through stochastic recurrent neural network. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining. 2828--2837.
[16]
Mukund Sundararajan, Ankur Taly, and Qiqi Yan. 2017. Axiomatic attribution for deep networks. In International conference on machine learning. PMLR, 3319--3328.
[17]
Haixu Wu, Tengge Hu, Yong Liu, Hang Zhou, Jianmin Wang, and Mingsheng Long. 2023. TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis. In International Conference on Learning Representations.
[18]
Haixu Wu, Jiehui Xu, Jianmin Wang, and Mingsheng Long. 2021. Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Advances in Neural Information Processing Systems, Vol. 34 (2021), 22419--22430.
[19]
Ailing Zeng, Muxi Chen, Lei Zhang, and Qiang Xu. 2023. Are Transformers Effective for Time Series Forecasting? Proceedings of the AAAI Conference on Artificial Intelligence.
[20]
Yue Zhao, Zain Nasrullah, and Zheng Li. 2019. PyOD: A Python Toolbox for Scalable Outlier Detection. Journal of Machine Learning Research, Vol. 20, 96 (2019), 1--7. http://jmlr.org/papers/v20/19-011.html
[21]
Yunyi Zhou, Zhixuan Chu, Yijia Ruan, Ge Jin, Yuchen Huang, and Sheng Li. 2023. pTSE: A Multi-model Ensemble Method for Probabilistic Time Series Forecasting. arXiv preprint arXiv:2305.11304 (2023).

Cited By

View all
  • (2025)Improving Data-Driven Inferential Sensor Modeling by Industrial Knowledge: A Bayesian PerspectiveIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2024.349307155:2(1043-1055)Online publication date: Feb-2025
  • (2024)SPOT-I: Similarity Preserved Optimal Transport for Industrial IoT Data ImputationIEEE Transactions on Industrial Informatics10.1109/TII.2024.345224120:12(14421-14429)Online publication date: Dec-2024
  • (2024)An Accurate and Interpretable Framework for Trustworthy Process MonitoringIEEE Transactions on Artificial Intelligence10.1109/TAI.2023.33196065:5(2241-2252)Online publication date: May-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
October 2023
5508 pages
ISBN:9798400701245
DOI:10.1145/3583780
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 October 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. anomaly detection
  2. anomaly diagnosis
  3. density estimation
  4. variational inference

Qualifiers

  • Short-paper

Conference

CIKM '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)109
  • Downloads (Last 6 weeks)10
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Improving Data-Driven Inferential Sensor Modeling by Industrial Knowledge: A Bayesian PerspectiveIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2024.349307155:2(1043-1055)Online publication date: Feb-2025
  • (2024)SPOT-I: Similarity Preserved Optimal Transport for Industrial IoT Data ImputationIEEE Transactions on Industrial Informatics10.1109/TII.2024.345224120:12(14421-14429)Online publication date: Dec-2024
  • (2024)An Accurate and Interpretable Framework for Trustworthy Process MonitoringIEEE Transactions on Artificial Intelligence10.1109/TAI.2023.33196065:5(2241-2252)Online publication date: May-2024
  • (2024)Attempt of Graph Neural Network Algorithm in the Field of Financial Anomaly DetectionProceedings of the 2nd International Conference on Internet of Things, Communication and Intelligent Technology10.1007/978-981-97-2757-5_65(616-623)Online publication date: 26-Apr-2024
  • (2023)Temporal Attention Convolutional Neural Networks Based on LSTM-Encoder for Time Series Forecasting2023 International Conference on Networks, Communications and Intelligent Computing (NCIC)10.1109/NCIC61838.2023.00014(51-54)Online publication date: 17-Nov-2023
  • (2023)Research on Graph Neural Network Algorithms for Financial Anomaly Detection2023 International Conference on Networks, Communications and Intelligent Computing (NCIC)10.1109/NCIC61838.2023.00009(18-23)Online publication date: 17-Nov-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media