Abstract
Data mining has become an important task for researchers in the past few years, including detecting anomalies that may represent events of interest. The problem of anomaly detection refers to finding samples that do not conform to expected behavior. This paper analyzes recent studies on the detection of anomalies in time series. The goal is to provide an introduction to anomaly detection and a survey of recent research and challenges. The article is divided into three main parts. First, the main concepts are presented. Then, the anomaly detection task is defined. Afterward, the main approaches and strategies to solve the problem are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Aggarwal, C.C.: Outlier Analysis. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-47578-3
Akbarinia, R., Cloez, B.: Efficient matrix profile computation using different distance functions. CoRR abs/1901.05708 (2019). http://arxiv.org/abs/1901.05708
Box, G.: Time Series Analysis: Forecasting and Control. Wiley, Hoboken (2016)
Braei, M., Wagner, S.: Anomaly detection in univariate time-series: a survey on the state-of-the-art (2020)
Celik, M., Dadaser-Celik, F., Dokuz, A.S.: Anomaly detection in temperature data using DBSCAN algorithm. In: 2011 International Symposium on Innovations in Intelligent Systems and Applications. IEEE, June 2011. https://doi.org/10.1109/inista.2011.5946052
Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Comput. Surv. 41 (2009). https://doi.org/10.1145/1541880.1541882
Chandola, V., Mithal, V., Kumar, V.: Comparative evaluation of anomaly detection techniques for sequence data. In: 2008 Eighth IEEE International Conference on Data Mining. IEEE, December 2008. https://doi.org/10.1109/icdm.2008.151
Emadi, H.S., Mazinani, S.M.: A novel anomaly detection algorithm using DBSCAN and SVM in wireless sensor networks. Wirel. Pers. Commun. 98(2), 2025–2035 (2017). https://doi.org/10.1007/s11277-017-4961-1
Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, KDD 1996, pp. 226–231. AAAI Press (1996)
Feyrer, J.: Trade and income—exploiting time series in geography. Am. Econ. J. Appl. Econ. 11(4), 1–35 (2019). https://doi.org/10.1257/app.20170616
Hawkins, D.M.: Identification of Outliers. Springer, Netherlands (1980). https://doi.org/10.1007/978-94-015-3994-4
Hyndman, R.: Forecasting: Principles and Practice. OTexts, Heathmont (2014)
Hyndman, R.: Forecasting: Principles and Practice. OTexts, Melbourne (2018)
Ignatov, A.: Real-time human activity recognition from accelerometer data using convolutional neural networks. Appl. Soft Comput. 62, 915–922 (2018). https://doi.org/10.1016/j.asoc.2017.09.027
Keogh, E., Kasetty, S.: Data mining and knowledge discovery 7(4), 349–371 (2003). https://doi.org/10.1023/a:1024988512476
Koijen, R.S., Lustig, H., Nieuwerburgh, S.V.: The cross-section and time series of stock and bond returns. J. Monetary Econ. 88, 50–69 (2017). https://doi.org/10.1016/j.jmoneco.2017.05.006
Kumari, R., Sheetanshu, Singh, M.K., Jha, R., Singh, N.: Anomaly detection in network traffic using k-mean clustering. In: 2016 3rd International Conference on Recent Advances in Information Technology (RAIT). IEEE, March 2016. https://doi.org/10.1109/rait.2016.7507933
Lahmiri, S.: A variational mode decompoisition approach for analysis and forecasting of economic and financial time series. Expert Syst. Appl. 55, 268–273 (2016). https://doi.org/10.1016/j.eswa.2016.02.025
Li, J., Izakian, H., Pedrycz, W., Jamal, I.: Clustering-based anomaly detection in multivariate time series data. Appl. Soft Comput. 100, 106919 (2021). https://doi.org/10.1016/j.asoc.2020.106919
Likas, A., Vlassis, N., Verbeek, J.J.: The global k-means clustering algorithm. Pattern Recogn. 36(2), 451–461 (2003). https://doi.org/10.1016/s0031-3203(02)00060-2
Mueen, A., et al.: The fastest similarity search algorithm for time series subsequences under Euclidean distance, August 2017. http://www.cs.unm.edu/~mueen/FastestSimilaritySearch.html
Paepe, D.D., et al.: A generalized matrix profile framework with support for contextual series analysis. Eng. Appl. Artif. Intell. 90, 103487 (2020). https://doi.org/10.1016/j.engappai.2020.103487
Pincus, R.: Barnett, V., Lewis T.: Outliers in Statistical Data. 3rd edn. Wiley (1994). XVII. 582 pp., £49.95. Biometrical Journal 37(2), 256–256 (1995). https://doi.org/10.1002/bimj.4710370219
Pourbabaee, B., Roshtkhari, M.J., Khorasani, K.: Deep convolutional neural networks and learning ECG features for screening paroxysmal atrial fibrillation patients. IEEE Trans. Syst. Man Cybern. Syst. 48(12), 2095–2104 (2018). https://doi.org/10.1109/tsmc.2017.2705582
Qiu, X., Ren, Y., Suganthan, P.N., Amaratunga, G.A.: Empirical mode decomposition based ensemble deep learning for load demand time series forecasting. Appl. Soft Comput. 54, 246–255 (2017). https://doi.org/10.1016/j.asoc.2017.01.015
Soares, E., Costa, P., Costa, B., Leite, D.: Ensemble of evolving data clouds and fuzzy models for weather time series prediction. Appl. Soft Comput. 64, 445–453 (2018). https://doi.org/10.1016/j.asoc.2017.12.032
Thudumu, S., Branch, P., Jin, J., Singh, J.: A comprehensive survey of anomaly detection techniques for high dimensional big data. J. Big Data 7(1) (2020). https://doi.org/10.1186/s40537-020-00320-x
Wang, X., Ahn, S.H.: Real-time prediction and anomaly detection of electrical load in a residential community. Appl. Energy 259, 114145 (2020). https://doi.org/10.1016/j.apenergy.2019.114145
Yeh, C., et al.: Matrix profile I: all pairs similarity joins for time series: a unifying view that includes motifs, discords and shapelets. In: 2016 IEEE 16th International Conference on Data Mining (ICDM), Los Alamitos, CA, USA, pp. 1317–1322. IEEE Computer Society, December 2016. https://doi.org/10.1109/ICDM.2016.0179. https://doi.ieeecomputersociety.org/10.1109/ICDM.2016.0179
Zhu, Y., et al.: Matrix profile II: Exploiting a novel algorithm and GPUs to break the one hundred million barrier for time series motifs and joins. In: 2016 IEEE 16th International Conference on Data Mining (ICDM), pp. 739–748 (2016). https://doi.org/10.1109/ICDM.2016.0085
Zhu, Y., Yeh, C.C.M., Zimmerman, Z., Kamgar, K., Keogh, E.: Matrix profile XI: SCRIMP++: time series motif discovery at interactive speeds. In: 2018 IEEE International Conference on Data Mining (ICDM), pp. 837–846 (2018). https://doi.org/10.1109/ICDM.2018.00099
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer-Verlag GmbH Germany, part of Springer Nature
About this chapter
Cite this chapter
Borges, H., Akbarinia, R., Masseglia, F. (2021). Anomaly Detection in Time Series. In: Hameurlain, A., Tjoa, A.M. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems L. Lecture Notes in Computer Science(), vol 12930. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-64553-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-662-64553-6_3
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-64552-9
Online ISBN: 978-3-662-64553-6
eBook Packages: Computer ScienceComputer Science (R0)