Abstract
Since the time series data have the characteristics of a large amount of data and non-stationarity, we usually cannot obtain a satisfactory result by a single-model-based method to detect anomalies in time series data. To overcome this problem, in this paper, a combination-model-based approach is proposed by combining a similarity-measurement-based method and a model-based method for anomaly detection. First, the process of data representation is performed to generate a new data form to arrive at the purpose of reducing data volume. Furthermore, due to the anomalies being generally caused by changes in amplitude and shape, we take both the original time series data and their amplitude change data into consideration of the process of data representation to capture the shape and morphological features. Then, the results of data representation are employed to establish a model for anomaly detection. Compared with the state-of-the-art methods, experimental studies on a large number of datasets show that the proposed method can significantly improve the performance of anomaly detection with higher data anomaly resolution.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
McAfee A, Brynjolfsson E, Davenport TH (2012) Big data: the management revolution. Harv Bus Rev 90(10):60–68
Esling P, Agon C (2012) Time-series data mining. ACM Computing Surveys (CSUR) 45(1):1–34
Wilson SJ (2017) Data representation for time series data mining: time domain approaches. Wiley Interdisciplinary Reviews: Computational Statistics 9(1):e1392
Keogh E, Lin J, Fu A (2004) HOT SAX: finding the most unusual time series subsequence: algorithms and applications. In proceedings of the Proc. of the 5th IEEE Int. Conf. On data mining: 440-449
Chandra R, Chand S (2016) Evaluation of co-evolutionary neural network architectures for time series prediction with mobile application in finance. Appl Soft Comput 49:462–473
Amini MH, Kargarian A, Karabasoglu O (2016) ARIMA-based decoupled time series forecasting of electric vehicle charging demand for stochastic power system operation. Electr Power Syst Res 140:378–390
Zhao J, Liu K, Wang W, Liu Y (2014) Adaptive fuzzy clustering based anomaly data detection in energy system of steel industry. Inf Sci 259(3):335–345
Thirtle CG, Schimmelpfennig DE, Townsend RE (2002) Induced innovation in United States agriculture, 1880–1990: time series tests and an error correction model. Am J Agric Econ 84(3):598–614
Chandola V, Banerjee A, Kumar V (2009) Anomaly detection: a survey. ACM computing surveys (CSUR) 41(3):1–58
Shi Y, Li R, Zhang Y (2015) An immunity-based time series prediction approach and its application for network security situation. Intell Serv Robot 8(1):1–22
Martí L, Sanchez-Pi N (2015) Anomaly detection based on sensor data in petroleum industry applications. Sensors 15(2):2774–2797
Izakian H, Pedrycz W (2013) Anomaly detection in time series data using a fuzzy c-means clustering. In proceedings of the 2013 joint IFSA world congress and NAFIPS annual meeting (IFSA/NAFIPS): 1513-1518
Lin J, Khade R, Li Y (2012) Rotation-invariant similarity in time series using bag-of-patterns representation. J Intell Inf Syst 39(2):287–315
Xu K, Xia M, Mu X (2018) Ensemblelens: ensemble-based visual exploration of anomaly detection algorithms with multidimensional data. IEEE Trans Vis Comput Graph 25(1):109–119
Han XJ, Zhang XL, Chen YY (2014) Wind power prediction model based on the combination of gray theory and time series forecasting methods. In Proceedings of the Applied Mechanics and Materials 448:1721–1726
Mehrotra K G, Mohan C K, Huang H (2017) Anomaly detection principles and algorithms. Springer
Park JW, Kim DY (2017) Standard time estimation of manual tasks via similarity measure of unequal scale time series. IEEE Transactions on Human-Machine Systems 48(3):241–251
Ren H, Ye Z, Li Z (2017) Anomaly detection based on a dynamic Markov model. Inf Sci 411:52–65
Li J, Pedrycz W, Jamal I (2017) Multivariate time series anomaly detection: a framework of hidden Markov models. Appl Soft Comput 60:229–240
F-h S, Sun D (2010) PLR based on time series tendency turning point. Application Research of Computers 6(27):2075–2074
Nakamura T, Taki K, Nomiya H (2013) A shape-based similarity measure for time series data with ensemble learning. Pattern Anal Applic 16(4):535–548
Kolozali Ş, Puschmann D, Bermudez-Edo M (2016) On the effect of adaptive and nonadaptive analysis of time-series sensory data. IEEE Internet Things J 3(6):1084–1098
Dwivedi Y, Subba Rao S (2011) A test for second-order stationarity of a time series based on the discrete Fourier transform. J Time Ser Anal 32(1):68–91
Chaovalit P, Gangopadhyay A (2011) Discrete wavelet transform-based time series analysis and mining. ACM Computing Surveys (CSUR) 43(2):1–37
Yazdi SV, Douzal-Chouakria A (2018) Time warp invariant kSVD: sparse coding and dictionary learning for time series under time warp. Pattern Recogn Lett 112:1–8
Cai Q, Chen L, Sun J (2015) Piecewise statistic approximation based similarity measure for time series. Knowl-Based Syst 85:181–195
Keogh E, Wei L, Xi X (2009) Supporting exact indexing of arbitrarily rotated shapes and periodic time series under euclidean and warping distance measures. VLDB J 18(3):611–630
Rakthanmanon T, Campana B, Mueen A (2013) Addressing big data time series: mining trillions of time series subsequences under dynamic time warping. ACM Transactions on Knowledge Discovery from Data (TKDD) 7(3):1–31
Fang L, Wan M, Yu M (2015) Analysis of similarity measure in the longitudinal study using improved longest common subsequence method for lung cancer. Biomedical Signal Processing and Control 15:60–66
Ren H, Li X, Li Z, Pedrycz W (2018) Data representation based on interval-sets for anomaly detection in time series. IEEE Access 6(99):27473–27479
Izakian H, Pedrycz W, Jamal I (2015) Fuzzy clustering of time series data using dynamic time warping distance. Eng Appl Artif Intell 39:235–244
Münz G, Li S, Carle G (2007) Traffic anomaly detection using k-means clustering. In proceedings of the GI/ITG workshop MMBnet: 13-14
Huang H, Mehrotra K, Mohan C K (2012) Algorithms for detecting outliers via clustering and ranks. In Proceedings of the International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: 20–29
Van Der Voort M, Dougherty M, Watson S (1996) Combining Kohonen maps with ARIMA time series models to forecast traffic flow. Transportation Research Part C: Emerging Technologies 4(5):307–318
Karim F, Majumdar S, Darabi H (2017) LSTM fully convolutional networks for time series classification. IEEE Access 6(99):1662–1669
Pedrycz W, Lu W, Liu X (2014) Human-centric analysis and interpretation of time series: a perspective of granular computing. Soft Comput 18(12):2397–2411
Froelich W, Pedrycz W (2017) Fuzzy cognitive maps in the modeling of granular time series. Knowl-Based Syst 115:110–122
Su M-Y (2011) Real-time anomaly detection systems for denial-of-service attacks by weighted k-nearest-neighbor classifiers. Expert Syst Appl 38(4):3492–3498
Yahyaoui H, Al-Daihani R (2019) A novel trend based SAX reduction technique for time series. Expert Syst Appl 130:113–123
Yan Q-Y, Xia S-X, Feng K-W (2012) Probabilistic distance based abnormal pattern detection in uncertain series data. Knowl-Based Syst 36:182–190
Sternickel K (2002) Automatic pattern recognition in ECG time series. Comput Methods Prog Biomed 68(2):109–115
Acknowledgements
The authors are grateful to the Raytheon Chair for Systems Engineering for funding. This work was supported in part by the Fundamental Research Funds for the Central Universities under Grant K50510040013 and Grant K5051304007, in part by the Natural Science Foundation of China under Grant 61374068, and in part by the Science and Technology Development Fund, MSAR, under Grant 078/2015/A3, and the Doctoral Students’ Short-Term Study Abroad Scholarship Fund of Xidian University.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhou, Y., Ren, H., Li, Z. et al. Anomaly detection via a combination model in time series data. Appl Intell 51, 4874–4887 (2021). https://doi.org/10.1007/s10489-020-02041-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-020-02041-3