Abstract
With the recently rising technologies and numerous applications, the necessity of outlier detection is increasing drastically. Currently, a major variant of outlier detection techniques is witnessed. These techniques played a crucial role in the advancement of fields like medical health, MasterCard fraud, and intrusion detection. However, it is a significant work to spot abnormal behaviours or patterns out from sophisticated data. This paper provides a summary of the outlier detection strategies for the high-dimensional dataset and offers a comprehensive understanding of all basic techniques of outlier detection. This paper provides a comprehensive summary of the ongoing work on anomaly detection techniques, particularly with high-dimensional datasets and data with mixed attributes. The detection of outliers from the given dataset with anomalous data is meaningful work in the area of big data as the data is increasing exponentially every year. Specifically, this paper discusses the current advancement in the field of anomaly detection methods and simultaneously discusses the strengths and limitations of each outlier detection method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Dubey A, Rasool A (2021) Efficient technique of microarray missing data imputation using clustering and weighted nearest neighbour. Sci Rep 11: 1–12
Dubey A, Rasool A (2019) Data mining based handling missing data. In: Proceeding of the third international conference on I-SMAC (IoT in social, mobile, analytics and cloud) (I-SMAC). Palladam, India, pp 483–489
Dubey A, Rasool A (2020) Clustering-Based Hybrid Approach for Multivariate Missing Data Imputation. Int J Adv Comput Sci Appl 11(11): 710–714
Zhang J (2013) Advancements of outlier detection: a survey. ICST Trans Scalable Inform Syst 13(1):1–26
Xu X, Liu H, Yao M (2019) Recent progress of anomaly detection. Complexity
Upadhyaya S, Singh K (2012) Nearest neighbour-based outlier detection techniques. Int J Comput Trends Technol 3(2):299–303
Zimek A, Campello RJ, Sander J (2014) Ensembles for unsupervised outlier detection: challenges and research questions a position paper. ACM SIGKDD Explorat Newsl 15(1):11–22
Aggarwal C, Sathe S (2015) Theoretical foundations and algorithms for outlier ensembles. ACM SIGKDD Explorat Newsl 17(1):24–47
Do K, Tran T, Phung D, Venkatesh S (2016) Outlier detection on mixed-type data: an energy-based approach. In: Proceeding of the international conference on advanced data mining and applications. Springer, Cham, pp 111–125
Agrawal A (2009) Local subspace-based outlier detection. In: International conference on contemporary computing. Springer, Berlin, Heidelberg, pp 149–157
Dang TT, Ngan HY, Liu W (2015) Distance-based k-nearest neighbours outlier detection method in large-scale traffic data. In: Proceeding of the IEEE international conference on digital signal processing (DSP), pp 507–510
Shah P A critical survey on anomaly detection
Kriegel HP, Kröger P, Schubert E, Zimek A (2009) LoOP: local outlier probabilities. In: Proceedings of the 18th ACM conference on information and knowledge management, pp 1649–1652
Kriegel HP, Kröger P, Sander J, Zimek A (2011) Density-based clustering. Wiley Interdiscip Rev Data Mining Knowl Discov 1(3):231–240
Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier
Huang H, Mehrotra K, Mohan CK (2013) Rank-based outlier detection. J Stat Comput Simul 83(3):518–531
Zimek A, Filzmoser P (2018) There and back again: outlier detection between statistical reasoning and data mining algorithms. Wiley Interdiscip Rev Data Mining Knowl Discov 8(6):1280
Zhang J, Yu X, Li Y, Zhang S, Xun Y, Qin X (2016) A relevant subspace-based contextual outlier mining algorithm. Knowl-Based Syst 99:1–9
Kriegel HP, Kröger P, Schubert E, Zimek A (2009) Outlier detection in axis-parallel subspaces of high dimensional data. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, Berlin, Heidelberg, pp 831–838
Muller E, Assent I, Steinhausen U, Seidl T (2008) Outrank ranking outliers in high dimensional data. In: Proceeding of the IEEE 24th international conference on data engineering workshop, pp 600–603
Chakraborty S, Nagwani NK Analysis and study of Incremental DBSCAN clustering algorithm. arXiv preprint arXiv,1406.4754.
Müller E, Schiffer M, Seidl T (2010) Adaptive outlierness for subspace outlier ranking. In: Proceedings of the 19th ACM international conference on information and knowledge management, pp 1629–1632
Zhou Z (2016) Machine learning. Tsinghua University Press, Beijing, pp 53–72
Lazarevic A, Kumar V (2005) Feature bagging for outlier detection. In: Proceedings of the eleventh ACM SIGKDD international conference on knowledge discovery in data mining, pp 157–166
Keller F, Muller E, Bohm K (2012) HiCS: high contrast subspaces for density-based outlier ranking. In: Proceeding of the IEEE 28th international conference on data engineering. Washington, DC, pp 1037–1048
Pasillas-DÃaz JR, Ratté S (2016) An unsupervised approach for combining scores of outlier detection techniques, based on similarity measures. Electr Notes Theor Comput Sci 61(7):329
Ghoting A, Otey ME, Parthasarathy S (2004) Loaded: link-based outlier and anomaly detection in evolving data sets. In: Proceeding of the fourth IEEE international conference on data mining (ICDM’04), pp 387–390
Moens S, Aksehirli E, Goethals B (2013) Frequent itemset mining for big data. In: IEEE international conference on big data, pp 111–118
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Das, C., Dubey, A., Rasool, A. (2022). Outlier Detection Techniques: A Comparative Study. In: Patgiri, R., Bandyopadhyay, S., Borah, M.D., Emilia Balas, V. (eds) Edge Analytics. Lecture Notes in Electrical Engineering, vol 869. Springer, Singapore. https://doi.org/10.1007/978-981-19-0019-8_42
Download citation
DOI: https://doi.org/10.1007/978-981-19-0019-8_42
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-0018-1
Online ISBN: 978-981-19-0019-8
eBook Packages: EngineeringEngineering (R0)