Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Outlier Detection Techniques: A Comparative Study

  • Conference paper
  • First Online:
Edge Analytics

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 869))

  • 691 Accesses

Abstract

With the recently rising technologies and numerous applications, the necessity of outlier detection is increasing drastically. Currently, a major variant of outlier detection techniques is witnessed. These techniques played a crucial role in the advancement of fields like medical health, MasterCard fraud, and intrusion detection. However, it is a significant work to spot abnormal behaviours or patterns out from sophisticated data. This paper provides a summary of the outlier detection strategies for the high-dimensional dataset and offers a comprehensive understanding of all basic techniques of outlier detection. This paper provides a comprehensive summary of the ongoing work on anomaly detection techniques, particularly with high-dimensional datasets and data with mixed attributes. The detection of outliers from the given dataset with anomalous data is meaningful work in the area of big data as the data is increasing exponentially every year. Specifically, this paper discusses the current advancement in the field of anomaly detection methods and simultaneously discusses the strengths and limitations of each outlier detection method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Dubey A, Rasool A (2021) Efficient technique of microarray missing data imputation using clustering and weighted nearest neighbour. Sci Rep 11: 1–12

    Google Scholar 

  2. Dubey A, Rasool A (2019) Data mining based handling missing data. In: Proceeding of the third international conference on I-SMAC (IoT in social, mobile, analytics and cloud) (I-SMAC). Palladam, India, pp 483–489

    Google Scholar 

  3. Dubey A, Rasool A (2020) Clustering-Based Hybrid Approach for Multivariate Missing Data Imputation. Int J Adv Comput Sci Appl 11(11): 710–714

    Google Scholar 

  4. Zhang J (2013) Advancements of outlier detection: a survey. ICST Trans Scalable Inform Syst 13(1):1–26

    Article  Google Scholar 

  5. Xu X, Liu H, Yao M (2019) Recent progress of anomaly detection. Complexity

    Google Scholar 

  6. Upadhyaya S, Singh K (2012) Nearest neighbour-based outlier detection techniques. Int J Comput Trends Technol 3(2):299–303

    Google Scholar 

  7. Zimek A, Campello RJ, Sander J (2014) Ensembles for unsupervised outlier detection: challenges and research questions a position paper. ACM SIGKDD Explorat Newsl 15(1):11–22

    Article  Google Scholar 

  8. Aggarwal C, Sathe S (2015) Theoretical foundations and algorithms for outlier ensembles. ACM SIGKDD Explorat Newsl 17(1):24–47

    Article  Google Scholar 

  9. Do K, Tran T, Phung D, Venkatesh S (2016) Outlier detection on mixed-type data: an energy-based approach. In: Proceeding of the international conference on advanced data mining and applications. Springer, Cham, pp 111–125

    Google Scholar 

  10. Agrawal A (2009) Local subspace-based outlier detection. In: International conference on contemporary computing. Springer, Berlin, Heidelberg, pp 149–157

    Google Scholar 

  11. Dang TT, Ngan HY, Liu W (2015) Distance-based k-nearest neighbours outlier detection method in large-scale traffic data. In: Proceeding of the IEEE international conference on digital signal processing (DSP), pp 507–510

    Google Scholar 

  12. Shah P A critical survey on anomaly detection

    Google Scholar 

  13. Kriegel HP, Kröger P, Schubert E, Zimek A (2009) LoOP: local outlier probabilities. In: Proceedings of the 18th ACM conference on information and knowledge management, pp 1649–1652

    Google Scholar 

  14. Kriegel HP, Kröger P, Sander J, Zimek A (2011) Density-based clustering. Wiley Interdiscip Rev Data Mining Knowl Discov 1(3):231–240

    Article  Google Scholar 

  15. Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier

    Google Scholar 

  16. Huang H, Mehrotra K, Mohan CK (2013) Rank-based outlier detection. J Stat Comput Simul 83(3):518–531

    Article  MathSciNet  Google Scholar 

  17. Zimek A, Filzmoser P (2018) There and back again: outlier detection between statistical reasoning and data mining algorithms. Wiley Interdiscip Rev Data Mining Knowl Discov 8(6):1280

    Article  Google Scholar 

  18. Zhang J, Yu X, Li Y, Zhang S, Xun Y, Qin X (2016) A relevant subspace-based contextual outlier mining algorithm. Knowl-Based Syst 99:1–9

    Article  Google Scholar 

  19. Kriegel HP, Kröger P, Schubert E, Zimek A (2009) Outlier detection in axis-parallel subspaces of high dimensional data. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, Berlin, Heidelberg, pp 831–838

    Google Scholar 

  20. Muller E, Assent I, Steinhausen U, Seidl T (2008) Outrank ranking outliers in high dimensional data. In: Proceeding of the IEEE 24th international conference on data engineering workshop, pp 600–603

    Google Scholar 

  21. Chakraborty S, Nagwani NK Analysis and study of Incremental DBSCAN clustering algorithm. arXiv preprint arXiv,1406.4754.

    Google Scholar 

  22. Müller E, Schiffer M, Seidl T (2010) Adaptive outlierness for subspace outlier ranking. In: Proceedings of the 19th ACM international conference on information and knowledge management, pp 1629–1632

    Google Scholar 

  23. Zhou Z (2016) Machine learning. Tsinghua University Press, Beijing, pp 53–72

    Google Scholar 

  24. Lazarevic A, Kumar V (2005) Feature bagging for outlier detection. In: Proceedings of the eleventh ACM SIGKDD international conference on knowledge discovery in data mining, pp 157–166

    Google Scholar 

  25. Keller F, Muller E, Bohm K (2012) HiCS: high contrast subspaces for density-based outlier ranking. In: Proceeding of the IEEE 28th international conference on data engineering. Washington, DC, pp 1037–1048

    Google Scholar 

  26. Pasillas-Díaz JR, Ratté S (2016) An unsupervised approach for combining scores of outlier detection techniques, based on similarity measures. Electr Notes Theor Comput Sci 61(7):329

    Google Scholar 

  27. Ghoting A, Otey ME, Parthasarathy S (2004) Loaded: link-based outlier and anomaly detection in evolving data sets. In: Proceeding of the fourth IEEE international conference on data mining (ICDM’04), pp 387–390

    Google Scholar 

  28. Moens S, Aksehirli E, Goethals B (2013) Frequent itemset mining for big data. In: IEEE international conference on big data, pp 111–118

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Das, C., Dubey, A., Rasool, A. (2022). Outlier Detection Techniques: A Comparative Study. In: Patgiri, R., Bandyopadhyay, S., Borah, M.D., Emilia Balas, V. (eds) Edge Analytics. Lecture Notes in Electrical Engineering, vol 869. Springer, Singapore. https://doi.org/10.1007/978-981-19-0019-8_42

Download citation

  • DOI: https://doi.org/10.1007/978-981-19-0019-8_42

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-19-0018-1

  • Online ISBN: 978-981-19-0019-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics