Abstract
Outlier detection is an important problem that has been studied and applied in many domains, ranging from fraud detection to intrusion detection. Most existing methods were developed to detect global and/or local outliers using either content information or structure information. Unfortunately, these conventional algorithms face unprecedented challenges in social networks, where data and link information are tightly integrated.
In this paper, we give a descriptive definition of a community outlier and propose a novel measure for it, the Community Outlying Factor. We then propose a scalable community outlier detection algorithm (SCODA) that considers both the content and the structure information of social networks. Furthermore, SCODA reduces the number of input parameters to one: the number of outliers. Experimental results show that the running time of SCODA scales linearly with the number of nodes, which means that our algorithm can handle very large data sets.
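To make the idea of combining content and structure concrete, the following is a minimal illustrative sketch in Python. It is not the Community Outlying Factor or SCODA, since the abstract does not give their definitions. It assumes a hypothetical stand-in score: each node already carries a content-based group label, and its score is the fraction of its neighbors whose label differs. The names `mismatch_scores` and `top_k_outliers` and the toy graph are assumptions made for illustration. As in the abstract, the only user-facing parameter is the number of outliers `k`.

```python
import heapq


def mismatch_scores(labels, neighbors):
    """Score each node by the fraction of its neighbors with a different label.

    labels:    dict mapping node -> content-based group label (assumed given)
    neighbors: dict mapping node -> list of adjacent nodes
    This is a hypothetical stand-in score, not the paper's measure.
    The pass touches every node and every adjacency entry once.
    """
    scores = {}
    for node, label in labels.items():
        adjacent = neighbors.get(node, [])
        if not adjacent:
            # Isolated nodes carry no structural evidence in this sketch.
            scores[node] = 0.0
            continue
        differing = sum(1 for other in adjacent if labels[other] != label)
        scores[node] = differing / len(adjacent)
    return scores


def top_k_outliers(labels, neighbors, k):
    """Return the k nodes with the highest mismatch score."""
    scores = mismatch_scores(labels, neighbors)
    ranked = heapq.nlargest(k, scores.items(), key=lambda item: item[1])
    return [node for node, _ in ranked]


# Toy network: two triangles, plus node "x" whose content label matches
# the first group but whose links all go to the second group.
labels = {"a": 0, "b": 0, "c": 0, "d": 1, "e": 1, "f": 1, "x": 0}
neighbors = {
    "a": ["b", "c"],
    "b": ["a", "c"],
    "c": ["a", "b"],
    "d": ["e", "f", "x"],
    "e": ["d", "f", "x"],
    "f": ["d", "e", "x"],
    "x": ["d", "e", "f"],
}

print(top_k_outliers(labels, neighbors, k=1))
```

In this toy network, node `x` is unremarkable by content alone and by degree alone. It stands out only when its label is compared with the labels of the nodes it links to, which is the kind of case that motivates using both sources of information.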
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ji, T., Gao, J., Yang, D. (2012). A Scalable Algorithm for Detecting Community Outliers in Social Networks. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds) Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32281-5_42
DOI: https://doi.org/10.1007/978-3-642-32281-5_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32280-8
Online ISBN: 978-3-642-32281-5
eBook Packages: Computer Science, Computer Science (R0)