Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1869790.1869841acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Spatial outlier detection: random walk based approaches

Published: 02 November 2010 Publication History

Abstract

A spatial outlier is a spatially referenced object whose non-spatial attributes are very different from those of its spatial neighbors. Spatial outlier detection has been an important part of spatial data mining and attracted attention in the past decades. Numerous SOD (Spatial Outlier Detection) approaches have been proposed. However, in these techniques, there exist the problems of masking and swamping. That is, some spatial outliers can escape the identification, and normal objects can be erroneously identified as outliers. In this paper, two Random walk based approaches, RW-BP (Random Walk on Bipartite Graph) and RW-EC (Random Walk on Exhaustive Combination), are proposed to detect spatial outliers. First, two different weighed graphs, a BP (Bipartite graph) and an EC (Exhaustive Combination), are modeled based on the spatial and/or non-spatial attributes of the spatial objects. Then, random walk techniques are utilized on the graphs to compute the relevance scores between the spatial objects. Using the analysis results, the outlier scores are computed for each object and the top k objects are recognized as outliers. Experiments conducted on the synthetic and real datasets demonstrated the effectiveness of the proposed approaches.

References

[1]
N. R. Adam, V. P. Janeja, and V. Atluri. Neighborhood based detection of anomalies in high dimensional spatio-temporal sensor datasets. In SAC '04: Proceedings of the 2004 ACM symposium on Applied computing, pages 576--583, New York, NY, USA, 2004. ACM.
[2]
F. Angiulli and F. Fassetti. Detecting distance-based outliers in streams of data. In CIKM '07, pages 811--820, New York, NY, USA, 2007. ACM.
[3]
L. Anselin. Local indicators of spatial association(lisa. volume 27, pages 93--115. Geographical Analysis, 1995.
[4]
V. Barnett and T. Lewis. Outliers in Statistical Data. 1994.
[5]
M. M. Breunig, H.-P. Kriegel, R. T. Ng, and J. Sander. Lof: identifying density-based local outliers. SIGMOD Rec., 29(2):93--104, 2000.
[6]
T. Cheng, D. Chen, and Y. Kou. Detecting spatial outliers with multiple attributes. In Transactions in GIS, pages 253--263, 2006.
[7]
N. A. C. Cressie. Statistics for Spatial Data. Wiley, 1993.
[8]
M. Ester, H. peter Kriegel, J. S, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining, pages 226--231. AAAI Press, 1996.
[9]
L. Hagen and A. B. Kahng. A new approach to effective circuit clustering. In ICCAD '92: Proceedings of the 1992 IEEE/ACM international conference on Computer-aided design, pages 422--427, Los Alamitos, CA, USA, 1992. IEEE Computer Society Press.
[10]
R. Haining. Spatial data analysis in the social and environmental sciences. Cambridge University Press, 1993.
[11]
D. Harel and Y. Koren. Clustering spatial data using random walks. In KDD '01, pages 281--286, New York, NY, USA, 2001. ACM.
[12]
V. P. Janeja and V. Atluri. Fs3: A random walk based free-form spatial scan statistic for anomalous window detection. In ICDM '05, pages 661--664, Washington, DC, USA, 2005. IEEE Computer Society.
[13]
W. Jin, A. K. H. Tung, and J. Han. Mining top-n local outliers in large databases. In KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, pages 293--298, New York, NY, USA, 2001. ACM.
[14]
E. M. Knorr and R. T. Ng. Algorithms for mining distance-based outliers in large datasets. In VLDB '98, pages 392--403, San Francisco, CA, USA, 1998. Morgan Kaufmann Publishers Inc.
[15]
Y. Kou, C.-T. Lu, and R. F. D. Santos. Spatial outlier detection: A graph-based approach. In ICTAI '07: Proceedings of the 19th IEEE International Conference on Tools with Artificial Intelligence, pages 281--288, Washington, DC, USA, 2007. IEEE Computer Society.
[16]
C.-T. Lu, D. Chen, and Y. Kou. Algorithms for spatial outlier detection. In ICDM '03, pages 597--600, Washington, DC, USA, 2003. IEEE Computer Society.
[17]
R. Mihalcea and P. Tarau. TextRank: Bringing order into texts. In Conference on Empirical Methods in Natural Language Processing, July 2004.
[18]
H. D. K. Moonesinghe and P.-N. Tan. Outlier detection using random walks. In ICTAI '06, pages 532--539, Washington, DC, USA, 2006. IEEE Computer Society.
[19]
R. T. Ng and J. Han. Efficient and effective clustering methods for spatial data mining. In VLDB '94, pages 144--155, San Francisco, CA, USA, 1994. Morgan Kaufmann Publishers Inc.
[20]
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web, 1999.
[21]
J.-Y. Pan, H.-J. Yang, C. Faloutsos, and P. Duygulu. Automatic multimedia cross-modal correlation discovery. In KDD '04, pages 653--658, New York, NY, USA, 2004. ACM.
[22]
S. Ray and R. H. Turi. Determination of number of clusters in k-means clustering and application in color image segmentation. In 4th International Conference on Advances in Pattern Recognition and Digital Techniques (ICAPRDT'99), pages 137--143, 1999.
[23]
O. Schabenberger and C. A. Gotway. Statistical Methods for Spatial Data Analysis. Chapman & Hall/CRC, 2005.
[24]
S. Shekhar, C.-T. Lu, and P. Zhang. Detecting graph-based spatial outliers: algorithms and applications (a summary of results). In KDD '01, pages 371--376, New York, NY, USA, 2001. ACM.
[25]
S. Shekhar, C.-T. Lu, and P. Zhang. A unified approach to spatial outliers detection. GeoInformatica, 7:139--166, 2003.
[26]
J. Sun, H. Qu, D. Chakrabarti, and C. Faloutsos. Neighborhood formation and anomaly detection in bipartite graphs. In ICDM '05, pages 418--425, Washington, DC, USA, 2005. IEEE Computer Society.
[27]
P. Sun and S. Chawla. On local spatial outliers. In ICDM '04, pages 209--216, Washington, DC, USA, 2004. IEEE Computer Society.
[28]
W. R. Tobler. A computer movie simulating urban growth in the detroit region. Economic Geography, 25:234--240, 1970.
[29]
J. Zhao, C.-T. Lu, and Y. Kou. Detecting region outliers in meteorological data. In ACM GIS '03, pages 49--55, New York, NY, USA, 2003. ACM.

Cited By

View all
  • (2022)Unsupervised Outlier Detection for Mixed-Valued Dataset Based on the Adaptive k-Nearest Neighbor Global NetworkIEEE Access10.1109/ACCESS.2022.316148110(32093-32103)Online publication date: 2022
  • (2021)MEOD: Memory-Efficient Outlier Detection on Streaming DataSymmetry10.3390/sym1303045813:3(458)Online publication date: 12-Mar-2021
  • (2021)Advanced Memory Efficient Outlier Detection Approach for Streaming Data using Swarm Optimization2021 44th International Conference on Telecommunications and Signal Processing (TSP)10.1109/TSP52935.2021.9522667(346-351)Online publication date: 26-Jul-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
GIS '10: Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems
November 2010
566 pages
ISBN:9781450304283
DOI:10.1145/1869790
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 November 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. data mining
  2. random walk
  3. spatial outlier detection

Qualifiers

  • Research-article

Conference

GIS '10
Sponsor:

Acceptance Rates

Overall Acceptance Rate 220 of 1,116 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)3
Reflects downloads up to 04 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Unsupervised Outlier Detection for Mixed-Valued Dataset Based on the Adaptive k-Nearest Neighbor Global NetworkIEEE Access10.1109/ACCESS.2022.316148110(32093-32103)Online publication date: 2022
  • (2021)MEOD: Memory-Efficient Outlier Detection on Streaming DataSymmetry10.3390/sym1303045813:3(458)Online publication date: 12-Mar-2021
  • (2021)Advanced Memory Efficient Outlier Detection Approach for Streaming Data using Swarm Optimization2021 44th International Conference on Telecommunications and Signal Processing (TSP)10.1109/TSP52935.2021.9522667(346-351)Online publication date: 26-Jul-2021
  • (2019)Anomaly Region Detection Based on DMSTInternational Journal of Data Warehousing and Mining10.4018/IJDWM.201901010315:1(39-57)Online publication date: 1-Jan-2019
  • (2018)Outlier Detection in Urban Traffic DataProceedings of the 8th International Conference on Web Intelligence, Mining and Semantics10.1145/3227609.3227692(1-12)Online publication date: 25-Jun-2018
  • (2018)Knowledge Discovery Process for Detection of Spatial OutliersRecent Trends and Future Technology in Applied Intelligence10.1007/978-3-319-92058-0_6(57-68)Online publication date: 30-May-2018
  • (2018)There and back again: Outlier detection between statistical reasoning and data mining algorithmsWIREs Data Mining and Knowledge Discovery10.1002/widm.12808:6Online publication date: 20-Aug-2018
  • (2017)Contextual Spatial Outlier Detection with Metric LearningProceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining10.1145/3097983.3098143(2161-2170)Online publication date: 13-Aug-2017
  • (2016)A Spatial Anomaly Points and Regions Detection Method Using Multi‐Constrained Graphs and Local DensityTransactions in GIS10.1111/tgis.1220821:2(376-405)Online publication date: 27-Jun-2016
  • (2015)Outlier Detection and Trend DetectionProceedings of the 2015 IEEE International Conference on Data Mining Workshop (ICDMW)10.1109/ICDMW.2015.79(40-46)Online publication date: 14-Nov-2015
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media