Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2820783.2820813acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Detecting collective anomalies from multiple spatio-temporal datasets across different domains

Published: 03 November 2015 Publication History

Abstract

The collective anomaly denotes a collection of nearby locations that are anomalous during a few consecutive time intervals in terms of phenomena collectively witnessed by multiple datasets. The collective anomalies suggest there are underlying problems that may not be identified based on a single data source or in a single location. It also associates individual locations and time intervals, formulating a panoramic view of an event. To detect a collective anomaly is very challenging, however, as different datasets have different densities, distributions, and scales. Additionally, to find the spatio-temporal scope of a collective anomaly is time consuming as there are many ways to combine regions and time slots. Our method consists of three components: Multiple-Source Latent-Topic (MSLT) model, Spatio-Temporal Likelihood Ratio Test (ST_LRT) model, and a candidate generation algorithm. MSLT combines multiple datasets to infer the latent functions of a geographic region in the framework of a topic model. In turn, a region's latent functions help estimate the underlying distribution of a sparse dataset generated in the region. ST_LRT learns a proper underlying distribution for different datasets, and calculates an anomalous degree for each dataset based on a likelihood ratio test (LRT). It then aggregates the anomalous degrees of different datasets, using a skyline detection algorithm. We evaluate our method using five datasets related to New York City (NYC): 311 complaints, taxicab data, bike rental data, points of interest, and road network data, finding the anomalies that cannot be identified (or earlier than those detected) by a single dataset. Results show the advantages beyond six baseline methods.

References

[1]
S., Borzsony, D., Kossmann, K., Stocker. The skyline operator. In Proc. of ICDE 2011, pp. 421--430.
[2]
V. Chandola, A. Banerjee, V. Kumar, Anomaly detection: a survey, ACM Computing Surveys, volume 41, pp. 1--58.
[3]
S. Chawla, Y. Zheng, and J. Hu, Inferring the root cause in road traffic anomalies, In Proc. of ICDM'12, pp. 141--150, 2012.
[4]
L., Diane. Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics 34(1) (1992): 1--14.
[5]
J. Lee, J. Han, and X. Li, "Trajectory Outlier Detection: A Partition-and-Detect Framework," In Proc. of ICDE'08, pp. 140--149, 2008.
[6]
W. Liu, Y. Zheng, S. Chawla, J. Yuan, X. Xie. Discovering Spatio-Temporal Causal Interactions in Traffic Data Streams, In Proc. of KDD'11, pp. 1010--1018, 2011.
[7]
D. Mimno and A. McCallum. Topic models conditioned on arbitrary features with dirichlet-multinomial regression. Uncertainty in Artificial Intelligence, pages 411--418, 2008.
[8]
Y., Matsubara, Y., Sakurai, C. Faloutsos. FUNNEL: automatic mining of spatially coevolving epidemics. In Proc. of KDD'14.
[9]
B. Pan, et al. Crowd Sensing of Traffic Anomalies based on Human Mobility and Social Media, In Proc. of GIS'13, pp. 334--343, 2013.
[10]
L. X. Pang, S. Chawla, W. Liu, and Y. Zheng. On Mining Anomalous Patterns in Road Traffic Streams, In Proc. ADMA'11, pp. 237--251, 2011.
[11]
L. X. Pang, S. Chawla, W. Liu, Y. Zheng. On Detection of Emerging Anomalous Traffic Patterns Using GPS Data, Data & Knowledge Engineering 87 (2013): 357--373.
[12]
M. Wu, X. Song, C. Jermaine, et al, A LRT framework for fast spatial anomaly detection, In Proc. of KDD '09, pp. 887--896.
[13]
N. J. Yuan, Y. Zheng, X. Xie. Segmentation of Urban Areas Using Road Networks. MSR-TR-2012-65. 2012.
[14]
J. Yuan, Y. Zheng, X. Xie. Discovering regions of different functions in a city using human mobility and POIs. In Proc. of KDD '12, pp. 186--194. 2012.
[15]
D. Zhang, N. Li, Z. Zhou, et al. iBAT: detecting anomalous taxi trajectories from GPS traces, In Proc. of UbiComp '11, pp.99--108.
[16]
Y. Zheng, Methodologies for cross-domain data fusion: an overview, ACM Transactions on Big Data, vol. 1, no. 1, 2015.
[17]
Y. Zheng, L. Capra, O. Wolfson, H. Yang. Urban Computing: Concepts, Methodologies, and Applications, ACM Trans. Intelligent Systems and Technology, vol. 5, no. 3, pp. 38--55, 2014
[18]
Y. Zheng, Y. Liu, J. Yuan, X. Xie, Urban Computing with Taxicabs, In Proc. of UbiComp'11, pp. 89--98, 2011.

Cited By

View all
  • (2024)A Sensor-Based Simulation Method for Spatiotemporal Event DetectionISPRS International Journal of Geo-Information10.3390/ijgi1305014113:5(141)Online publication date: 23-Apr-2024
  • (2024)Outlier Detection in Streaming Data for Telecommunications and Industrial Applications: A SurveyElectronics10.3390/electronics1316333913:16(3339)Online publication date: 22-Aug-2024
  • (2024)Contrasting Estimation of Pattern Prototypes for Anomaly Detection in Urban Crowd FlowIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.335514325:8(10231-10245)Online publication date: Aug-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGSPATIAL '15: Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems
November 2015
646 pages
ISBN:9781450339674
DOI:10.1145/2820783
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. anomaly detection
  2. big data
  3. cross-domain
  4. urban computing

Qualifiers

  • Research-article

Conference

SIGSPATIAL'15
Sponsor:

Acceptance Rates

SIGSPATIAL '15 Paper Acceptance Rate 38 of 212 submissions, 18%;
Overall Acceptance Rate 220 of 1,116 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)37
  • Downloads (Last 6 weeks)1
Reflects downloads up to 17 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Sensor-Based Simulation Method for Spatiotemporal Event DetectionISPRS International Journal of Geo-Information10.3390/ijgi1305014113:5(141)Online publication date: 23-Apr-2024
  • (2024)Outlier Detection in Streaming Data for Telecommunications and Industrial Applications: A SurveyElectronics10.3390/electronics1316333913:16(3339)Online publication date: 22-Aug-2024
  • (2024)Contrasting Estimation of Pattern Prototypes for Anomaly Detection in Urban Crowd FlowIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.335514325:8(10231-10245)Online publication date: Aug-2024
  • (2024)Anomaly Detection in Smart Environments: A Comprehensive SurveyIEEE Access10.1109/ACCESS.2024.339505112(64006-64049)Online publication date: 2024
  • (2023)A Survey of Collective Anomaly Detection on Sequence DatasetInternational Journal of Data Warehousing and Mining10.4018/IJDWM.32736319:1(1-22)Online publication date: 4-Aug-2023
  • (2023)Deep Learning Based Urban Anomaly Prediction from Spatiotemporal DataMachine Learning and Knowledge Discovery in Databases10.1007/978-3-031-26387-3_15(242-257)Online publication date: 17-Mar-2023
  • (2022)Dependency Factors in Evidence Theory: An Analysis in an Information Fusion Scenario Applied in Adverse Drug ReactionsSensors10.3390/s2206231022:6(2310)Online publication date: 16-Mar-2022
  • (2022)Graph Convolutional Adversarial Networks for Spatiotemporal Anomaly DetectionIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.313617133:6(2416-2428)Online publication date: Jun-2022
  • (2022)CSCAD: Correlation Structure-based Collective Anomaly Detection in Complex SystemIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3154166(1-1)Online publication date: 2022
  • (2022)Foresee Urban Sparse Traffic Accidents: A Spatiotemporal Multi-Granularity PerspectiveIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.303431234:8(3786-3799)Online publication date: 1-Aug-2022
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media