Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/777092.777129acmconferencesArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Rule-based anomaly pattern detection for detecting disease outbreaks

Published: 28 July 2002 Publication History

Abstract

This paper presents an algorithm for performing early detection of disease outbreaks by searching a database of emergency department cases for anomalous patterns. Traditional techniques for anomaly detection are unsatisfactory for this problem because they identify individual data points that are rare due to particular combinations of features. When applied to our scenario, these traditional algorithms discover isolated outliers of particularly strange events, such as someone accidentally shooting their ear, that are not indicative of a new outbreak. Instead, we would like to detect anomalous patterns. These patterns are groups with specific characteristics whose recent pattern of illness is anomalous relative to historical patterns. We propose using a rule-based anomaly detection algorithm that characterizes each anomalous pattern with a rule. The significance of each rule is carefully evaluated using Fisher's Exact Test and a randomization test. Our algorithm is compared against a standard detection algorithm by measuring the number of false positives and the timeliness of detection. Simulated data, produced by a simulator that creates the effects of an epidemic on a city, is used for evaluation. The results indicate that our algorithm has significantly better detection times for common significance thresholds while having a slightly higher false positive rate.

References

[1]
Bay, S. D., and Pazzani, M.J. 1999. Detecting change in categorical data: Mining contrast sets. In Knowledge Discovery and Data Mining, 302-306.
[2]
Benjamini, Y., and Hochberg, Y. 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society, Series B. 57:289-300.
[3]
Bishop, C. M. 1994. Novelty detection and neural network validation. IEEE Proceedings - Vision, Image and Signal Processing 141(4):217-222.
[4]
Bonferroni, C. E. 1936. Teoria statistica delle classi e calcolo delle probabilità. Pubblicazioni del R Istituto Superiore di Scienze Economiche e Commerciali di Firenze 8:3-62.
[5]
Brin, S.; Motwani, R.; Ullman, J. D.; and Tsur, S. 1997. Dynamic itemset counting and implication rules for market basket data. In Peckham, J., ed., SIGMOD 1997, Proceedings ACM SIGMOD International Conference on Management of Data, May 13-15, 1997, Tucson, Arizona, USA, 255-264. ACM Press.
[6]
Eskin, E. 2000. Anomaly detection over noisy data using learned probability distributions. In Proceedings of the 2000 International Conference on Machine Learning (ICML-2000).
[7]
Goldenberg, A. 2001. Framework for using grocery data for early detection of bio-terrorism attacks. Master's thesis, Carnegie Mellon University.
[8]
Good, P. 2000. Permutation Tests - A Practical Guide to Resampling Methods for Testing Hypotheses. New York: Springer-Verlag, 2nd edition.
[9]
Hamerly, G., and Elkan, C. 2001. Bayesian approaches to failure prediction for disk drives. In Proceedings of the eighteenth international conference on machine learning, 202-209. Morgan Kaufmann, San Francisco, CA.
[10]
Lane, T., and Brodley, C.E. 1999. Temporal sequence learning and data reduction for anomaly detection. ACM Transactions on Information and System Security 2:295-331.
[11]
Maron, O., and Moore, A. W. 1997. The racing algorithm: Model selection for lazy learners. Artificial Intelligence Review 11(1-5):193-225.
[12]
Maxion, R. A., and Tan, K. M. C. 2001. Anomaly detection in embedded systems. Technical Report CMU-CS-01-157, Carnegie Mellon University.
[13]
Miller, C. J.; Genovese, C.; Nichol, R. C.; Wasserman, L.; Connolly, A.; Reichart, D.; Hopkins, A.; Schneider, J.; and Moore, A. 2001. Controlling the false discovery rate in astrophysical data analysis. Technical report, Carnegie Mellon University.
[14]
Wagner, M. M.; Tsui, F. C.; Espino, J. U.; Dato, V. M.; Sittig, D. F.; Caruana, R. A.; McGinnis, L. F.; Deerfield, D. W.; Druzdzel, M. J.; and Fridsma, D. B. 2001. The emerging science of very early detection of disease outbreaks. Journal of Public Health Management Practice 7(6):51-59.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
Eighteenth national conference on Artificial intelligence
July 2002
1068 pages
ISBN:0262511290

Sponsors

  • NSF: National Science Foundation
  • Alberta Informatics Circle of Research Excellence (iCORE)
  • SIGAI: ACM Special Interest Group on Artificial Intelligence
  • Naval Research Laboratory: Naval Research Laboratory
  • AAAI: American Association for Artificial Intelligence
  • NASA Ames Research Center: NASA Ames Research Center
  • DARPA: Defense Advanced Research Projects Agency

Publisher

American Association for Artificial Intelligence

United States

Publication History

Published: 28 July 2002

Check for updates

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Contextual outlier interpretationProceedings of the 27th International Joint Conference on Artificial Intelligence10.5555/3304889.3305002(2461-2467)Online publication date: 13-Jul-2018
  • (2018)Group Deviation Detection MethodsACM Computing Surveys10.1145/320324651:4(1-38)Online publication date: 25-Jul-2018
  • (2017)Outlier Detection via Deep Learning ArchitectureProceedings of the 18th International Conference on Computer Systems and Technologies10.1145/3134302.3134337(73-79)Online publication date: 23-Jun-2017
  • (2015)Unsupervised Anomaly Detection in Online GameProceedings of the 6th International Symposium on Information and Communication Technology10.1145/2833258.2833305(4-10)Online publication date: 3-Dec-2015
  • (2015)A fast MST-inspired kNN-based outlier detection methodInformation Systems10.1016/j.is.2014.09.00248:C(89-112)Online publication date: 1-Mar-2015
  • (2014)On detecting spatial categorical outliersGeoinformatica10.1007/s10707-013-0188-918:3(501-536)Online publication date: 1-Jul-2014
  • (2013)Mining multidimensional contextual outliers from categorical relational dataProceedings of the 25th International Conference on Scientific and Statistical Database Management10.1145/2484838.2484883(1-4)Online publication date: 29-Jul-2013
  • (2012)Hidden Source Behavior Change Tracking and DetectionProceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 0210.1109/WI-IAT.2012.137(51-58)Online publication date: 4-Dec-2012
  • (2012)A minimum spanning tree-inspired clustering-based outlier detection techniqueProceedings of the 12th Industrial conference on Advances in Data Mining: applications and theoretical aspects10.1007/978-3-642-31488-9_17(209-223)Online publication date: 13-Jul-2012
  • (2012)Detection of high-risk zones and potential infected neighbors from infectious disease monitoring dataProceedings of the 17th international conference on Database Systems for Advanced Applications10.1007/978-3-642-29023-7_27(262-273)Online publication date: 15-Apr-2012
  • Show More Cited By

View Options

Get Access

Login options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media