Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/1032649.1033431guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Detection of Significant Sets of Episodes in Event Sequences

Published: 01 November 2004 Publication History

Abstract

We present a method for a reliable detection of "unusual" sets of episodes in the form of many pattern sequences, scanned simultaneously for an occurrence as a subsequence in a large event stream within a window of size w. We also investigate the important special case of all permutations of the same sequence, which models the situation where the order of events in an episode does not matter, e.g., when events correspond to purchased market basket items. In order to build a reliable monitoring system we compare obtained measurements to a reference model which in our case is a probabilistic model (Bernoulli or Markov). We first present a precise analysis that leads to a construction of a threshold. The difficulties of carrying out a probabilistic analysis for an arbitrary set of patterns, stems from the possible simultaneous occurrence of many members of the set as subsequences in the same window, the fact that the different patterns typically do have common symbols or common subsequences or possibly common prefixes, and that they may have different lengths. We also report on extensive experimental results, carried out on the Wal-Mart transactions database, that show a remarkable agreement with our theoretical analysis. This paper is an extension of our previous work in [Reliable detection of episodes in event sequences] where we laid out foundation for the problem of the reliable detection of an "unusual" episodes, but did not consider more than one episode scanned simultaneously for an occurrence.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
ICDM '04: Proceedings of the Fourth IEEE International Conference on Data Mining
November 2004
580 pages
ISBN:0769521428

Publisher

IEEE Computer Society

United States

Publication History

Published: 01 November 2004

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Large-Scale Frequent Episode Mining from Complex Event Sequences with HierarchiesACM Transactions on Intelligent Systems and Technology10.1145/332616310:4(1-26)Online publication date: 20-Jul-2019
  • (2019)Discovering frequent chain episodesKnowledge and Information Systems10.1007/s10115-019-01349-y60:1(447-494)Online publication date: 1-Jul-2019
  • (2017)Contextual information fusion for intrusion detectionKnowledge and Information Systems10.1007/s10115-017-1027-352:3(563-619)Online publication date: 1-Sep-2017
  • (2015)Oracle Workload IntelligenceProceedings of the 2015 ACM SIGMOD International Conference on Management of Data10.1145/2723372.2742791(1669-1681)Online publication date: 27-May-2015
  • (2015)A precise ranking method for outlier detectionInformation Sciences: an International Journal10.1016/j.ins.2015.06.030324:C(88-107)Online publication date: 10-Dec-2015
  • (2015)Statistical significance of episodes with general partial ordersInformation Sciences: an International Journal10.1016/j.ins.2014.09.063296:C(175-200)Online publication date: 1-Mar-2015
  • (2013)A prediction framework based on contextual data to support Mobile Personalized MarketingDecision Support Systems10.5555/2747904.274823056:C(234-246)Online publication date: 1-Dec-2013
  • (2012)Mining statistically significant substrings using the chi-square statisticProceedings of the VLDB Endowment10.14778/2336664.23366775:10(1052-1063)Online publication date: 1-Jun-2012
  • (2011)Mining actionable partial orders in collections of sequencesProceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I10.5555/2034063.2034115(613-528)Online publication date: 5-Sep-2011
  • (2011)Discrete wavelet transform-based time series analysis and miningACM Computing Surveys10.1145/1883612.188361343:2(1-37)Online publication date: 4-Feb-2011
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media