Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/502512.502571acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Infominer: mining surprising periodic patterns

Published: 26 August 2001 Publication History

Abstract

In this paper, we focus on mining surprising periodic patterns in a sequence of events. In many applications, e.g., computational biology, an infrequent pattern is still considered very significant if its actual occurrence frequency exceeds the prior expectation by a large margin. The traditional metric, such as support, is not necessarily the ideal model to measure this kind of surprising patterns because it treats all patterns equally in the sense that every occurrence carries the same weight towards the assessment of the significance of a pattern regardless of the probability of occurrence. A more suitable measurement, information, is introduced to naturally value the degree of surprise of each occurrence of a pattern as a continuous and monotonically decreasing function of its probability of occurrence. This would allow patterns with vastly different occurrence probabilities to be handled seamlessly. As the accumulated degree of surprise of all repetitions of a pattern, the concept of information gain is proposed to measure the overall degree of surprise of the pattern within a data sequence. The bounded information gain property is identified to tackle the predicament caused by the violation of the downward closure property by the information gain measure and in turn provides an efficient solution to this problem. Empirical tests demonstrate the efficiency and the usefulness of the proposed model.

References

[1]
G. Berger and A. Tuzhilin. Discovering unexpected patterns in temporal data using temporal logic. Temporal Databases - Research and Practice, Lecture Notes on Computer Sciences, (1399) 281-309, 1998.
[2]
R. Blahut. Principles and Practice of Information Theory, Addison-Wesley Publishing Company, 1987.
[3]
S. Brin, R. Motwani, C. Silverstein. Beyond market baskets: generalizing association rules to correlations. Proc. ACM SIGMOD Conf. on Management of Data, 265-276, 1997.
[4]
J. Han, G. Dong, and Y. Yin. Efficient mining partial periodic patterns in time series database. Proc. Int. Conf. on Data Engineering, 106-115, 1999.
[5]
M. Klemetinen, H. Mannila, P. Ronkainen, H. Toivonen, and A. Verkamo. Finding interesting rules from large sets of discovered association rules. Proe. CIKM, 1994.
[6]
B. Liu, W. Hsu, and Y. Ma. Mining association Rules with multiple minimum supports. Proc. ACM SIGKDD, 337-341, 1999.
[7]
S. Ma and J. Hellerstein. Mining partially periodic event patterns with unknown periods. Proe. Int. Conf. on Data Engineering, 205-214, 2001.
[8]
H. Mannila, D. Pavlov, and P. Smyth. Prediction with local patterns using cross-entropy. Proe. ACId SIGKDD, 357-361, 1999.
[9]
T. Oates, M. D. Schmill, P. R. Cohen. Efficient mining of statistical dependencies. Proc. 16th Int. Joint Conf. on Artificial Intelligence, 794-799, 1999.
[10]
B. Padmanabhan and A. Tuzhilin. Small is beautiful: discovering the minimal set of unexpected patterns. Proc. ACM KDD, 54-63, 2000.
[11]
A. Silberschatz and A. Tuzhilin. What makes patterns interesting in knowledge discover systems. IEEE Transactions on Knowledge and Data Engineerin 9 (TKDE) vol. 8 no. 6, pp. 970-974, 1996.
[12]
K. Wang, Y. He, and J. Han. Mining frequent itemsets using support constraints. Proe. Int. Conf. on Very Large Data Bases, 2000.
[13]
J. Yang, W. Wang, and P. Yu. Mining asynchronous periodic patterns in time series data. Proe. ACM SIGKDD Int. Conf. on Knowled9 e Discovery and Data Mining (SIGKDD), pp. 275-279, 2000.
[14]
J. Yang, W. Wang, and P. Yu. InfoMiner: mining surprising periodic patterns. IBM Research Report, 2001.
[15]
M. J. Zaki. Generating non-redundant association rules. Proe. ACM SIGKDD, 34-43, 2000.

Cited By

View all
  • (2024)A Survey of Advanced Border Gateway Protocol Attack Detection TechniquesSensors10.3390/s2419641424:19(6414)Online publication date: 3-Oct-2024
  • (2024)A trajectory similarity computation method based on GAT-based transformer and CNN modelScientific Reports10.1038/s41598-024-67256-714:1Online publication date: 13-Jul-2024
  • (2023)GraphTS: Graph-represented time series for subsequence anomaly detectionPLOS ONE10.1371/journal.pone.029009218:8(e0290092)Online publication date: 16-Aug-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
August 2001
493 pages
ISBN:158113391X
DOI:10.1145/502512
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 August 2001

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

KDD01
Sponsor:

Acceptance Rates

KDD '01 Paper Acceptance Rate 31 of 237 submissions, 13%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)22
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Survey of Advanced Border Gateway Protocol Attack Detection TechniquesSensors10.3390/s2419641424:19(6414)Online publication date: 3-Oct-2024
  • (2024)A trajectory similarity computation method based on GAT-based transformer and CNN modelScientific Reports10.1038/s41598-024-67256-714:1Online publication date: 13-Jul-2024
  • (2023)GraphTS: Graph-represented time series for subsequence anomaly detectionPLOS ONE10.1371/journal.pone.029009218:8(e0290092)Online publication date: 16-Aug-2023
  • (2022)Interpretable Anomaly Detection in Event Sequences via Sequence Matching and Visual ComparisonIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2021.309358528:12(4531-4545)Online publication date: 1-Dec-2022
  • (2020)Parallel Mining of Partial Periodic Itemsets in Big DataTrends in Artificial Intelligence Theory and Applications. Artificial Intelligence Practices10.1007/978-3-030-55789-8_69(807-819)Online publication date: 22-Sep-2020
  • (2019)Efficient Mining Recurring Patterns of Inter-Transaction in Time SeriesJournal of Advanced Computational Intelligence and Intelligent Informatics10.20965/jaciii.2019.p040223:3(402-413)Online publication date: 20-May-2019
  • (2019)Detecting a Variety of Long-Term Stealthy User Behaviors on High Speed LinksIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2018.287331931:10(1912-1925)Online publication date: 1-Oct-2019
  • (2019)Efficient Mining of Event Periodicity in Data SeriesDatabase Systems for Advanced Applications10.1007/978-3-030-18576-3_8(124-139)Online publication date: 24-Apr-2019
  • (2018)Modeling Individual Cyclic Variation in Human BehaviorProceedings of the 2018 World Wide Web Conference10.1145/3178876.3186052(107-116)Online publication date: 10-Apr-2018
  • (2018)A Framework of Loose Travelling Companion Discovery from Human TrajectoriesIEEE Transactions on Mobile Computing10.1109/TMC.2018.281336917:11(2497-2511)Online publication date: 1-Nov-2018
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media