Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/1887568.1887591guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Lag patterns in time series databases

Published: 30 August 2010 Publication History

Abstract

Time series motif discovery is important as the discovered motifs generally form the primitives for many data mining tasks. In this work, we examine the problem of discovering groups of motifs from different time series that exhibit some lag relationships. We define a new class of pattern called lagPatterns that captures the invariant ordering among motifs. lagPatterns characterize localized associative pattern involving motifs derived from each entity and explicitly accounts for lag across multiple entities. We present an exact algorithm that makes use of the order line concept and the subsequence matching property of the normalized time series to find all motifs of various lengths. We also describe a method called LPMiner to discover lagPatterns efficiently. LPMiner utilizes inverted index and motif alignment technique to reduce the search space and improve the efficiency. A detailed empirical study on synthetic datasets shows the scalability of the proposed approach. We show the usefulness of lagPatterns discovered from a stock dataset by constructing stock portfolio that leads to a higher cumulative rate of return on investment.

References

[1]
Chiu, B., Keogh, E., Lonardi, S.: Probabilistic discovery of time series motifs. In: SIGKDD, pp. 493-498 (2003)
[2]
Cormen, T., Leiserson, E., Rivest, L., Stein, C.: Introduction to Algorithms. The MIT Press, Cambridge (2001)
[3]
Das, G., Lin, K., Mannila, H., Renganathan, G., Smyth, P.: Rule discovery from time series. In: SIGKDD, pp. 16-22 (1998)
[4]
Gsli, H., Samet, H.: Properties of embedding methods for similarity search in metric spaces. In: PAMI, pp. 530-549 (2003)
[5]
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2000)
[6]
Keogh, E., Kasetty, S.: On the need for time series data mining benchmarks: A survey and empirical demonstration. DMKD 7(4), 349-371 (2003)
[7]
Keogh, E., Lin, J.: Clustering of time-series subseq. is meaningless: implications for previous and future research. KIS 8(2), 154-177 (2005)
[8]
Lin, J., Keogh, E., Lonardi, S., Patel, P.: Finding motifs in time series. In: Temporal Data Mining (2002)
[9]
Loh, W., Kim, S., Whang, K.: A subsequence matching algorithm that supports normalization transform in time-series databases. DMKD, 5-28 (2004)
[10]
Minnen, D., Isbell, C.L., Essa, I., Starner, T.: Discovering multivariate motifs using subsequence density estimation and greedy mixture learning. In: AAAI (2007)
[11]
Mueen, A., Keogh, E., Bigdely-Shamlo, N.: A disk-aware algorithm for time series motif discovery. In: ICDM (2009)
[12]
Mueen, A., Keogh, E., Zhu, Q., Cash, S.: Exact discovery of time series motifs. In: SDM (2009)
[13]
Oates, T.: Peruse: An unsupervised algorithm for finding recurring patterns in time series. In: ICDM, pp. 330-337 (2002)
[14]
Vahdatpour, A., Amini, N., Sarrafzadeh, M.: Toward unsupervised activity discovery using multi-dimensional motif detection in time series. In: IJCAI (2009)
[15]
Wu, D., Fung, G.P.C., Yu, J.X., Liu, Z.: Mining multiple time series co-movements. In: Zhang, Y., Yu, G., Bertino, E., Xu, G. (eds.) APWeb 2008. LNCS, vol. 4976, pp. 572-583. Springer, Heidelberg (2008)
[16]
Yoshiki, T., Kazuhisa, I., Kuniaki, U.: Discovery of time-series motif from multi-dimensional data based on mdl principle. Machine Learning 58(2-3), 269-300 (2005)
[17]
Zhu, Y., Shasha, D.: Statstream: Statistical monitoring of thousands of data streams in real time. In: VLDB (2002)

Cited By

View all
  • (2015)AssemblerProceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining10.1145/2783258.2783394(1415-1424)Online publication date: 10-Aug-2015
  • (2013)Database research at the National University of SingaporeACM SIGMOD Record10.1145/2503792.250380342:2(46-51)Online publication date: 16-Jul-2013
  1. Lag patterns in time series databases

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    DEXA'10: Proceedings of the 21st international conference on Database and expert systems applications: Part II
    August 2010
    502 pages
    ISBN:3642152503
    • Editors:
    • Pablo García Bringas,
    • Abdelkader Hameurlain,
    • Gerald Quirchmayr

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Publication History

    Published: 30 August 2010

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 27 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2015)AssemblerProceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining10.1145/2783258.2783394(1415-1424)Online publication date: 10-Aug-2015
    • (2013)Database research at the National University of SingaporeACM SIGMOD Record10.1145/2503792.250380342:2(46-51)Online publication date: 16-Jul-2013

    View Options

    View options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media