Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1835804.1835941acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Online discovery and maintenance of time series motifs

Published: 25 July 2010 Publication History
  • Get Citation Alerts
  • Abstract

    The detection of repeated subsequences, time series motifs, is a problem which has been shown to have great utility for several higher-level data mining algorithms, including classification, clustering, segmentation, forecasting, and rule discovery. In recent years there has been significant research effort spent on efficiently discovering these motifs in static offline databases. However, for many domains, the inherent streaming nature of time series demands online discovery and maintenance of time series motifs. In this paper, we develop the first online motif discovery algorithm which monitors and maintains motifs exactly in real time over the most recent history of a stream. Our algorithm has a worst-case update time which is linear to the window size and is extendible to maintain more complex pattern structures. In contrast, the current offline algorithms either need significant update time or require very costly pre-processing steps which online algorithms simply cannot afford.
    Our core ideas allow useful extensions of our algorithm to deal with arbitrary data rates and discovering multidimensional motifs. We demonstrate the utility of our algorithms with a variety of case studies in the domains of robotics, acoustic monitoring and online compression.

    Supplementary Material

    JPG File (kdd2010_mueen_odmt_01.jpg)
    MOV File (kdd2010_mueen_odmt_01.mov)

    References

    [1]
    Agrawal, R., Faloutsos, C. and Swami, A.N. Efficient Similarity Search in Sequence Databases. FODO 1993: 69--84.
    [2]
    Beaudoin, P., Panne, M., Poulin, P. and Coros, S. Motion-Motif Graphs. ACM/EG Symposium on Computer Animation 2008.
    [3]
    Bespamyatnikh, S. N. An Optimal Algorithm for Closest Pair Maintenance. ACM SCG '95, 152--161.
    [4]
    Bulut, A. and Singh, A.: SWAT: Hierarchical Stream Summarization in Large Networks. In Proceedings of the ICDE (2003).
    [5]
    Cardinal, J. and Eppstein, D. Lazy Algorithms for Dynamic Closest Pair with Arbitrary Distance Measures. ALENEX/ANALC 2004.
    [6]
    Chiu, B., Keogh, E. and Lonardi, S. Probabilistic Discovery of Time Series Motifs. ACM SIGKDD 2003. pp 493--498.
    [7]
    Cummins, M. and Newman, P. FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance. The International Journal of Robotics Research, 27(6), 647--665, 2008.
    [8]
    Datar, M., Gionis, A., Indyk, P., and Motwani, R. Maintaining Stream Statistics over Sliding Windows. SIAM J. Comput. 31, 6 (Jun. 2002), 1794--1813.
    [9]
    Dawson, D. K. and Efford, M. G. Bird Population Density Estimated from Acoustic Signals. Journal of Applied Ecology. Volume 46 Issue 6, Pages 1201--1209.
    [10]
    Ding, H., Trajcevski, G., Scheuermann, P., Wang, X. and Keogh, E. Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures. VLDB 2008.
    [11]
    Dohnal, V., Gennaro C. and Zezula, P. Similarity Join in Metric Spaces Using eD-Index. Database and Expert Systems Applications, Volume 2736, pp. 484--493, 2003.
    [12]
    Eppstein, D. Fast Hierarchical Clustering and Other Applications of Dynamic Closest Pairs. ACM Journal of Experimental Algorithmics 5:1 (2000).
    [13]
    Fuchs, E., Gruber, T., Nitschke, J. and Sick, B. On-line Motif Detection in Time Series with SwiftMotif. In: Pattern Recognition 42(11):3015--3031, 2009.
    [14]
    Keogh, E., Chu, S., Hart, D. and Pazzani, M. An Online Algorithm for Segmenting Time Series. ICDM, pp. 289--296, 2001.
    [15]
    Lazaridis, I. and Mehrotra, S. Capturing Sensor-Generated Time Series with Quality Guarantees. ICDE 2003.
    [16]
    Lin, J., Keogh, E., Lonardi, S. and Patel, P. Finding Motifs in Time Series, Workshop on Temporal Data Mining (KDD'02), 2002.
    [17]
    Mueen, A., Keogh, E., Zhu, Q., Cash, S. and Westover, B. Exact Discovery of Time Series Motif. SDM 2009.
    [18]
    N. Tatbul, N., Çetintemel, U., Zdonik, S., Cherniack, M. and Stonebraker. M. Load Shedding in a Data Stream Manager. VLDB 2003, pp. 309--320.
    [19]
    Nanopoulos A., Theodoridis Y. and Manolopoulos, Y. C2P: Clustering Based on Closest Pairs. VLDB, pp. 331--340, 2001.
    [20]
    Odlyzko, A.M. and Rains, E.M. On Longest Increasing Subsequences in Random Permutation, Analysis, Geometry, Number Theory: the Mathematics of Leon Ehrenpreis.439--451, Contemp. Math., 251, Amer. Math. Soc., Providence, RI, 2000.
    [21]
    Ogras, Y. and Ferhatosmanoglu, H. Online Summarization of Dynamic Time Series Data. The VLDB Journal, 15(1):84--98, 2006.
    [22]
    Palpanas, T., Vlachos, M., Keogh, E., Gunopulos, D. and Truppel, W. Online Amnesic Approximation of Streaming Time Series. In ICDE 2004.
    [23]
    Patel, P., Keogh, E., Lin, J. and Lonardi, S. Mining Motifs in Massive Time Series Databases. ICDM 2002.
    [24]
    Patnaik, D., Marwah, M., Sharma, R.K. and Ramakrishnan, N. Sustainable Operation and Management of Data Center Chillers using Temporal Data Mining. KDD 2009: 1305--1314.
    [25]
    Penteriani, V. Variation in the Function of Eagle Owl Vocal Behaviour: Territorial Defence and Intra-Pair Communication? Ethol. Ecol. Evol. 14: 275--281.
    [26]
    Trifa, V.M., Girod, L., Collier, T., Blumstein, D.T. and Taylor, C.E. Automated Wildlife Monitoring Using Self-Configuring Sensor Networks Deployed in Natural Habitats. AROB 2007.
    [27]
    Vahdatpour, A., Amini, N. and Sarrafzadeh, M. Towards Unsupervised Activity Discovery using Multi Dimensional Motif Detection in Time Series. 21st International Joint Conference on Artificial Intelligence (IJCAI) 2009, Pasadena, California.
    [28]
    Weber R., Schek, H-J. and Blott, S. A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces. VLDB, pp. 194--205, 1998.
    [29]
    Wurst, M., Morik, K. and Mierswa, I. Localized Alternative Cluster Ensembles for Collaborative Structuring. ECML 2006. pp. 485--496.
    [30]
    Yankov, D., Keogh, E., Medina, J., Chiu, B. and Zordan V. Detecting Motifs under Uniform Scaling. SIGKDD 2007.
    [31]
    Supporting webpage containing Data, Code, Videos, Excel sheet and Presentation slides. Link: http://www.cs.ucr.edu/~mueen/OnlineMotif/index.html

    Cited By

    View all
    • (2024)MASS: distance profile of a query over a time seriesData Mining and Knowledge Discovery10.1007/s10618-024-01005-238:3(1466-1492)Online publication date: 1-May-2024
    • (2023)Discovering time series motifs of all lengths using dynamic time warpingWorld Wide Web10.1007/s11280-023-01207-626:6(3815-3836)Online publication date: 20-Sep-2023
    • (2022)Cross-Correlation for Streaming Seismic Time Series to Detect Events using Parallel and Real-time Methods2022 IEEE 13th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON)10.1109/UEMCON54665.2022.9965662(0008-0013)Online publication date: 26-Oct-2022
    • Show More Cited By

    Index Terms

    1. Online discovery and maintenance of time series motifs

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
      July 2010
      1240 pages
      ISBN:9781450300551
      DOI:10.1145/1835804
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 25 July 2010

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. motifs
      2. online algorithms
      3. time series

      Qualifiers

      • Research-article

      Conference

      KDD '10
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

      Upcoming Conference

      KDD '24

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)19
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 12 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)MASS: distance profile of a query over a time seriesData Mining and Knowledge Discovery10.1007/s10618-024-01005-238:3(1466-1492)Online publication date: 1-May-2024
      • (2023)Discovering time series motifs of all lengths using dynamic time warpingWorld Wide Web10.1007/s11280-023-01207-626:6(3815-3836)Online publication date: 20-Sep-2023
      • (2022)Cross-Correlation for Streaming Seismic Time Series to Detect Events using Parallel and Real-time Methods2022 IEEE 13th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON)10.1109/UEMCON54665.2022.9965662(0008-0013)Online publication date: 26-Oct-2022
      • (2022)Efficient Range and kNN Twin Subsequence Search in Time SeriesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3167257(1-1)Online publication date: 2022
      • (2022)Learning Evolvable Time-series Shapelets2022 IEEE 38th International Conference on Data Engineering (ICDE)10.1109/ICDE53745.2022.00064(793-805)Online publication date: May-2022
      • (2022)Normalization in Motif DiscoveryMachine Learning, Optimization, and Data Science10.1007/978-3-031-25891-6_24(314-325)Online publication date: 19-Sep-2022
      • (2021)A GPU Acceleration Framework for Motif and Discord Based Pattern MiningIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2021.305576532:8(1987-2004)Online publication date: 1-Aug-2021
      • (2020)Analysis of Recurrent Neural Network and PredictionsSymmetry10.3390/sym1204061512:4(615)Online publication date: 13-Apr-2020
      • (2020)An Enhanced Time Series Motif Discovery Using Approximated Matrix ProfileProceedings of the 2020 2nd International Conference on Image Processing and Machine Vision10.1145/3421558.3421586(180-189)Online publication date: 5-Aug-2020
      • (2020)Cut-n-RevealACM Transactions on Intelligent Systems and Technology10.1145/339411811:5(1-26)Online publication date: 28-Jul-2020
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media