Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases

Keogh, Eamonn; Chakrabarti, Kaushik; Pazzani, Michael; Mehrotra, Sharad

doi:10.1007/PL00011669

Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases

Published: August 2001

Volume 3, pages 263–286, (2001)
Cite this article

Knowledge and Information Systems Aims and scope Submit manuscript

Eamonn Keogh¹,
Kaushik Chakrabarti²,
Michael Pazzani¹ &
…
Sharad Mehrotra¹

5404 Accesses
968 Citations
6 Altmetric
Explore all metrics

Abstract.

The problem of similarity search in large time series databases has attracted much attention recently. It is a non-trivial problem because of the inherent high dimensionality of the data. The most promising solutions involve first performing dimensionality reduction on the data, and then indexing the reduced data with a spatial access method. Three major dimensionality reduction techniques have been proposed: Singular Value Decomposition (SVD), the Discrete Fourier transform (DFT), and more recently the Discrete Wavelet Transform (DWT). In this work we introduce a new dimensionality reduction technique which we call Piecewise Aggregate Approximation (PAA). We theoretically and empirically compare it to the other techniques and demonstrate its superiority. In addition to being competitive with or faster than the other methods, our approach has numerous other advantages. It is simple to understand and to implement, it allows more flexible distance measures, including weighted Euclidean queries, and the index can be built in linear time.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Speeding up dynamic time warping distance for sparse time series data

Article 28 October 2017