Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Speeding Up HMM Decoding and Training by Exploiting Sequence Repetitions

Published: 01 May 2009 Publication History

Abstract

We present a method to speed up the dynamic program algorithms used for solving the HMM decoding and training problems for discrete time-independent HMMs. We discuss the application of our method to Viterbi’s decoding and training algorithms (IEEE Trans. Inform. Theory IT-13:260–269, 1967), as well as to the forward-backward and Baum-Welch (Inequalities 3:1–8, 1972) algorithms. Our approach is based on identifying repeated substrings in the observed input sequence. Initially, we show how to exploit repetitions of all sufficiently small substrings (this is similar to the Four Russians method). Then, we describe four algorithms based alternatively on run length encoding (RLE), Lempel-Ziv (LZ78) parsing, grammar-based compression (SLP), and byte pair encoding (BPE). Compared to Viterbi’s algorithm, we achieve speedups of Θ(log n) using the Four Russians method, $\Omega(\frac{r}{\log r})$using RLE, $\Omega(\frac{\log n}{k})$using LZ78, $\Omega(\frac{r}{k})$using SLP, and Ω(r) using BPE, where k is the number of hidden states, n is the length of the observed sequence and r is its compression ratio (under each compression scheme). Our experimental results demonstrate that our new algorithms are indeed faster in practice. We also discuss a parallel implementation of our algorithms.

Cited By

View all
  • (2022)Graph Compression for Adjacency-Matrix MultiplicationSN Computer Science10.1007/s42979-022-01084-23:3Online publication date: 21-Mar-2022
  • (2020)Impossibility results for grammar-compressed linear algebraProceedings of the 34th International Conference on Neural Information Processing Systems10.5555/3495724.3496463(8810-8823)Online publication date: 6-Dec-2020
  • (2019)Online Markov decodingProceedings of the 33rd International Conference on Neural Information Processing Systems10.5555/3454287.3454797(5680-5690)Online publication date: 8-Dec-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Algorithmica
Algorithmica  Volume 54, Issue 3
May 2009
201 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 01 May 2009

Author Tags

  1. Compression
  2. Dynamic programming
  3. HMM
  4. Viterbi

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Graph Compression for Adjacency-Matrix MultiplicationSN Computer Science10.1007/s42979-022-01084-23:3Online publication date: 21-Mar-2022
  • (2020)Impossibility results for grammar-compressed linear algebraProceedings of the 34th International Conference on Neural Information Processing Systems10.5555/3495724.3496463(8810-8823)Online publication date: 6-Dec-2020
  • (2019)Online Markov decodingProceedings of the 33rd International Conference on Neural Information Processing Systems10.5555/3454287.3454797(5680-5690)Online publication date: 8-Dec-2019
  • (2017)Improving Viterbi is hardProceedings of the 34th International Conference on Machine Learning - Volume 7010.5555/3305381.3305414(311-321)Online publication date: 6-Aug-2017
  • (2017)Accelerating Viterbi algorithm on graphics processing unitsComputing10.1007/s00607-017-0557-699:11(1105-1123)Online publication date: 1-Nov-2017
  • (2016)Decoding Hidden Markov Models faster than viterbi via online matrix-vector (max, +)-multiplicationProceedings of the Thirtieth AAAI Conference on Artificial Intelligence10.5555/3016100.3016106(1484-1490)Online publication date: 12-Feb-2016
  • (2013)Computing the discrete Fréchet distance in subquadratic timeProceedings of the twenty-fourth annual ACM-SIAM symposium on Discrete algorithms10.5555/2627817.2627829(156-167)Online publication date: 6-Jan-2013
  • (2011)Speeding up Bayesian HMM by the four Russians methodProceedings of the 11th international conference on Algorithms in bioinformatics10.5555/2039945.2039962(188-200)Online publication date: 5-Sep-2011
  • (2010)Grammar-based compression in a streaming modelProceedings of the 4th international conference on Language and Automata Theory and Applications10.1007/978-3-642-13089-2_23(273-284)Online publication date: 24-May-2010

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media