Abstract
Statistical learning problems in many fields involve sequential data. This paper formalizes the principal learning tasks and describes the methods that have been developed within the machine learning research community for addressing these problems. These methods include sliding window methods, recurrent sliding windows, hidden Markov models, conditional random fields, and graph transformer networks. The paper also discusses some open research issues.
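Of the methods surveyed, the sliding window approach is the simplest: it reduces sequence labeling to ordinary supervised classification by predicting each output element from a fixed-width window of inputs centered on it. A minimal sketch of the windowing step (the half-width, padding symbol, and function name here are illustrative assumptions, not taken from the paper):

```python
def make_windows(xs, half_width=1, pad="_"):
    """Turn a sequence xs into one fixed-length feature window per position.

    Each window covers positions t - half_width .. t + half_width,
    with a padding symbol substituted beyond the sequence boundaries.
    """
    padded = [pad] * half_width + list(xs) + [pad] * half_width
    return [tuple(padded[i:i + 2 * half_width + 1]) for i in range(len(xs))]


# Example: a letter-to-phoneme style task on the word "data".
# Each window becomes one training example, paired with the label y_t.
windows = make_windows("data", half_width=1)
# windows[0] == ('_', 'd', 'a'); windows[3] == ('t', 'a', '_')
```

Any standard classifier (decision tree, neural network, support vector machine) can then be trained on the resulting (window, label) pairs; the recurrent sliding window variant additionally feeds earlier predictions back in as inputs to later windows.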
References
G. Bakiri and T. G. Dietterich. Achieving high-accuracy text-to-speech with machine learning. In R. I. Damper, editor, Data Mining Techniques in Speech Synthesis. Chapman and Hall, New York, NY, 2002.
Y. Bengio and P. Frasconi. Input-output HMMs for sequence processing. IEEE Transactions on Neural Networks, 7(5):1231–1249, September 1996.
L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone. Classification and Regression Trees. Wadsworth International Group, 1984.
C. Chow and C. Liu. Approximating discrete probability distributions with dependence trees. IEEE Transactions on Information Theory, 14:462–467, 1968.
N. Cristianini and J. Shawe-Taylor. An Introduction to Support Vector Machines. Cambridge University Press, 2000.
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. J. Royal Stat. Soc., B39:1–38, 1977.
J. L. Elman. Finding structure in time. Cognitive Science, 14:179–211, 1990.
T. Fawcett and F. Provost. Adaptive fraud detection. Knowledge Discovery and Data Mining, 1:291–316, 1997.
C. L. Giles, G. M. Kuhn, and R. J. Williams. Special issue on dynamic recurrent neural networks. IEEE Transactions on Neural Networks, 5(2), 1994.
A. E. Hoerl and R. W. Kennard. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics, 12:55–67, 1970.
M. I. Jordan. Serial order: A parallel distributed processing approach. ICS Rep. 8604, Inst. for Cog. Sci., UC San Diego, 1986.
Ron Kohavi and George H. John. Wrappers for feature subset selection. Artificial Intelligence, 97(1–2):273–324, 1997.
Daphne Koller and Mehran Sahami. Toward optimal feature selection. In Proc. 13th Int. Conf. Machine Learning, pages 284–292. Morgan Kaufmann, 1996.
Igor Kononenko, Edvard Šimec, and Marko Robnik-Šikonja. Overcoming the myopia of inductive learning algorithms with RELIEFF. Applied Intelligence, 7(1):39–55, 1997.
John Lafferty, Andrew McCallum, and Fernando Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Int. Conf. Machine Learning, San Francisco, CA, 2001. Morgan Kaufmann.
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, 1998.
Oded Maron and Andrew W. Moore. Hoeffding races: Accelerating model selection search for classification and function approximation. In Adv. Neural Inf. Proc. Sys. 6, pages 59–66. Morgan Kaufmann, 1994.
Andrew McCallum, Dayne Freitag, and Fernando Pereira. Maximum entropy Markov models for information extraction and segmentation. In Int. Conf. Machine Learning, pages 591–598. Morgan Kaufmann, 2000.
Thomas M. Mitchell. Machine Learning. McGraw-Hill, New York, 1997.
N. Qian and T. J. Sejnowski. Predicting the secondary structure of globular proteins using neural network models. J. Molecular Biology, 202:865–884, 1988.
J. R. Quinlan. C4.5: Programs for machine learning. Morgan Kaufmann, 1993.
D. E. Rumelhart, G. E. Hinton, and R. J. Williams. Learning internal representations by error propagation. In Parallel Distributed Processing: Explorations in the Microstructure of Cognition, chapter 8, pages 318–362. MIT Press, 1986.
T. J. Sejnowski and C. R. Rosenberg. Parallel networks that learn to pronounce English text. Complex Systems, 1(1):145–168, February 1987.
A. S. Weigend, D. E. Rumelhart, and B. A. Huberman. Generalization by weight-elimination with application to forecasting. In Adv. Neural Inf. Proc. Sys. 3, pages 875–882. Morgan Kaufmann, 1991.
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
Cite this paper
Dietterich, T.G. (2002). Machine Learning for Sequential Data: A Review. In: Caelli, T., Amin, A., Duin, R.P.W., de Ridder, D., Kamel, M. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR/SPR 2002. Lecture Notes in Computer Science, vol 2396. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-70659-3_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44011-6
Online ISBN: 978-3-540-70659-5