Learning stochastic regular grammars by means of a state merging method

Carrasco, Rafael C.; Oncina, Jose

doi:10.1007/3-540-58473-0_144


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 862)

Included in the following conference series:

International Colloquium on Grammatical Inference


Abstract

We propose a new algorithm which allows for the identification of any stochastic deterministic regular language as well as the determination of the probabilities of the strings in the language. The algorithm builds the prefix tree acceptor from the sample set and systematically merges equivalent states. Experimentally, it proves very fast, and the time needed grows only linearly with the size of the sample set.
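
The two steps named in the abstract (building a prefix tree acceptor from the sample, then deciding which states are equivalent) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the chapter: the names `Node`, `build_pta`, `string_probability` and `frequencies_compatible` are hypothetical, and the Hoeffding-style bound with confidence parameter `alpha` is one common way to compare observed frequencies in state-merging methods, not necessarily the exact test the authors use. The merging loop itself is omitted.

```python
import math
from collections import defaultdict


class Node:
    """One state of a frequency prefix tree acceptor (hypothetical layout)."""

    def __init__(self):
        self.arrivals = 0                    # sample strings that reach this state
        self.endings = 0                     # sample strings that end in this state
        self.children = {}                   # symbol -> child Node
        self.transitions = defaultdict(int)  # symbol -> times the symbol was read here


def build_pta(sample):
    """Build the prefix tree acceptor, counting arrivals, endings and transitions."""
    root = Node()
    for s in sample:
        node = root
        node.arrivals += 1
        for sym in s:
            node.transitions[sym] += 1
            node = node.children.setdefault(sym, Node())
            node.arrivals += 1
        node.endings += 1
    return root


def string_probability(root, s):
    """Estimate P(s) as a product of relative frequencies along the path of s."""
    node, p = root, 1.0
    for sym in s:
        if sym not in node.children:
            return 0.0
        p *= node.transitions[sym] / node.arrivals
        node = node.children[sym]
    return p * node.endings / node.arrivals


def frequencies_compatible(f1, n1, f2, n2, alpha=0.05):
    """Assumed equivalence check: two observed frequencies f1/n1 and f2/n2 are
    treated as compatible when they differ by less than a Hoeffding-style bound."""
    if n1 == 0 or n2 == 0:
        return True
    bound = math.sqrt(0.5 * math.log(2.0 / alpha)) * (
        1.0 / math.sqrt(n1) + 1.0 / math.sqrt(n2)
    )
    return abs(f1 / n1 - f2 / n2) < bound


if __name__ == "__main__":
    tree = build_pta(["", "a", "a", "ab"])
    for w in ["", "a", "ab", "b"]:
        print(repr(w), string_probability(tree, w))
```

In a full state-merging procedure, a check such as `frequencies_compatible` would be applied to the ending frequencies and to each outgoing symbol of two candidate states, and recursively to their successors, before the states are merged. That outer loop depends on details the abstract does not give, so it is left out here.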

Work partially supported under grant TIC93-0633-C02-02 from CICYT (Programa Nacional de Tecnologías de la Información y de las Comunicaciones).




Author information

Authors and Affiliations

Departamento de Tecnología Informática y Computación, Universidad de Alicante, E-03071, Alicante
Rafael C. Carrasco & Jose Oncina


Editor information

Rafael C. Carrasco, Jose Oncina


About this paper

Cite this paper

Carrasco, R.C., Oncina, J. (1994). Learning stochastic regular grammars by means of a state merging method. In: Carrasco, R.C., Oncina, J. (eds) Grammatical Inference and Applications. ICGI 1994. Lecture Notes in Computer Science, vol 862. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58473-0_144


DOI: https://doi.org/10.1007/3-540-58473-0_144
Published: 04 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-58473-5
Online ISBN: 978-3-540-48985-6
eBook Packages: Springer Book Archive
