
Learning regular languages using RFSAs

Published: 17 February 2004

Abstract

Residual languages are natural and important components of regular languages, and several grammatical inference algorithms rely on this notion. To identify a given target language L, classical inference algorithms try to identify words that define identical residual languages of L. Here, we study whether it is worthwhile to perform a finer analysis by identifying inclusion relations between the residual languages of L. We consider the class of Residual Finite State Automata (RFSAs). An RFSA A is a nondeterministic automaton whose states correspond to residual languages of the language L_A it recognizes. The inclusion relations between residual languages of L_A can be naturally materialized on A. We prove that the class of RFSAs is not polynomially characterizable. We report experiments showing that when a regular language is drawn at random using a nondeterministic representation, the number of inclusion relations between its residual languages is very large. Moreover, its minimal RFSA representation is much smaller than its minimal DFA representation. Finally, we design a new learning algorithm, DeLeTe2, based on the search for inclusion relations between the residual languages of the target language. We give sufficient conditions for the identifiability of the target language. We experimentally compare the performance of DeLeTe2 with that of classical inference algorithms.
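
To make the notion of inclusion between residual languages concrete, the following is a minimal sketch and not the DeLeTe2 algorithm. It assumes a small hand-built complete DFA over {a, b} that accepts the words whose second-to-last letter is a; the names ALPHABET, DELTA, ACCEPTING, STATES, residual_included and strict_inclusions are all hypothetical and chosen only for illustration. The residual of L by a word u is the set of words v such that uv is in L, and in a DFA it is the language accepted from the state reached by u. Inclusion between two such residuals can then be decided by exploring pairs of states.

```python
from collections import deque
from itertools import product

# Hypothetical example: a complete DFA over {a, b} for the words whose
# second-to-last letter is 'a'. The state reached by a word u stands for
# the residual language of u; state 0 is the initial state (residual L).
ALPHABET = "ab"
DELTA = {
    (0, "a"): 1, (0, "b"): 0,
    (1, "a"): 3, (1, "b"): 2,
    (2, "a"): 1, (2, "b"): 0,
    (3, "a"): 3, (3, "b"): 2,
}
ACCEPTING = {2, 3}
STATES = [0, 1, 2, 3]


def residual_included(p, q):
    """Return True if the language accepted from p is a subset of the one from q."""
    seen = {(p, q)}
    queue = deque([(p, q)])
    while queue:
        x, y = queue.popleft()
        # A word leading to (accepting, non-accepting) belongs to the
        # residual of p but not to the residual of q.
        if x in ACCEPTING and y not in ACCEPTING:
            return False
        for letter in ALPHABET:
            nxt = (DELTA[(x, letter)], DELTA[(y, letter)])
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return True


def strict_inclusions():
    """List the pairs (p, q) whose residuals are strictly included, p in q."""
    return sorted(
        (p, q)
        for p, q in product(STATES, repeat=2)
        if p != q and residual_included(p, q) and not residual_included(q, p)
    )


if __name__ == "__main__":
    print(strict_inclusions())
```

On this toy automaton the four residual languages are pairwise distinct, so an approach that only looks for identical residuals has nothing to merge, yet several of them are related by strict inclusion. The pair exploration visits at most the square of the number of states, which keeps the check cheap for small automata.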

Published In

Theoretical Computer Science  Volume 313, Issue 2
Special issue: Algorithmic learning theory
17 February 2004
138 pages

Publisher

Elsevier Science Publishers Ltd.

United Kingdom

Author Tags

  1. finite state automata
  2. grammatical inference
  3. machine learning
  4. regular languages

Qualifiers

  • Article
