Robust Learning of Automatic Classes of Languages

This paper adapts the paradigm of robust learning, originally defined in the inductive inference literature for classes of recursive functions, to learning languages from positive data, and investigates it in that setting. Robustness is a desirable property, as it captures a form of invariance of learnability under admissible transformations of the object of study. The classes of languages of interest are automatic, a formal concept that captures the notion of being recognisable by a finite automaton. A class of first-order definable operators, called translators, is introduced as natural transformations that preserve both the automaticity of the languages in a given class and the inclusion relations between languages in the class. For many learning criteria, we characterise the classes of languages all of whose translations are learnable under that criterion. The learning criteria are drawn from the literature on both explanatory learning from positive data and query learning, and include consistent and conservative learning, strong-monotonic learning, strong-monotonic consistent learning, finite learning, learning from subset queries, learning from superset queries, and learning from membership queries.
