Gl-learning: an optimized framework for grammatical inference

In this paper, we present a new open-source software library, Gl-learning, for grammatical inference. The rise of new application scenarios in recent years has required optimized methods to address knowledge extraction from huge amounts of data and to model highly complex systems. Our library implements the main state-of-the-art algorithms in the grammatical inference field (RPNI, EDSM, L*), redesigned through the OpenMP library for a parallel execution that drastically decreases execution times. To our best knowledge, it is also the first comprehensive library including a noise tolerance learning algorithm, such as Blue*, that significantly broadens the range of the potential application scenarios for grammar models. The modular design of our C++ library makes it an efficient and extensible framework for the design of further novel algorithms.

[1]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[2]  Marco Morana,et al.  A framework for real-time Twitter data analysis , 2016, Comput. Commun..

[3]  Colin de la Higuera,et al.  Grammatical Inference: Learning Automata and Grammars , 2010 .

[4]  Hardi Hungar,et al.  Model Generation by Moderated Regular Extrapolation , 2002, FASE.

[5]  Giuseppe Lo Re,et al.  Detecting faulty wireless sensor nodes through Stochastic classification , 2011, 2011 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops).

[6]  L. Dagum,et al.  OpenMP: an industry standard API for shared-memory programming , 1998 .

[7]  Jerome A. Feldman,et al.  On the Synthesis of Finite-State Machines from Samples of Their Behavior , 1972, IEEE Transactions on Computers.

[8]  Pedro García,et al.  IDENTIFYING REGULAR LANGUAGES IN POLYNOMIAL TIME , 1993 .

[9]  Jonathan J. Oliver,et al.  Efficient Induction of Finite State Automata , 1997, UAI.

[10]  Barak A. Pearlmutter,et al.  Results of the Abbadingo One DFA Learning Competition and a New Evidence-Driven State Merging Algorithm , 1998, ICGI.

[11]  Claudia Eckert,et al.  Grammatical Inference Algorithms in MATLAB , 2010, ICGI.

[12]  D. Angluin Queries and Concept Learning , 1988 .

[13]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[14]  Marco Morana,et al.  Nds Lab -networking and Distributed Systems Mimicking Biological Mechanisms for Sensory Information Fusion Mimicking Biological Mechanisms for Sensory Information Fusion , 2022 .

[15]  Dana Angluin,et al.  Learning Regular Sets from Queries and Counterexamples , 1987, Inf. Comput..

[16]  Arlindo L. Oliveira,et al.  Inference of regular languages using state merging algorithms with search , 2005, Pattern Recognit..

[17]  Giuseppe Lo Re,et al.  User activity recognition for energy saving in smart homes , 2013, 2013 Sustainable Internet and ICT for Sustainability (SustainIT).

[18]  Benedikt Bollig,et al.  libalf: The Automata Learning Framework , 2010, CAV.

[19]  Tim Oates,et al.  Motif discovery in spatial trajectories using grammar inference , 2013, CIKM.

[20]  Ronald L. Rivest,et al.  Inference of finite automata using homing sequences , 1989, STOC '89.

[21]  Ah Chung Tsoi,et al.  Noisy Time Series Prediction using Recurrent Neural Networks and Grammatical Inference , 2001, Machine Learning.

[22]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[23]  Harald Raffelt,et al.  LearnLib: a library for automata learning and experimentation , 2005, FMICS '05.

[24]  Giuseppe Lo Re,et al.  A machine learning approach for user localization exploiting connectivity data , 2016, Eng. Appl. Artif. Intell..