A Parameter-Free Learning Automaton Scheme

For a learning automaton (LA), properly configuring the learning parameters, which are crucial to the automaton's performance, is difficult because they must be tuned manually before real applications. Tuning them to ensure stable and reliable performance in stochastic environments can be time-consuming and costly in interactions. This is an especially severe limitation for LA-based applications in which interactions with the environment are expensive. In this paper, we propose a parameter-free learning automaton scheme that avoids parameter tuning by means of a Bayesian inference method. In contrast to existing schemes, whose parameters must be carefully tuned to the environment, the performance of this scheme is not sensitive to the external environment, because a single set of parameters can be applied consistently across various environments. This dramatically reduces the difficulty of applying a learning automaton to an unknown stochastic environment. A rigorous proof of ε-optimality for the proposed scheme is provided, and numerical experiments are carried out on benchmark environments to verify its effectiveness. The results show that, without any parameter tuning cost, the proposed parameter-free learning automaton (PFLA) can achieve performance competitive with well-tuned schemes and can outperform untuned schemes in consistency of performance.
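To make concrete how Bayesian inference can stand in for a hand-tuned learning rate, the following is a minimal sketch of a generic Beta-posterior automaton that selects actions by posterior sampling (Thompson-sampling style). It is not the PFLA scheme itself, whose details the abstract does not give. The class name `BetaLearningAutomaton`, the helper `run`, the uniform Beta(1, 1) prior, and the Bernoulli reward environment are all assumptions made for illustration.

```python
import random


class BetaLearningAutomaton:
    """Illustrative Bayesian automaton (hypothetical, not the paper's PFLA).

    Keeps one Beta(alpha, beta) posterior per action over that action's
    unknown reward probability. There is no learning rate to tune.
    """

    def __init__(self, n_actions, seed=None):
        # Assumed uniform prior Beta(1, 1) for every action.
        self.alpha = [1] * n_actions
        self.beta = [1] * n_actions
        self.rng = random.Random(seed)

    def choose(self):
        # Draw one sample from each posterior and pick the largest draw.
        samples = [
            self.rng.betavariate(a, b) for a, b in zip(self.alpha, self.beta)
        ]
        return max(range(len(samples)), key=samples.__getitem__)

    def update(self, action, rewarded):
        # Conjugate update: a reward increments alpha, a penalty increments beta.
        if rewarded:
            self.alpha[action] += 1
        else:
            self.beta[action] += 1

    def posterior_means(self):
        return [a / (a + b) for a, b in zip(self.alpha, self.beta)]


def run(reward_probs, steps, seed=0):
    """Interact with an assumed stationary Bernoulli environment."""
    automaton = BetaLearningAutomaton(len(reward_probs), seed=seed)
    env_rng = random.Random(seed + 1)
    for _ in range(steps):
        action = automaton.choose()
        rewarded = env_rng.random() < reward_probs[action]
        automaton.update(action, rewarded)
    return automaton


if __name__ == "__main__":
    trained = run([0.2, 0.8], steps=2000)
    print(trained.posterior_means())
```

The same code runs unchanged on environments with different reward probabilities, which is the sense in which such a scheme needs no per-environment tuning. Whether it matches the convergence behavior reported for PFLA is not something this sketch establishes.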
