A Novel Strategy for Solving the Stochastic Point Location Problem Using a Hierarchical Searching Scheme

Stochastic point location (SPL) deals with the problem of a learning mechanism (LM) determining the optimal point on the line when the only input it receives are stochastic signals about the direction in which it should move. One can differentiate the SPL from the traditional class of optimization problems by the fact that the former considers the case where the directional information, for example, as inferred from an Oracle (which possibly computes the derivatives), suffices to achieve the optimization-without actually explicitly computing any derivatives. The SPL can be described in terms of a LM (algorithm) attempting to locate a point on a line. The LM interacts with a random environment which essentially informs it, possibly erroneously, if the unknown parameter is on the left or the right of a given point. Given a current estimate of the optimal solution, all the reported solutions to this problem effectively move along the line to yield updated estimates which are in the neighborhood of the current solution.1 This paper proposes a dramatically distinct strategy, namely, that of partitioning the line in a hierarchical tree-like manner, and of moving to relatively distant points, as characterized by those along the path of the tree. We are thus attempting to merge the rich fields of stochastic optimization and data structures. Indeed, as in the original discretized solution to the SPL, in one sense, our solution utilizes the concept of discretization and operates a uni-dimensional controlled random walk (RW) in the discretized space, to locate the unknown parameter. However, by moving to nonneighbor points in the space, our newly proposed hierarchical stochastic searching on the line (HSSL) solution performs such a controlled RW on the discretized space structured on a superimposed binary tree. We demonstrate that the HSSL solution is orders of magnitude faster than the original SPL solution proposed by Oommen. By a rigorous analysis, the HSSL is shown to be optimal if the effectiveness (or credibility) of the environment, given by p, is greater than the golden ratio conjugate. The solution has been both analytically solved and simulated, and the results obtained are extremely fascinating, as this is the first reported use of time reversibility in the analysis of stochastic learning. The learning automata extensions of the scheme are currently being investigated.

[1]  B. John Oommen,et al.  Solving Stochastic Nonlinear Resource Allocation Problems Using a Hierarchy of Twofold Resource Allocation Automata , 2010, IEEE Transactions on Computers.

[2]  Daniel Graupe,et al.  Principles of Artificial Neural Networks - 2nd Edition , 2007, Advanced Series in Circuits and Systems.

[3]  S. Lakshmivarahan,et al.  Learning Algorithms Theory and Applications , 1981 .

[4]  B.J. Oommen,et al.  Parameter learning from stochastic teachers and stochastic compulsive liars , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[5]  Jim R. Parker,et al.  Algorithms for image processing and computer vision , 1996 .

[6]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[7]  B. John Oommen,et al.  A Solution to the Stochastic Point Location Problem in Metalevel Nonstationary Environments , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[8]  Mohammad Reza Meybodi,et al.  Finding minimum weight connected dominating set in stochastic graph based on learning automata , 2012, Inf. Sci..

[9]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[10]  M. Thathachar,et al.  A Hierarchical System of Learning Automata , 1981, IEEE Transactions on Systems, Man, and Cybernetics.

[11]  Kaddour Najim,et al.  Learning automata and stochastic optimization , 1997 .

[12]  M. Livio The Golden Ratio: The Story of Phi, the World's Most Astonishing Number , 2002 .

[13]  Ricardo Baeza-Yates,et al.  Searching with uncertainty , 1988 .

[14]  R. Phillips,et al.  Pricing and Revenue Optimization , 2005 .

[15]  B. John Oommen,et al.  The asymptotic optimality of discretized linear reward-inaction learning automata , 1984, IEEE Transactions on Systems, Man, and Cybernetics.

[16]  Anany Levitin Introduction to the design & analysis of algorithms , 2007 .

[17]  B. John Oommen,et al.  Learning automata-based solutions to the optimal web polling problem modelled as a nonlinear fractional knapsack problem , 2011, Eng. Appl. Artif. Intell..

[18]  Yoshio Mogami,et al.  Learning behaviors of the hierarchical structure stochastic automata : Operating in the nonstationary multiteacher environment , 2005 .

[19]  B. John Oommen,et al.  Generalized pursuit learning schemes: new families of continuous and discretized learning automata , 2002, IEEE Trans. Syst. Man Cybern. Part B.

[20]  Theodosios Pavlidis,et al.  Structural pattern recognition , 1977 .

[21]  B. John Oommen,et al.  Discretized pursuit learning automata , 1990, IEEE Trans. Syst. Man Cybern..

[22]  Pradeep K. Atrey,et al.  A framework for human-centered provisioning of ambient media services , 2009, Multimedia Tools and Applications.

[23]  B. John Oommen,et al.  Stochastic searching on the line and its applications to parameter learning in nonlinear optimization , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[24]  B. John Oommen,et al.  A Novel Multidimensional Scaling Technique for Mapping Word-Of-Mouth Discussions , 2009 .

[25]  N. Flournoy,et al.  Up-and-Down Designs for Selecting the Dose with Maximum Success Probability , 2008 .

[26]  Sorour Afshar,et al.  Presenting a new multiclass classifier based on learning automata , 2013, Neurocomputing.

[27]  Mohammad S. Obaidat,et al.  Guest editorial learning automata: theory, paradigms, and applications , 2002, IEEE Trans. Syst. Man Cybern. Part B.

[28]  Kaddour Najim,et al.  Learning Automata: Theory and Applications , 1994 .

[29]  René Schott,et al.  Parallel Searching in the Plane , 1995, Comput. Geom..

[30]  B. John Oommen,et al.  Generalized Swap-with-Parent Schemes for Self-Organizing Sequential Linear Lists , 1997, ISAAC.

[31]  B. John Oommen,et al.  Absorbing and Ergodic Discretized Two-Action Learning Automata , 1986, IEEE Trans. Syst. Man Cybern..

[32]  B. John Oommen,et al.  Continuous and discretized pursuit learning schemes: various algorithms and their comparison , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[33]  Yusun Chang,et al.  Reinforcement Learning for Repeated Power Control Game in Cognitive Radio Networks , 2012, IEEE Journal on Selected Areas in Communications.

[34]  P. S. Sastry,et al.  Continuous action set learning automata for stochastic optimization , 1994 .

[35]  P. Venkata Krishna,et al.  Learning automata as a utility for power management in smart grids , 2013, IEEE Communications Magazine.

[36]  Snehasis Mukhopadhyay,et al.  Decentralized Indirect Methods for Learning Automata Games , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[37]  Sheldon M. Ross,et al.  Introduction to probability models , 1975 .

[38]  B. John Oommen,et al.  Discretized estimator learning automata , 1992, IEEE Trans. Syst. Man Cybern..

[39]  Andrew Chi-Chih Yao,et al.  An Almost Optimal Algorithm for Unbounded Searching , 1976, Inf. Process. Lett..

[40]  J. Kingman A FIRST COURSE IN STOCHASTIC PROCESSES , 1967 .

[41]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[42]  Philip D. Wasserman,et al.  Neural computing - theory and practice , 1989 .

[43]  Mohammad S. Obaidat,et al.  Adaptive wireless networks using learning automata , 2011, IEEE Wireless Communications.

[44]  Georgios I. Papadimitriou Hierarchical Discretized Pursuit Nonlinear Learning Automata with Rapid Convergence and High Accuracy , 1994, IEEE Trans. Knowl. Data Eng..

[45]  Kumpati S. Narendra,et al.  Learning automata approach to hierarchical multiobjective analysis , 1991, IEEE Trans. Syst. Man Cybern..

[46]  M. Thathachar,et al.  Networks of Learning Automata: Techniques for Online Stochastic Optimization , 2003 .

[47]  B. John Oommen,et al.  Automata learning and intelligent tertiary searching for stochastic point location , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[48]  T Ben-Zvi,et al.  Decision Analysis: Environmental Learning Automata for Sensor Placement , 2011, IEEE Sensors Journal.

[49]  De-Shuang Huang,et al.  A General CPL-AdS Methodology for Fixing Dynamic Parameters in Dual Environments , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[50]  Alagan Anpalagan,et al.  Opportunistic Spectrum Access in Unknown Dynamic Environment: A Game-Theoretic Stochastic Learning Solution , 2012, IEEE Transactions on Wireless Communications.

[51]  D. I. Kountanis,et al.  A reorganization scheme for a hierarchical system of learning automata , 1984, IEEE Transactions on Systems, Man, and Cybernetics.

[52]  Yoshio Mogami,et al.  A new learning algorithm for the hierarchical structure learning automata operating in the nonstationary S-model random environment , 2002, IEEE Trans. Syst. Man Cybern. Part B.

[53]  P. Moran,et al.  Reversibility and Stochastic Networks , 1980 .

[54]  Yoshio Mogami,et al.  A new learning algorithm for the hierarchical structure learning automata operating in the general nonstationary multiteacher environment , 2003, SICE 2003 Annual Conference (IEEE Cat. No.03TH8734).

[55]  Anis Yazidi,et al.  Service selection in stochastic environments: a learning-automaton based solution , 2012, Applied Intelligence.