Switching models for nonstationary random environments

Learning automata are stochastic finite state machines that attempt to learn the characteristic of an unknown random environment with which they interact. The fundamental problem is that of learning, through feedback, the action which has the highest probability of being rewarded by the environment. The problem of designing automata for stationary environments has been extensively studied. When the environment is nonstationary, the question of modeling the nonstationarity is, in itself, a very interesting problem. In this paper, the authors generalize the model used in Tsetlin (1971, 1973) to present three models of nonstationarity. In the first two cases, the nonstationarity is modeled by a homogeneous Markov chain governing the way in which the characteristics change. The final model considers the more general case when the transition matrix of this chain itself changes with time in a geometric manner. In each case the authors analyze the stochastic properties of the resultant switching environment. The question of analyzing the various learning machines when interacting with these environments introduces an entire new avenue of open research problems. >

[1]  V. Ralph Algazi,et al.  5 – Image Data Compression† , 1984 .

[2]  B. John Oommen,et al.  List Organizing Strategies Using Stochastic Move-to-Front and Stochastic Move-to-Rear Operations , 1987, SIAM J. Comput..

[3]  Norio Baba,et al.  An application of the stochastic automaton to the investment game , 1980 .

[4]  Don R. Hush,et al.  Change detection for target detection and classification in video sequences , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[5]  Jake K. Aggarwal,et al.  Integrated Analysis of Thermal and Visual Images for Scene Interpretation , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Naoki Mukawa,et al.  Uncovered Background Prediction in Interframe Coding , 1985, IEEE Trans. Commun..

[7]  Steven D. Blostein,et al.  Detecting small, moving objects in image sequences using sequential hypothesis testing , 1991, IEEE Trans. Signal Process..

[8]  P. R. Srikantakumar,et al.  A LEARNING MODEL FOR ROUTING IN TELEPHONE NETWORKS , 1982 .

[9]  T.F. Quatieri,et al.  Statistical model-based algorithms for image analysis , 1986, Proceedings of the IEEE.

[10]  Hiroshi Hanaizumi,et al.  Change Detection From Remotely Sensed Multi-temporal Images Using Spatial Segmentation , 1991, [Proceedings] IGARSS'91 Remote Sensing: Global Monitoring for Earth Management.

[11]  Dietmar Hepper,et al.  Efficiency analysis and application of uncovered background prediction in a low bit rate image coder , 1990, IEEE Trans. Commun..

[12]  John W. Woods 3 – Image Detection and Estimation , 1984 .

[13]  Kumpati S. Narendra,et al.  On the Behavior of a Learning Automaton in a Changing Environment with Application to Telephone Traffic Routing , 1980, IEEE Transactions on Systems, Man, and Cybernetics.

[14]  Hans-Hellmut Nagel,et al.  New likelihood test methods for change detection in image sequences , 1984, Comput. Vis. Graph. Image Process..

[15]  Richard D. Holben Adaptive multisensor change detection , 1990, Defense, Security, and Sensing.

[16]  B. John Oommen,et al.  Deterministic Learning Automata Solutions to the Equipartitioning Problem , 1988, IEEE Trans. Computers.

[17]  Evaggelos Geraniotis,et al.  Robust data fusion for multisensor detection systems , 1990, IEEE Trans. Inf. Theory.

[18]  Yoram Yakimovsky,et al.  Boundary and Object Detection in Real World Images , 1974, JACM.

[19]  Jake K. Aggarwal,et al.  Segmentation through the detection of changes due to motion , 1979 .

[20]  Kumpati S. Narendra,et al.  Application of Learning Automata to Telephone Traffic Routing and Control , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[21]  Nils Sandell,et al.  Detection with Distributed Sensors , 1980, IEEE Transactions on Aerospace and Electronic Systems.

[22]  B. Johnoommen Absorbing and Ergodic Discretized Two-Action Learning Automata , 1986, IEEE Transactions on Systems, Man, and Cybernetics.

[23]  Jake K. Aggarwal,et al.  Multiple Sensor Integration/Fusion Through Image Processing: A Review , 1986 .