Application of Reinforcement Learning Algorithms for the Adaptive Computation of the Smoothing Parameter for Probabilistic Neural Network

In this paper, we propose new methods for the choice and adaptation of the smoothing parameter of the probabilistic neural network (PNN). These methods are based on three reinforcement learning algorithms: Q(0)-learning, Q(λ)-learning, and stateless Q-learning. We regard three types of PNN classifiers: the model that uses single smoothing parameter for the whole network, the model that utilizes single smoothing parameter for each data attribute, and the model that possesses the matrix of smoothing parameters different for each data variable and data class. Reinforcement learning is applied as the method of finding such a value of the smoothing parameter, which ensures the maximization of the prediction ability. PNN models with smoothing parameters computed according to the proposed algorithms are tested on eight databases by calculating the test error with the use of the cross validation procedure. The results are compared with state-of-the-art methods for PNN training published in the literature up to date and, additionally, with PNN whose sigma is determined by means of the conjugate gradient approach. The results demonstrate that the proposed approaches can be used as alternative PNN training procedures.

[1]  Chu Kiong Loo,et al.  A Global k-means Approach for Autonomous Cluster Initialization of Probabilistic Neural Network , 2008, Informatica.

[2]  Dominique Bertrand,et al.  Reduction of the size of the learning data in a probabilistic neural network by hierarchical clustering. Application to the discrimination of seeds by artificial vision , 1996 .

[3]  Kenji Doya,et al.  Meta-learning in Reinforcement Learning , 2003, Neural Networks.

[4]  Martin A. Riedmiller,et al.  Reinforcement learning on explicitly specified time scales , 2003, Neural Computing & Applications.

[5]  Chenn-Jung Huang,et al.  A comparative study of feature selection methods for probabilistic neural networks in cancer classification , 2003, Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence.

[6]  Michael R. Berthold,et al.  Constructive training of probabilistic neural networks , 1998, Neurocomputing.

[7]  Mohammad Teshnehlab,et al.  Thyroid Disease Diagnosis Based on Genetic Algorithms Using PNN and SVM , 2009, 2009 3rd International Conference on Bioinformatics and Biomedical Engineering.

[8]  Timothy Masters,et al.  Advanced algorithms for neural networks: a C++ sourcebook , 1995 .

[9]  Janusz A. Starzyk,et al.  A Novel Optimization Algorithm Based on Reinforcement Learning , 2010 .

[10]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[11]  Hiok Chai Quek,et al.  Maximum reward reinforcement learning: A non-cumulative reward criterion , 2006, Expert Syst. Appl..

[12]  Ahmad Taher Azar,et al.  Probabilistic neural network for breast cancer classification , 2012, Neural Computing and Applications.

[13]  Satoshi Yamada,et al.  Reinforcement learning to train a cooperative network with both discrete and continuous output neurons , 1998, IEEE Trans. Neural Networks.

[14]  B. Chandra,et al.  An improved architecture for Probabilistic Neural Networks , 2011, The 2011 International Joint Conference on Neural Networks.

[15]  Florin Gorunescu,et al.  An evolutionary computational approach to probabilistic neural network with application to hepatic cancer diagnosis , 2005, 18th IEEE Symposium on Computer-Based Medical Systems (CBMS'05).

[16]  Roy L. Streit,et al.  Maximum likelihood training of probabilistic neural networks , 1994, IEEE Trans. Neural Networks.

[17]  Guowang Xu,et al.  Application of probabilistic neural network in the clinical diagnosis of cancers based on clinical chemistry data , 2002 .

[18]  Kenji Doya,et al.  Multiple model-based reinforcement learning explains dopamine neuronal activity , 2007, Neural Networks.

[19]  Michael N. Vrahatis,et al.  Novel Approaches to Probabilistic Neural Networks Through Bagging and Evolutionary Estimating of Prior Probabilities , 2008, Neural Processing Letters.

[20]  Fevzullah Temurtas,et al.  A comparative study on diabetes disease diagnosis using neural networks , 2009, Expert Syst. Appl..

[21]  Srinivasan Ramakrishnan,et al.  Image texture classification using wavelet based curve fitting and probabilistic neural network , 2007, Int. J. Imaging Syst. Technol..

[22]  Hans G. C. Tråvén,et al.  A neural network approach to statistical pattern classification by 'semiparametric' estimation of probability density functions , 1991, IEEE Trans. Neural Networks.

[23]  P. Lanzi,et al.  Adaptive Agents with Reinforcement Learning and Internal Memory , 2000 .

[24]  Donald F. Specht,et al.  Probabilistic neural networks and the polynomial Adaline as complementary techniques for classification , 1990, IEEE Trans. Neural Networks.

[25]  Jun-Geol Baek,et al.  Asynchronous action-reward learning for nonstationary serial supply chain inventory control , 2008, Applied Intelligence.

[26]  Ronald Marsh,et al.  Conjugate gradient and approximate Newton methods for an optimal probabilistic neural network for food color classification , 1998 .

[27]  Leszek Rutkowski,et al.  Adaptive probabilistic neural networks for pattern classification in time-varying environment , 2004, IEEE Transactions on Neural Networks.

[28]  Kuan-Cheng Lin,et al.  Supervised Learning Probabilistic Neural Networks , 2011, Neural Processing Letters.

[29]  George C. Anastassopoulos,et al.  Genetic algorithm pruning of probabilistic neural networks in medical disease estimation , 2011, Neural Networks.

[30]  Q. Henry Wu,et al.  High-dimensional Function Optimisation by Reinforcement Learning , 2010, IEEE Congress on Evolutionary Computation.

[31]  Jianghao Li,et al.  Microassembly path planning using reinforcement learning for improving positioning accuracy of a 1 cm3 omni-directional mobile microrobot , 2011, Applied Intelligence.

[32]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[33]  Peter Stone,et al.  Generalized model learning for Reinforcement Learning on a humanoid robot , 2010, 2010 IEEE International Conference on Robotics and Automation.

[34]  Hojjat Adeli,et al.  A probabilistic neural network for earthquake magnitude prediction , 2009, Neural Networks.

[35]  Buse Melis Ozyildirim,et al.  Generalized classifier neural network , 2013, Neural Networks.

[36]  Fan Zhou,et al.  Field-programmable gate array implementation of a probabilistic neural network for motor cortical decoding in rats , 2010, Journal of Neuroscience Methods.

[37]  D. F. Specht,et al.  Experience with adaptive probabilistic neural networks and adaptive general regression neural networks , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[38]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[39]  Aluizio F. R. Araújo,et al.  A topological reinforcement learning agent for navigation , 2003, Neural Computing & Applications.

[40]  Wee Ser,et al.  Probabilistic neural-network structure determination for pattern classification , 2000, IEEE Trans. Neural Networks Learn. Syst..

[41]  Roland Siegwart,et al.  Compact Q-learning optimized for micro-robots with processing and memory constraints , 2004, Robotics Auton. Syst..

[42]  D. F. Specht,et al.  Enhancements to probabilistic neural networks , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[43]  Tao Tang,et al.  Online Learning Algorithms for Train Automatic Stop Control Using Precise Location Data of Balises , 2013, IEEE Transactions on Intelligent Transportation Systems.

[44]  Mark Rivera,et al.  Gap-Based Estimation: Choosing the Smoothing Parameters for Probabilistic and General Regression Neural Networks , 2006, Neural Computation.

[45]  Nicos G. Pavlidis,et al.  New Self-adaptive Probabilistic Neural Networks in Bioinformatic and Medical Tasks , 2006, Int. J. Artif. Intell. Tools.

[46]  Donald F. Specht,et al.  Probabilistic neural networks , 1990, Neural Networks.

[47]  Timothy Masters,et al.  Practical neural network recipes in C , 1993 .

[48]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 2005, IEEE Transactions on Neural Networks.