Conjugate and natural gradient rules for BYY harmony learning on Gaussian mixture with automated model selection

Under the Bayesian Ying–Yang (BYY) harmony learning theory, a harmony function has been developed on the bidirectional architecture of the BYY system for Gaussian mixtures. Its important feature is that, when it is maximized by a general gradient rule, model selection is made automatically during parameter learning on a set of sample data from a Gaussian mixture. This paper further proposes conjugate and natural gradient rules to implement the maximization of the harmony function, i.e. BYY harmony learning, on Gaussian mixtures more efficiently. Simulation experiments demonstrate that these two new gradient rules not only work well but also converge more quickly than the general gradient rule.
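To make the quantity being maximized concrete, the following is a minimal sketch of a harmony function for a one-dimensional Gaussian mixture. The abstract does not state the formula, so the form used here is an assumption taken from the way the harmony function is commonly written in the BYY literature for Gaussian mixtures: J = (1/N) Σ_t Σ_j p(j|x_t) ln[α_j q(x_t|m_j, σ_j²)], with p(j|x_t) the posterior probability of component j. The names `gaussian_pdf` and `harmony` are hypothetical, and this sketch covers neither the conjugate nor the natural gradient rule proposed in the paper.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def harmony(xs, alphas, means, variances):
    """Assumed harmony function J for a 1-D Gaussian mixture.

    J = (1/N) * sum_t sum_j p(j|x_t) * ln(alpha_j * q(x_t | m_j, var_j)),
    where p(j|x_t) = alpha_j q(x_t|.) / sum_i alpha_i q(x_t|.).
    This form is an assumption; it is not given in the abstract.
    """
    total = 0.0
    for x in xs:
        # Joint terms alpha_j * q(x | m_j, var_j) for every component.
        joint = [a * gaussian_pdf(x, m, v)
                 for a, m, v in zip(alphas, means, variances)]
        mix = sum(joint)
        if mix <= 0.0:
            continue  # every component underflowed; the sample adds nothing
        for u in joint:
            if u > 0.0:  # a term with u == 0 has limit 0 (0 * ln 0)
                total += (u / mix) * math.log(u)
    return total / len(xs)


# Toy data drawn by hand around two centres, -2 and +2 (illustrative only).
data = [-2.1, -1.9, -2.0, 2.0, 1.9, 2.1]
j_matched = harmony(data, [0.5, 0.5], [-2.0, 2.0], [0.25, 0.25])
j_mismatched = harmony(data, [0.5, 0.5], [0.0, 0.0], [0.25, 0.25])
```

On this toy data the parameters centred on the two clusters give a larger value than the parameters centred at zero, which is the direction a gradient ascent on J would move. With a single component the posterior is always 1, so J reduces to the average log-density of the samples.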
