Interpreting and improving multi-layered networks by free energy-based competitive learning

In this paper, we propose a new algorithm to interpret and improve multi-layered neural networks. The new method is used to simplify and interpret final representations obtained by conventional learning methods such as BP or RBF. In addition, the method is used to retrain networks so as to produce networks with better performance in terms of generalization errors. Neural networks have so far produced effective internal representations for many problems. However, because obtained information is extremely distributed over many elements, it is difficult to interpret the meaning of representations. For this problem, we use competitive learning by which we can focus upon some elements in networks. In addition, because the number of effective elements in neural networks can significantly be reduced, we can expect improved performance in terms of generalization. Competitive processes are considered to be a process of entropy minimization in which a small number of units acquires the majority of information on input patterns. For simplifying computation, we use a free energy in which entropy and training errors are simultaneously used. We applied the methods to an artificial data and a cabinet approval rating estimation problem. In both problems, we succeeded in extracting the main features in input patterns and improved generalization performance could be obtained.

[1]  Dhavan V. Shah,et al.  News Media, Candidates and Issues, and Public Opinion in the 1996 Presidential Campaign , 1997 .

[2]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[3]  Bryan E. Denham,et al.  Horserace, Issue Coverage in Prestige Newspapers during 1988, 1992 Elections , 1994 .

[4]  Ryotaro Kamimura,et al.  Teacher-directed learning: information-theoretic competitive learning in supervised multi-layered networks , 2003, Connect. Sci..

[5]  Dhavan V. Shah,et al.  Elite Cues and Media Bias in Presidential Campaigns , 1999 .

[6]  RYOTARO KAMIMURA Improving information-theoretic competitive learning by accentuated information maximization , 2005, Int. J. Gen. Syst..

[7]  Robert Hecht-Nielsen,et al.  Applications of counterpropagation networks , 1988, Neural Networks.

[8]  Michael C. Mozer,et al.  Using Relevance to Reduce Network Size Automatically , 1989 .

[9]  Rose,et al.  Statistical mechanics and phase transitions in clustering. , 1990, Physical review letters.

[10]  Marc M. Van Hulle,et al.  The Formation of Topographic Maps That Maximize the Average Mutual Information of the Output Responses to Noiseless Input Signals , 1997, Neural Computation.

[11]  Stanley C. Ahalt,et al.  Competitive learning algorithms for vector quantization , 1990, Neural Networks.

[12]  Ryotaro Kamimura Information-theoretic Competitive Learning , 2003, Modelling and Simulation.

[13]  Julie L. Andsager,et al.  Framing the Candidates in Presidential Primaries: Issues and Images in Press Releases and News Coverage , 1998 .

[14]  Stephen Grossberg,et al.  Competitive Learning: From Interactive Activation to Adaptive Resonance , 1987, Cogn. Sci..

[15]  Ryotaro Kamimura,et al.  Flexible feature discovery and structural information control , 2001, Connect. Sci..

[16]  David P. Fan,et al.  Computer Content Analysis of Press Coverage and Prediction of Public Opinion for the 1995 Sovereignty Referendum in Quebec , 1997 .

[17]  David Zipser,et al.  Feature Discovery by Competive Learning , 1986, Cogn. Sci..

[18]  Ryotaro Kamimura,et al.  Progressive Feature Extraction with a Greedy Network-growing Algorithm , 2003, Complex Syst..

[19]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[20]  Ryotaro Kamimura,et al.  Information-Theoretic Competitive Learning with Inverse Euclidean Distance Output Units , 2003, Neural Processing Letters.

[21]  Naonori Ueda,et al.  Deterministic Annealing Variant of the EM Algorithm , 1994, NIPS.

[22]  Thomas Martinetz,et al.  'Neural-gas' network for vector quantization and its application to time-series prediction , 1993, IEEE Trans. Neural Networks.

[23]  Duane DeSieno,et al.  Adding a conscience to competitive learning , 1988, IEEE 1988 International Conference on Neural Networks.