On the Analysis of Pattern Sequences by Self-Organizing Maps

This thesis is organized in three parts. In the rst part, the Self-Organizing Map algorithm is introduced. The discussion focuses on the analysis of the Self-Organizing Map algorithm. It is shown that the nonlinear nature of the algorithm makes it di cult to analyze the algorithm except in some trivial cases. In the second part the Self-Organizing Map algorithm is applied to several patterns sequence analysis tasks. The rst application is a voice quality analysis system. It is shown that the Self-Organizing Map algorithm can be applied to voice analysis by providing the visualization of certain deviations. The key point in the applicability of Self-Organizing Map algorithm is the topological nature of the mapping; similar voice samples are mapped to nearby locations in the map. The second application is a speech recognition system. Through several experiments it is demonstrated that by collecting some time dependent features and using them in conjunction with the basic Self-Organizing Map algorithm one can improve the speech recognition accuracy considerably. The applications explained in the second part of the thesis were rather straightforward works where the sequential signal itself was transformed for the analysis. In the third part of the thesis it is demonstrated that the Self-Organizing Map algorithm itself could be extended by identifying each Map unit with an arbitrary operator with capabilities for pattern sequence processing. It is shown that the operator maps are applicable for example to speech signal (waveform) categorization. On the Analysis of Pattern Sequences by Self-Organizing Maps Jari Kangas Helsinki University of Technology Laboratory of Computer and Information Science Rakentajanaukio 2 C, SF-02150, FINLAND Thesis for the degree of Doctor of Technology to be presented with due permission for public examination and criticism in the Auditorium F1 of the Helsinki University of Technology on 6th of May 1994, at 12 o'clock noon. i Dedication To my wife P aivi and our little son Miika. ii Acknowledgements The work presented in this thesis has been carried out in the Laboratory of Computer and Information Science in the Faculty of Information Technology of the Helsinki University of Technology. I wish to thank Professor Teuvo Kohonen for initially directing me to work with the SelfOrganizing Map algorithm and for the discussions and the encouragement to continue the research on SOM during the ensuing years. I also wish to thank Professor Kohonen for providing the excellent facilities and support for this research. I would like to thank all the personnel of the Laboratory of Computer and Information Science for creating an inspiring and comfortable atmosphere. The informal discussions with the research group members have always been useful. Professor Teuvo Kohonen and Professor Erkki Oja commented in detail on the manuscript during the nal revisioning which is gratefully acknowledged. Mr. Paul Schurman deserves many thanks for checking the language of the thesis. The nancial support of the Academy of Finland is acknowledged. Espoo, April 15, 1994 Jari Kangas iii

[1]  Olli Ventä,et al.  Phonetic typewriter for Finnish and Japanese , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[2]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[3]  Alfred Ultsch Knowledge Acquisition with Self-Organizing Neural Networks , 1992, ECAI.

[4]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[5]  Jari Kangas Phoneme recognition using time-dependent versions of self-organizing maps , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[6]  Alfred Ultsch,et al.  Self Organized Feature Maps for Monitoring and Knowledge Aquisition of a Chemical Process , 1993 .

[7]  T. Kohonen Analysis of a simple self-organizing process , 1982, Biological Cybernetics.

[8]  Olli Simula,et al.  Process State Monitoring Using Self-Organizing Maps , 1992 .

[9]  Teuvo Kohonen,et al.  An introduction to neural computing , 1988, Neural Networks.

[10]  Olli Simula,et al.  Self-Organizing Feature Maps in Texture Classification and Segmentation , 1992 .

[11]  Thomas Villmann,et al.  Dynamics of Self-Organized Feature Mapping , 1993, IWANN.

[12]  Michael A. Arbib,et al.  The handbook of brain theory and neural networks , 1995, A Bradford book.

[13]  Zhen-Ping Lo,et al.  On the rate of convergence in topology preserving neural networks , 1991, Biological Cybernetics.

[14]  K. Schulten,et al.  Kohonen's self-organizing maps: exploring their computational capabilities , 1988, IEEE 1988 International Conference on Neural Networks.

[15]  L Leinonen,et al.  Acoustic pattern recognition of fricative-vowel coarticulation by the self-organizing map. , 1993, Folia phoniatrica.

[16]  Edgar Sanchez-Sinencio,et al.  Artificial Neural Networks: Paradigms, Applications, and Hardware Implementations , 1994 .

[17]  H. Ritter,et al.  Convergence properties of Kohonen's topology conserving maps: fluctuations, stability, and dimension selection , 1988, Biological Cybernetics.

[18]  Clifford Lau,et al.  Neural Networks: Theoretical Foundations and Analysis , 1991 .

[19]  Helge Ritter Asymptotic level density for a class of vector quantization processes , 1991, IEEE Trans. Neural Networks.

[20]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[21]  Livia De Giovanni,et al.  ON THE MATHEMATICAL TREATMENT OF SELF-ORGANIZATION: EXTENSION OF SOME CLASSICAL RESULTS , 1991 .

[22]  Edgardo A. Ferrán,et al.  UNSUPERVISED CLUSTERING OF PROTEINS , 1991 .

[23]  Eero Carlson SELF-ORGANIZING FEATURE MAPS FOR APPRAISAL OF LAND VALUE OF SHORE PARCELS , 1991 .

[24]  L Leinonen,et al.  Self-organized acoustic feature map in detection of fricative-vowel coarticulation. , 1993, The Journal of the Acoustical Society of America.

[25]  Stephen D. Collins Neurocomputing 2 , 1992, Neurology.

[26]  John G. Taylor,et al.  Storing temporal sequences , 1991, Neural Networks.

[27]  G. Abor,et al.  Genetic Algorithm with Migration on Topology Conserving Maps , 1995 .

[28]  G. L. Sicuranza,et al.  Application of ordered codebooks to image coding , 1993, Neural Networks for Signal Processing III - Proceedings of the 1993 IEEE-SP Workshop.

[29]  J. C. Scholtes Resolving Linguistic Ambiguities with a Neural Data-Oriented Parsing (DOP) System , 1992 .

[30]  Michael H. Gera Finding Multi-Faculty Structure , 1992 .

[31]  Risto Miikkulainen,et al.  Self-Organizing Process Based On Lateral Inhibition And Synaptic Resource Redistribution , 1991 .

[32]  Zhen-Ping Lo,et al.  Analysis of the convergence properties of topology preserving neural networks , 1993, IEEE Trans. Neural Networks.

[33]  R. Lippmann,et al.  An introduction to computing with neural nets , 1987, IEEE ASSP Magazine.

[34]  J. H. Zar,et al.  Biostatistical Analysis (5th Edition) , 1984 .

[35]  Alfred Ultsch,et al.  OPTIMIZING LOGICAL PROOFS WITH CONNECTIONIST NETWORKS , 2006 .

[36]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[37]  L Leinonen,et al.  Dysphonia detected by pattern recognition of spectral composition. , 1992, Journal of speech and hearing research.

[38]  Pietro Morasso SELF-ORGANISING FEATURE MAPS FOR CURSIVE SCRIPT RECOGNITION , 1991 .

[39]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[40]  S. P. Luttrell,et al.  Self-organisation: a derivation from first principles of a class of learning algorithms , 1989, International 1989 Joint Conference on Neural Networks.

[41]  Helge J. Ritter,et al.  Neural computation and self-organizing maps - an introduction , 1992, Computation and neural systems series.

[42]  Teuvo Kohonen,et al.  Things you haven't heard about the self-organizing map , 1993, IEEE International Conference on Neural Networks.

[43]  Fernando J. Pineda,et al.  Recurrent Backpropagation and the Dynamical Approach to Adaptive Neural Computation , 1989, Neural Computation.

[44]  Klaus Schulten,et al.  Self-organizing maps: ordering, convergence properties and energy functions , 1992, Biological Cybernetics.

[45]  G. J. Hueter Solution of the traveling salesman problem with an adaptive ring , 1988, IEEE 1988 International Conference on Neural Networks.

[46]  James A. Anderson,et al.  Neurocomputing: Foundations of Research , 1988 .

[47]  Juha Tuominen,et al.  PROCESS ERROR DETECTION USING SELF-ORGANIZING FEATURE MAPS , 1991 .

[48]  J. C. Scholtes Unsupervised learning and the information retrieval problem , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[49]  K. Schulten,et al.  On the stationary state of Kohonen's self-organizing sensory mapping , 2004, Biological Cybernetics.

[50]  Paul L. Zador,et al.  Asymptotic quantization error of continuous signals and the quantization dimension , 1982, IEEE Trans. Inf. Theory.

[51]  B Brückner,et al.  A MODIFIED HYPERMAP ARCHITECTURE FOR CLASSIFICATION OF BIOLOGICAL SIGNALS , 1992 .

[52]  Simon Cumming,et al.  Neural networks for monitoring of engine condition data , 1993, Neural Computing & Applications.

[53]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[54]  Ping Wu,et al.  Neural network feature maps for Chinese phonemes , 1992, Neurocomputing.

[55]  Neural pattern recognition : distortion tolerance by self-organizing maps , 1992 .

[56]  Mikko Kurimo,et al.  Status Report Of The Finnish Phonetic Typewriter Project , 1991 .

[57]  Bernice W. Polemis Nonparametric Statistics for the Behavioral Sciences , 1959 .

[58]  Helge J. Ritter,et al.  Three-dimensional neural net for learning visuomotor coordination of a robot arm , 1990, IEEE Trans. Neural Networks.

[59]  Teuvo Kohonen,et al.  The 'neural' phonetic typewriter , 1988, Computer.

[60]  T. Kohonen,et al.  Self-organizing semantic maps , 1989, Biological Cybernetics.

[61]  Sylvie S. Jumpertz,et al.  Image Sequence Coding Using a Neural Vector Quantization , 1993 .

[62]  Jukka Saarinen,et al.  Classification of Epileptic EEG by Using Self-Organizing Maps , 1992 .

[63]  S. Luttrell Derivation of a Class of Training , 1990 .

[64]  V. V. Tolat An analysis of Kohonen's self-organizing maps using a system of energy functions , 1990, Biological Cybernetics.

[65]  Alex Waibel,et al.  Readings in speech recognition , 1990 .

[66]  John G. Taylor,et al.  The temporal Kohönen map , 1993, Neural Networks.

[67]  Teuvo Kohonen,et al.  Transient map method in stop consonant discrimination , 1989, EUROSPEECH.

[68]  Nigel M. Allinson,et al.  FINANCIAL DATA RECOGNITION AND PREDICTION USING NEURAL NETWORKS , 1991 .

[69]  John J. Hopfield,et al.  Connected-digit speaker-dependent speech recognition using a neural network with time-delayed connections , 1991, IEEE Trans. Signal Process..

[70]  Jing Peng,et al.  An Efficient Gradient-Based Algorithm for On-Line Training of Recurrent Network Trajectories , 1990, Neural Computation.