Phonemes are the smallest distinguishable unit of speech signal. Segmentation of phoneme from its word counterpart is a fundamental and crucial part in speech processing since initial phoneme is used to activate words starting with that phoneme. This work describes an Artificial Neural Network (ANN) based algorithm developed for segmentation and classification of consonant phoneme of Assamese language. The algorithm uses weight vectors, obtained by training Self Organizing Map (SOM) with different number of iteration. Segments of different phonemes constituting the word whose LPC samples are used for training are obtained from SOM weights. A two class Probabilistic Neural Network (PNN) trained with clean Assamese phoneme is used to identify phoneme segment. The classification of phoneme segment is performed as per the consonant phoneme structure of Assamese language which consists of six phoneme families. Experimental results establish the superiority of the SOM-based segmentation over the speaker independent phoneme segmentation reported till now including those obtained using Discrete Wavelet Transform (DWT).
[1]
Golockchandra Goswami,et al.
Structure of Assamese
,
1982
.
[2]
M.G. Bellanger,et al.
Digital processing of speech signals
,
1980,
Proceedings of the IEEE.
[3]
A.M. De Lima Araujo,et al.
Formant frequency estimation using a Mel-scale LPC algorithm
,
1998,
ITS'98 Proceedings. SBT/IEEE International Telecommunications Symposium (Cat. No.98EX202).
[4]
Kjell Elenius,et al.
Multi-layer perceptrons and probabilistic neural networks for phoneme recognition
,
1993,
EUROSPEECH.
[5]
Beng T. Tan,et al.
Applying wavelet analysis to speech segmentation and classification
,
1994,
Defense, Security, and Sensing.
[6]
Teuvo Kohonen,et al.
The self-organizing map
,
1990,
Neurocomputing.
[7]
Donald F. Specht,et al.
Probabilistic neural networks
,
1990,
Neural Networks.
[8]
Young-Seuk Park,et al.
Self-Organizing Map
,
2008
.