Forensics in Telecommunications, Information and Multimedia

This paper introduces a novel design for handwritten letter recognition that combines a back-propagation neural network with an enhanced evolutionary algorithm. The input to the neural network is prepared by a new approach that is invariant to translation, rotation, and scaling of the input letters. The evolutionary algorithm performs the global search of the search space, and the back-propagation algorithm performs the local search. The approach was evaluated by recognizing the 26 English capital letters in the handwriting of different people. The computational results show that the neural network achieves very satisfactory results with relatively scarce input data, and that the hybrid evolutionary back-propagation algorithm exhibits a promising improvement in convergence.
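
The division of labor described above, with an evolutionary algorithm for global search and back-propagation for local search, can be illustrated with a minimal sketch. This is not the paper's implementation: the toy XOR data, the 2-3-1 network size, the crossover and mutation scheme, and every name below (`hybrid_train`, `backprop_step`, and so on) are assumptions chosen only to keep the example self-contained.

```python
import math
import random

# Toy data (XOR) standing in for letter feature vectors; purely illustrative.
DATA = [((0.0, 0.0), 0.0), ((0.0, 1.0), 1.0), ((1.0, 0.0), 1.0), ((1.0, 1.0), 0.0)]
N_HIDDEN = 3                              # assumed hidden-layer size
N_WEIGHTS = 3 * N_HIDDEN + N_HIDDEN + 1   # hidden (2 inputs + bias) + output (hidden + bias)


def sigmoid(z):
    z = max(-60.0, min(60.0, z))          # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))


def forward(w, x):
    """Return hidden activations and output of a 2-N_HIDDEN-1 sigmoid network."""
    hidden = [sigmoid(w[3 * j] * x[0] + w[3 * j + 1] * x[1] + w[3 * j + 2])
              for j in range(N_HIDDEN)]
    base = 3 * N_HIDDEN
    out = sigmoid(sum(w[base + j] * hidden[j] for j in range(N_HIDDEN))
                  + w[base + N_HIDDEN])
    return hidden, out


def loss(w):
    """Mean squared error of the network with weights w over DATA."""
    return sum((forward(w, x)[1] - t) ** 2 for x, t in DATA) / len(DATA)


def backprop_step(w, lr=0.5):
    """One batch gradient-descent step on the mean squared error (local search)."""
    grad = [0.0] * N_WEIGHTS
    base = 3 * N_HIDDEN
    for x, t in DATA:
        hidden, out = forward(w, x)
        d_out = 2.0 * (out - t) * out * (1.0 - out) / len(DATA)
        for j in range(N_HIDDEN):
            grad[base + j] += d_out * hidden[j]
            d_hid = d_out * w[base + j] * hidden[j] * (1.0 - hidden[j])
            grad[3 * j] += d_hid * x[0]
            grad[3 * j + 1] += d_hid * x[1]
            grad[3 * j + 2] += d_hid
        grad[base + N_HIDDEN] += d_out
    return [wi - lr * gi for wi, gi in zip(w, grad)]


def hybrid_train(generations=40, pop_size=20, local_steps=20, seed=0):
    """Evolve a population of weight vectors; refine the best one by back-propagation."""
    rng = random.Random(seed)
    pop = [[rng.uniform(-1.0, 1.0) for _ in range(N_WEIGHTS)] for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        pop.sort(key=loss)
        history.append(loss(pop[0]))
        elite = pop[: pop_size // 4]      # needs pop_size >= 8 so two parents exist
        # Global search: uniform crossover of two elite parents plus Gaussian mutation.
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            children.append([rng.choice(pair) + rng.gauss(0.0, 0.3) for pair in zip(a, b)])
        # Local search: refine the current best and keep it only if it improved.
        refined = elite[0]
        for _ in range(local_steps):
            refined = backprop_step(refined)
        if loss(refined) < loss(elite[0]):
            elite[0] = refined
        pop = elite + children
    pop.sort(key=loss)
    history.append(loss(pop[0]))
    return pop[0], history
```

Calling `hybrid_train()` returns the best weight vector found and the best loss recorded at each generation. Because the elite individuals are carried over unchanged, or replaced only by a refined version with lower loss, the recorded best loss never increases from one generation to the next. Whether this matches the selection scheme of the enhanced evolutionary algorithm in the paper is not stated in the abstract.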
