A Chinese Minority Script Recognition Method Based on Wavelet Feature and Modified KNN

In recent years, K-Nearest Neighbor (KNN) has demonstrated excellent performance in a variety of pattern recognition problems. In this paper, we proposed modified KNN for the Chinese minority scripts classification, using wavelet energy distribution and wavelet energy proportions features generated from the discrete wavelet multiresolution decomposition. The experimental results show that the method achieves high accuracy in testing dataset that consists of Chinese, English and Chinese minority scripts such as Tibetan, Tai Lue, Naxi Pictographs, Uighur, Tai Le, Yi. The results also show that this method is feasible and reasonable, and the recognition rate is up to 96%.

[1]  Göran Ståhl,et al.  Simultaneous Estimations of Forest Parameters using Aerial Photograph Interpreted Data and the k Nearest Neighbour Method , 2001 .

[2]  Piermaria Corona,et al.  Estimation of Mediterranean forest attributes by the application of k‐NN procedures to multitemporal Landsat ETM+ images , 2005 .

[3]  Dianwei Han,et al.  A comparison of two algorithms for predicting the condition number , 2007, Sixth International Conference on Machine Learning and Applications (ICMLA 2007).

[4]  Fatos T. Yarman-Vural,et al.  Modified condensed nearest neighbor rule as applied to speaker independent word recognition , 1988, Speech Commun..

[5]  William D. Mattson,et al.  Near-neighbor calculations using a modified cell-linked list method , 1999 .

[6]  E. Tomppo,et al.  Selecting estimation parameters for the Finnish multisource National Forest Inventory , 2001 .

[7]  Leonardo Traversoni Modified natural neighbor interpolant , 1992 .

[8]  Jun Zhang,et al.  A comparison of two algorithms for predicting the condition number , 2007, ICMLA 2007.

[9]  Alan J. Broder Strategies for efficient incremental nearest neighbor search , 1990, Pattern Recognit..

[10]  T. N. Tan,et al.  Scale and rotation invariant texture classification , 1994 .

[11]  Xu Jianfeng Web Embedding Fonts Technology of Naxi Pictographs , 2005 .

[12]  Zhu Xue Soft Sensor Modeling Based on a Modified k-Nearest Neighbor Regression Algorithm , 2007 .

[13]  F. Harris On the use of windows for harmonic analysis with the discrete Fourier transform , 1978, Proceedings of the IEEE.

[14]  Jian-ping Li,et al.  Uniform Analytic Construction of Wavelet Analysis Filters Based on Sine and Cosine Trigonometric Functions , 2001 .

[15]  Wang Weilan Study on Printed Tibetn Character Recognition , 2003 .

[16]  Ke-Wei Xu,et al.  Calculation of the surface energy of bcc transition metals by using the second nearest neighbor modified embedded atom method , 2006 .

[17]  Wang Hua Multi-font multi-size printed Uyghur character recognition , 2004 .

[18]  Nasser Kehtarnavaz,et al.  Proceedings of SPIE - The International Society for Optical Engineering , 1991 .

[19]  Gerrit Hoogenboom,et al.  Weather analogue: A tool for real-time prediction of daily weather data realizations based on a modified k-nearest neighbor approach , 2008, Environ. Model. Softw..

[20]  Richard Kronland-Martinet,et al.  Asymptotic wavelet and Gabor analysis: Extraction of instantaneous frequencies , 1992, IEEE Trans. Inf. Theory.

[21]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Michael Unser,et al.  Texture classification and segmentation using wavelet frames , 1995, IEEE Trans. Image Process..

[23]  Jian Fan,et al.  Texture Classification by Wavelet Packet Signatures , 1993, MVA.

[24]  Bedrich J. Hosticka,et al.  Unsupervised texture segmentation of images using tuned matched Gabor filters , 1995, IEEE Trans. Image Process..

[25]  M. Nilsson,et al.  Regional forest biomass and wood volume estimation using satellite data and ancillary data , 1999 .

[26]  Rangasami L. Kashyap,et al.  A Model-Based Method for Rotation Invariant Texture Classification , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Prashant Parikh A Theory of Communication , 2010 .

[28]  Ao Li,et al.  Phosphorylation Site Prediction with A Modified k-Nearest Neighbor Algorithm and BLOSUM62 Matrix , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[29]  C. Tassoni,et al.  Monitoring of induction Machines by maximum covariance method for frequency tracking , 2004, IEEE Transactions on Industry Applications.

[30]  Muhammad Arif,et al.  Modified Nearest Neighbor Method for Multistep Ahead Time Series Forecasting , 2007, Int. J. Pattern Recognit. Artif. Intell..

[31]  M. Bauer,et al.  Estimation and mapping of forest stand density, volume, and cover type using the k-nearest neighbors method , 2001 .

[32]  Tommy W. S. Chow,et al.  Induction machine fault diagnostic analysis with wavelet technique , 2004, IEEE Transactions on Industrial Electronics.

[33]  E. Tomppo Multi-source national forest inventory of Finland. , 1994 .

[34]  Li Zhen The Research of Printed Mongolian Character Recognition , 2003 .

[35]  Forest Baskett,et al.  An Algorithm for Finding Nearest Neighbors , 1975, IEEE Transactions on Computers.

[36]  Erkki Tomppo,et al.  Using coarse scale forest variables as ancillary information and weighting of variables in k-NN estimation: a genetic algorithm approach , 2004 .

[37]  Andrew O. Finley,et al.  Delineation of forest/nonforest land use classes using nearest neighbor methods , 2004 .

[38]  C.-C. Jay Kuo,et al.  Texture analysis and classification with tree-structured wavelet transform , 1993, IEEE Trans. Image Process..

[39]  András Faragó,et al.  Fast Nearest-Neighbor Search in Dissimilarity Spaces , 1993, IEEE Trans. Pattern Anal. Mach. Intell..