Clustering and maximum likelihood search for efficient statistical classification with medium-sized databases

This paper addresses the insufficient performance of statistical classification with medium-sized databases (thousands of classes). Each object is represented as a sequence of independent segments, and each segment is defined as a random sample of independent features whose distribution is of multivariate exponential type. To speed up the optimal Kullback–Leibler minimum information discrimination principle, we cluster the training set and perform an approximate nearest neighbor search of the input object in the set of cluster medoids. Using the asymptotic properties of the Kullback–Leibler divergence, we propose the maximal likelihood search procedure. In this method, the next medoid to check is selected from the cluster with the maximal joint density (likelihood) of the distances to the previously checked medoids. Experimental results in image recognition with an artificially generated dataset and the Essex facial database show that the proposed approach is much more effective than an exhaustive search and the known approximate nearest neighbor methods from the FLANN and NonMetricSpace libraries.
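The clustering-plus-medoid idea described above can be illustrated with a minimal sketch. It is not the paper's maximal likelihood search procedure: it shows only the simpler two-stage baseline, in which the input is first compared with the cluster medoids and then matched exhaustively inside the nearest cluster. Several things here are assumptions made for illustration: objects are represented as discrete histograms, the divergence is computed from the input to the model, the clusters are given in advance, and the names `kl_divergence`, `find_medoid`, `build_index`, and `classify` are hypothetical.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """Kullback-Leibler divergence KL(p || q) between two discrete histograms.

    eps guards against division by zero; it is an illustrative choice.
    """
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def find_medoid(histograms):
    """Return the histogram with the smallest total divergence to the others."""
    return min(
        histograms,
        key=lambda m: sum(kl_divergence(m, other) for other in histograms),
    )


def build_index(clusters):
    """Pair each cluster with its medoid.

    clusters: list of clusters, each a list of (label, histogram) tuples.
    Returns a list of (medoid_histogram, members) entries.
    """
    return [
        (find_medoid([hist for _, hist in members]), members)
        for members in clusters
    ]


def classify(x, index):
    """Two-stage nearest neighbor search (assumed baseline, not the paper's method)."""
    # Stage 1: find the cluster whose medoid is closest to the input.
    _, members = min(index, key=lambda entry: kl_divergence(x, entry[0]))
    # Stage 2: exhaustive search restricted to that cluster.
    label, _ = min(members, key=lambda item: kl_divergence(x, item[1]))
    return label


# Toy data: two hand-made clusters of three-bin histograms.
clusters = [
    [("a", [0.7, 0.2, 0.1]), ("b", [0.6, 0.3, 0.1])],
    [("c", [0.1, 0.2, 0.7]), ("d", [0.1, 0.3, 0.6])],
]
index = build_index(clusters)
print(classify([0.72, 0.18, 0.10], index))
```

With R clusters of roughly equal size over N training objects, this baseline computes about R + N/R divergences per query instead of N. The method summarized in the abstract goes further by choosing which medoid to check next from the distances already computed, which this sketch does not attempt.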
