The Nearest Feature Midpoint - A Novel Approach for Pattern Classification

In this paper, we propose a method, called the nearest feature midpoint (NFM), for pattern classification. Any pair of feature points of the same class is generalized by the feature midpoint (FM) between them. Hence the representational capacity of available prototypes can be expanded. The classification is determined by the nearest distance from the query feature point to each FM. This paper compares the NFM classifier against the nearest feature line (NFL) classifier, which has reported successes in various applications. In the NFL, any pair of feature points of the same class is generalized by the feature line (FL) passing through them, and the classification is evaluated on the nearest distance from the query feature point to each FL. The NFM can be considered to be the refinement of the NFL. A theoretical proof is provided in this paper to show that for the n-dimensional Gaussian distribution, the classification based on the NFM distance metric will achieve the least error probability as compared to those based on any other points on the feature lines. Furthermore, a theoretical investigation is provided that under certain assumption the NFL is approximately equivalent to the NFM when the dimension of the feature space is high. The experimental evaluations on both simulated and real-life benchmark data concur with all the theoretical investigations, as well as indicate that the NFM is effective for the classification of the data with a Gaussian distribution or with a distribution that can be reasonably approximated by a Gaussian.

[1]  Max A. Woodbury On a Probability Distribution , 1949 .

[2]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[3]  L. Wasserman,et al.  Computing Bayes Factors by Combining Simulation and Asymptotic Approximations , 1997 .

[4]  Pavel Pudil,et al.  Introduction to Statistical Pattern Recognition , 2006 .

[5]  S. Dasgupta,et al.  Learning probability distributions , 2000 .

[6]  Stan Z. Li,et al.  Face recognition using the nearest feature line method , 1999, IEEE Trans. Neural Networks.

[7]  Thomas M. Cover,et al.  Estimation by the nearest neighbor rule , 1968, IEEE Trans. Inf. Theory.

[8]  Dimitrios Gunopulos,et al.  Locally Adaptive Metric Nearest-Neighbor Classification , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  David G. Stork,et al.  Pattern Classification , 1973 .

[10]  D. J. Newman,et al.  UCI Repository of Machine Learning Database , 1998 .

[11]  P. Bickel,et al.  Some theory for Fisher''s linear discriminant function , 2004 .

[12]  Sahibsingh A. Dudani The Distance-Weighted k-Nearest-Neighbor Rule , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[13]  King-Sun Fu,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Publication Information , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Jerome H. Friedman,et al.  Flexible Metric Nearest Neighbor Classification , 1994 .

[15]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[16]  Stan Z. Li,et al.  Performance Evaluation of the Nearest Feature Line Method in Image Retrieval , 2000 .

[17]  Keinosuke Fukunaga,et al.  Bayes Error Estimation Using Parzen and k-NN Procedures , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  S. Dudoit,et al.  Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data , 2002 .

[19]  G. McLachlan,et al.  Pattern Classification: A Unified View of Statistical and Neural Approaches. , 1998 .

[20]  Keinosuke Fukunaga,et al.  Bias of Nearest Neighbor Error Estimates , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Jürgen Schürmann,et al.  Pattern classification , 2008 .

[22]  Stan Z. Li,et al.  Content-based Classification and Retrieval of Audio Using the Nearest Feature Line Method , 2000 .

[23]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[24]  Robert Tibshirani,et al.  Discriminant Adaptive Nearest Neighbor Classification , 1995, IEEE Trans. Pattern Anal. Mach. Intell..