Nonparametric “anti-Bayesian” quantile-based pattern classification

Parametric and nonparametric pattern recognition have been studied for almost a century based on a Bayesian paradigm, which is, in turn, founded on the principles of Bayes theorem. It is well known that the accuracy of the Bayes classifier cannot be exceeded. Typically, this reduces to comparing the testing sample to mean or median of the respective distributions. Recently, Oommen and his co-authors have presented a pioneering and non-intuitive paradigm, namely that of achieving the classification by comparing the testing sample with another descriptor, which could also be quite distant from the mean. This paradigm has been termed as being “anti-Bayesian,” and it essentially uses the quantiles of the distributions to achieve the pattern recognition. Such classifiers attain the optimal Bayesian accuracy for symmetric distributions even though they operate with a non-intuitive philosophy. While this paradigm has been applied in a number of domains (briefly explained in the body of this paper), its application for nonparametric domains has been limited. This paper explains, in detail, how such quantile-based classification can be extended to the nonparametric world, using both traditional and kernel-based strategies. The paper analyzes the methodology of such nonparametric schemes and their robustness. From a fundamental perspective, the paper utilizes the so-called large sample theory to derive strong asymptotic results that pertain to the equivalence between the parametric and nonparametric schemes for large samples. Apart from the new theoretical results, the paper also presents experimental results demonstrating their power. These results pertain to artificial data sets and also involve a real-life breast cancer data set obtained from the University Hospital Centre of Coimbra. The experimental results clearly confirm the power of the proposed “anti-Bayesian” procedure, especially when approached from a nonparametric perspective.

[1]  Larry S. Davis,et al.  A non-parametric approach to extending generic binary classifiers for multi-classification , 2016, Pattern Recognit..

[2]  Alfred Müller,et al.  Stochastic Ordering of Multivariate Normal Distributions , 2001 .

[3]  B. John Oommen,et al.  Ultimate Order Statistics-Based Prototype Reduction Schemes , 2013, Australasian Conference on Artificial Intelligence.

[4]  Nancy L. Leech,et al.  A Call for Greater Use of Nonparametric Statistics , 2002 .

[5]  Uwe Ligges,et al.  Unequal Priors in Linear Discriminant Analysis , 2019, Journal of Classification.

[6]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[7]  David W. Scott,et al.  Partial Mixture Estimation and Outlier Detection in Data and Regression , 2004 .

[8]  B. John Oommen,et al.  Optimal "Anti-Bayesian" Parametric Pattern Classification Using Order Statistics Criteria , 2012, CIARP.

[9]  W. R. Buckland,et al.  Outliers in Statistical Data , 1979 .

[10]  Anis Yazidi,et al.  "Anti-Bayesian" Flat and Hierarchical Clustering Using Symmetric Quantiloids , 2016, IEA/AIE.

[11]  Lixia Hu A note on order statistics-based parametric pattern classification , 2015, Pattern Recognit..

[12]  D. Farnsworth A First Course in Order Statistics , 1993 .

[13]  Philip H. Ramsey Nonparametric Statistical Methods , 1974, Technometrics.

[14]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[15]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[16]  M. Aitkin,et al.  Mixture Models, Outliers, and the EM Algorithm , 1980 .

[17]  B. John Oommen,et al.  A Novel Border Identification Algorithm Based on an "Anti-Bayesian" Paradigm , 2013, CAIP.

[18]  Ing Rj Ser Approximation Theorems of Mathematical Statistics , 1980 .

[19]  Thao Nguyen-Trang,et al.  A new approach for determining the prior probabilities in the classification problem by Bayesian method , 2017, Adv. Data Anal. Classif..

[20]  M. Rosenblatt Remarks on Some Nonparametric Estimates of a Density Function , 1956 .

[21]  Ignacio López-de-Ullibarri Bandwidth Selection in Kernel Distribution Function Estimation , 2015 .

[22]  B. John Oommen,et al.  Corrigendum to three papers that deal with "Anti"-Bayesian Pattern Recognition [Pattern Recognition] , 2014, Pattern Recognit..

[23]  Stochastic Orders , 2008 .

[24]  B. John Oommen,et al.  The fundamental theory of optimal "Anti-Bayesian" parametric pattern classification using order statistics criteria , 2013, Pattern Recognit..

[25]  B. John Oommen,et al.  Text Classification Using Novel "Anti-Bayesian" Techniques , 2015, ICCCI.

[26]  F. Caramelo,et al.  Using Resistin, glucose, age and BMI to predict the presence of breast cancer , 2018, BMC Cancer.

[27]  D. Binder Bayesian cluster analysis , 1978 .

[28]  C. R. Kothari,et al.  Research Methodology: Methods and Techniques , 2009 .

[29]  A. Azzalini A note on the estimation of a distribution function and quantiles by a kernel method , 1981 .

[30]  K. Athreya,et al.  Measure Theory and Probability Theory , 2006 .

[31]  B. John Oommen,et al.  "Anti-Bayesian" parametric pattern classification using order statistics criteria for some members of the exponential family , 2014, Pattern Recognit..

[32]  B. John Oommen,et al.  Optimal "Anti-Bayesian" Parametric Pattern Classification for the Exponential Family Using Order Statistics Criteria , 2012, ICIAR.

[33]  B. John Oommen,et al.  Order statistics-based parametric classification for multi-dimensional distributions , 2013, Pattern Recognit..

[34]  A. Madansky Identification of Outliers , 1988 .