Directional Quantile Classifiers

We introduce classifiers based on directional quantiles. We derive theoretical results for selecting optimal quantile levels given a direction, and, conversely, an optimal direction given a quantile level. We also show that the misclassification rate is infinitesimal if population distributions differ by at most a location shift and if the number of directions is allowed to diverge at the same rate of the problem's dimension. We illustrate the satisfactory performance of our proposed classifiers in both small and high dimensional settings via a simulation study and a real data example. The code implementing the proposed methods is publicly available in the R package Qtools.

[1]  R. Tibshirani,et al.  Diagnosis of multiple cancer types by shrunken centroids of gene expression , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Linglong Kong,et al.  Quantile tomography: using quantiles with multivariate data , 2008, Statistica Sinica.

[3]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[4]  Brian D. Ripley,et al.  Modern Applied Statistics with S Fourth edition , 2002 .

[5]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[6]  Mee Young Park,et al.  Penalized logistic regression for detecting gene interactions. , 2008, Biostatistics.

[7]  C. Viroli,et al.  Quantile-based classifiers. , 2016, Biometrika.

[8]  D. M. Titterington,et al.  Median-Based Classifiers for High-Dimensional Data , 2009 .

[9]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[10]  A. J. Stam LIMIT THEOREMS FOR UNIFORM DISTRIBUTIONS ON SPHERES IN HIGH-DIMENSIONAL EUCLIDEAN SPACES , 1982 .

[11]  T. Dassopoulos,et al.  Tissue Studies in Screened First-degree Relatives Reveal a Distinct Crohn's Disease Phenotype , 2014, Inflammatory bowel diseases.

[12]  A. Farcomeni,et al.  Quantile contours and allometric modelling for risk classification of abnormal ratios with an application to asymmetric growth-restriction in preterm infants , 2018, Statistical methods in medical research.

[13]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[14]  H. Joe Generating random correlation matrices based on partial correlations , 2006 .

[15]  Yuanhao Lai,et al.  Ensemble Quantile Classifier , 2019, Comput. Stat. Data Anal..

[16]  R. C. Bradley Basic properties of strong mixing conditions. A survey and some open questions , 2005, math/0511078.

[17]  D. Hand,et al.  Idiot's Bayes—Not So Stupid After All? , 2001 .

[18]  Li Wang,et al.  Hybrid huberized support vector machines for microarray classification and gene selection , 2008, Bioinform..