Concentration inequalities for two-sample rank processes with application to bipartite ranking

The ROC curve is the gold standard for measuring the performance of a test/scoring statistic regarding its capacity to discriminate between two statistical populations in a wide variety of applications, ranging from anomaly detection in signal processing to information retrieval, through medical diagnosis. Most practical performance measures used in scor-ing/ranking applications such as the AUC, the local AUC, the p -norm push, the DCG and others, can be viewed as summaries of the ROC curve. In this paper, the fact that most of these empirical criteria can be expressed as two-sample linear rank statistics is highlighted and concentration inequalities for collections of such random variables, referred to as two-sample rank processes here, are proved, when indexed by VC classes of scoring functions. Based on these nonasymptotic bounds, the generalization capacity of empirical maximizers of a wide class of ranking performance criteria is next investigated from a theoretical perspective. It is also supported by empirical evidence through convincing numerical experiments.

[1]  Soumendu Sundar Mukherjee,et al.  Weak convergence and empirical processes , 2019 .

[2]  Marie Frei,et al.  Decoupling From Dependence To Independence , 2016 .

[3]  Aditya Krishna Menon,et al.  Bipartite Ranking: a Risk-Theoretic Perspective , 2016, J. Mach. Learn. Res..

[4]  S. Clémençon,et al.  The TreeRank Tournament algorithm for multipartite ranking , 2015 .

[5]  Stéphan Clémençon,et al.  A statistical view of clustering performance through the theory of U-processes , 2014, J. Multivar. Anal..

[6]  S. Girard,et al.  An Introduction to Dimension Reduction in Nonparametric Kernel Regression , 2014 .

[7]  Stéphan Clémençon,et al.  Ranking data with ordinal labels: optimality and pairwise aggregation , 2013, Machine Learning.

[8]  N. Vayatis,et al.  Overlaying Classifiers: A Practical Approach to Optimal Scoring , 2010 .

[9]  Stéphan Clémençon,et al.  Tree-Based Ranking Methods , 2009, IEEE Transactions on Information Theory.

[10]  N. Vayatis,et al.  Ranking the Best Instances , 2006, J. Mach. Learn. Res..

[11]  Tong Zhang,et al.  Subset Ranking Using Regression , 2006, COLT.

[12]  Cynthia Rudin,et al.  Ranking with a P-Norm Push , 2006, COLT.

[13]  Peter Major,et al.  An estimate on the supremum of a nice class of stochastic integrals and U-statistics , 2006 .

[14]  P. Bartlett,et al.  Local Rademacher complexities and oracle inequalities in risk minimization , 2006 .

[15]  Dan Roth,et al.  Generalization Bounds for the Area Under the ROC Curve , 2005, J. Mach. Learn. Res..

[16]  S. Boucheron,et al.  Theory of classification : a survey of some recent advances , 2005 .

[17]  V. Koltchinskii,et al.  Weighted uniform consistency of kernel density estimators , 2004, math/0410170.

[18]  Natalie Neumeyer,et al.  A central limit theorem for two-sample U-processes , 2004 .

[19]  E. Giné,et al.  Rates of strong uniform consistency for multivariate kernel density estimators , 2002 .

[20]  Adam Krzyzak,et al.  A Distribution-Free Theory of Nonparametric Regression , 2002, Springer series in statistics.

[21]  S. R. Jammalamadaka,et al.  Empirical Processes in M-Estimation , 2001 .

[22]  László Györfi,et al.  A Probabilistic Theory of Pattern Recognition , 1996, Stochastic Modelling and Applied Probability.

[23]  M. Talagrand Sharper Bounds for Gaussian and Empirical Processes , 1994 .

[24]  Stefun D. Leigh U-Statistics Theory and Practice , 1992 .

[25]  M. C. Jones The performance of kernel density functions in kernel distribution function estimation , 1990 .

[26]  Colin McDiarmid,et al.  Surveys in Combinatorics, 1989: On the method of bounded differences , 1989 .

[27]  D. Pollard,et al.  $U$-Processes: Rates of Convergence , 1987 .

[28]  E. Giné,et al.  Some Limit Theorems for Empirical Processes , 1984 .

[29]  Ing Rj Ser Approximation Theorems of Mathematical Statistics , 1980 .

[30]  J. Hájek,et al.  Asymptotic Normality of Simple Linear Rank Statistics Under Alternatives II , 1968 .

[31]  P. Sen,et al.  Theory of rank tests , 1969 .

[32]  M. H. Bretherton,et al.  Statistics in Theory and Practice , 1966 .

[33]  E. Nadaraya,et al.  Some New Estimates for Distribution Functions , 1964 .

[34]  Jaroslav Hájek,et al.  ASYMPTOTICALLY MOST POWERFUL RANK-ORDER TESTS' , 1962 .

[35]  W. Hoeffding A Class of Statistics with Asymptotically Normal Distribution , 1948 .

[36]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .