Representing ordinal input variables in the context of ordinal classification

Ordinal input variables are common in many supervised and unsupervised machine learning problems. We focus on ordinal classification problems, where the target variable is also categorical and ordinal. In order to represent categorical input variables for measuring distances or applying continuous mapping functions, they have to be transformed to numeric values. This paper evaluates five different methods to do so. Two of them are commonly applied by practitioners, the first one based on binarising the ordinal input variable using standard indicator variables (NomBin), and the second one based on directly mapping each category to a consecutive natural number (Num). Furthermore, three novel proposals are evaluated in this paper: 1) an ordinal binarisation based on considering the order of the input variable (OrdBin), 2) the analysis of pairwise distances between input patterns to recover the latent variable generating the ordinal one (NumLVR), and 3) the refinement of the standard numeric transformation by recovering the distance between sets of patterns of consecutive categories (NumCDR). A thorough empirical evaluation is made, considering 12 datasets, 5 performance metrics and 4 classifiers (2 of them of nominal nature and 2 of ordinal nature). The results show that the Nom-Bin representation method leads to the worst results, and that both Num and NumCDR methods obtain very good performance, although NumCDR results are consistently better for almost all performance metrics and classifiers considered.

[1]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[2]  P. McCullagh Regression Models for Ordinal Data , 1980 .

[3]  R. Bender,et al.  Ordinal Logistic Regression in Medical Research , 1997, Journal of the Royal College of Physicians of London.

[4]  Michael J. A. Berry,et al.  Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.

[5]  Eibe Frank,et al.  A Simple Approach to Ordinal Classification , 2001, ECML.

[6]  Roelof K. Brouwer,et al.  A feed-forward network for input that is both categorical and quantitative , 2002, Neural Networks.

[7]  Jaime S. Cardoso,et al.  Modelling ordinal relations with SVMs: An application to objective aesthetic evaluation of breast cancer conservative treatment , 2005, Neural Networks.

[8]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[9]  Willem Waegeman,et al.  An ensemble of Weighted Support Vector Machines for Ordinal Regression , 2007 .

[10]  Wei Chu,et al.  Support Vector Ordinal Regression , 2007, Neural Computation.

[11]  Andrea Esuli,et al.  Evaluation Measures for Ordinal Regression , 2009, 2009 Ninth International Conference on Intelligent Systems Design and Applications.

[12]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[13]  Pedro Antonio Gutiérrez,et al.  Sensitivity Versus Accuracy in Multiclass Problems Using Memetic Pareto Evolutionary Neural Networks , 2010, IEEE Transactions on Neural Networks.

[14]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[15]  Yi-Ping Hung,et al.  Ordinal hyperplanes ranker with cost sensitivities for age estimation , 2011, CVPR 2011.

[16]  Alan Agresti,et al.  Categorical Data Analysis , 2003 .

[17]  Ling Li,et al.  Reduction from Cost-Sensitive Ordinal Ranking to Weighted Binary Classification , 2012, Neural Computation.

[18]  Jun Xu,et al.  The proportional odds with partial proportionality constraints model for ordinal response variables. , 2012, Social science research.

[19]  Pedro Antonio Gutiérrez,et al.  Exploitation of Pairwise Class Distances for Ordinal Classification , 2013, Neural Computation.

[20]  Pedro Antonio Gutiérrez,et al.  Metrics to guide a multi-objective evolutionary algorithm for ordinal classification , 2014, Neurocomputing.

[21]  C. Hervás-Martínez,et al.  An organ allocation system for liver transplantation based on ordinal regression , 2014, Appl. Soft Comput..

[22]  Andrea Esuli,et al.  Feature Selection for Ordinal Text Classification , 2014, Neural Computation.

[23]  Pedro Antonio Gutiérrez,et al.  Ordinal Regression Methods: Survey and Experimental Study , 2016, IEEE Transactions on Knowledge and Data Engineering.