Multinomial Squared Direction Cosines Regression

In this paper we introduce Multinomial Squared Direction Cosines Regression as an alternative Multinomial Response Model. The proposed model offers an intuitive geometric interpretation to the task of estimating posterior class probabilities in multi-class problems. In specific, the latter probabilities correspond to the squared direction cosines between a given pattern and representative class exemplars that form a basis in the decision space. We demonstrate that the model allows for efficient training via a trust region based Newton's Method, provided that the number of model parameters is not too large. Experimental results on several benchmark classification problems show that it compares competitively against Logistic Regression Classifiers, Support Vector Machines, and Classification and Regression Tree models.

[1]  Jorge J. Moré,et al.  Computing a Trust Region Step , 1983 .

[2]  D. M. Titterington,et al.  Comment on “On Discriminative vs. Generative Classifiers: A Comparison of Logistic Regression and Naive Bayes” , 2008, Neural Processing Letters.

[3]  Chih-Jen Lin,et al.  Trust region Newton methods for large-scale logistic regression , 2007, ICML '07.

[4]  D. McFadden A Method of Simulated Moments for Estimation of Discrete Response Models Without Numerical Integration , 1989 .

[5]  Charles R. Johnson,et al.  Topics in Matrix Analysis , 1991 .

[6]  M. Weeks The Multinomial Probit Model Revisited: A Discussion of Parameter Estimability, Identification and Specification Testing , 1997 .

[7]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[8]  Michael I. Jordan,et al.  On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes , 2001, NIPS.

[9]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[10]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[11]  D. Wise,et al.  A CONDITIONAL PROBIT MODEL FOR QUALITATIVE CHOICE: DISCRETE DECISIONS RECOGNIZING INTERDEPENDENCE AND HETEROGENEOUS PREFERENCES' , 1978 .

[12]  Alston S. Householder,et al.  Unitary Triangularization of a Nonsymmetric Matrix , 1958, JACM.

[13]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[14]  P. Green Iteratively reweighted least squares for maximum likelihood estimation , 1984 .

[15]  C. Kwak,et al.  Multinomial Logistic Regression , 2002, Nursing research.

[16]  Carlos F. Daganzo,et al.  Multinomial Probit: The Theory and its Application to Demand Forecasting. , 1980 .

[17]  D. Böhning Multinomial logistic regression algorithm , 1992 .