Robust conjoint analysis by controlling outlier sparsity

Preference measurement (PM) has a long history in marketing, healthcare, and the biobehavioral sciences, where conjoint analysis is commonly used. The goal of PM is to learn the utility function of an individual or a group of individuals from expressed preference data (buying patterns, surveys, ratings), possibly contaminated with outliers. For metric conjoint data, a robust partworth estimator is developed on the basis of a neat connection between ℓ0-(pseudo)norm-regularized regression, and the least-trimmed squared estimator. This connection suggests efficient solvers based on convex relaxation, which lead naturally to a family of robust estimators subsuming Huber's optimal M-class. Outliers are identified by tuning a regularization parameter, which amounts to controlling the sparsity of an outlier vector along the entire robustification path of least-absolute shrinkage and selection operator solutions. For choice-based conjoint analysis, a novel classifier is developed that is capable of attaining desirable tradeoffs between model fit and complexity, while at the same time controlling robustness and revealing the outliers present. Variants accounting for nonlinear utilities and consumer heterogeneity are also investigated.

[1]  S. Rosset,et al.  Piecewise linear regularized solution paths , 2007, 0708.2197.

[2]  A. N. Tikhonov,et al.  Solutions of ill-posed problems , 1977 .

[3]  Allan D. Shocker,et al.  Linear programming techniques for multidimensional analysis of preferences , 1973 .

[4]  PETER J. ROUSSEEUW,et al.  Computing LTS Regression for Large Data Sets , 2005, Data Mining and Knowledge Discovery.

[5]  Jordan J. Louviere,et al.  A Comparison of Experimental Design Strategies for Multinomial Logit Models : The Case of Generic Attributes , 2001 .

[6]  Eric T. Bradlow,et al.  Beyond conjoint analysis: Advances in preference measurement , 2008 .

[7]  David J. Curry,et al.  Prediction in Marketing Using the Support Vector Machine , 2005 .

[8]  Vithala R. Rao,et al.  Conjoint Measurement- for Quantifying Judgmental Data , 1971 .

[9]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[10]  Gonzalo Mateos,et al.  USPACOR: Universal sparsity-controlling outlier rejection , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[12]  Gonzalo Mateos,et al.  Distributed Sparse Linear Regression , 2010, IEEE Transactions on Signal Processing.

[13]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[14]  M. Pontil,et al.  A Convex Optimization Approach to Modeling Consumer Heterogeneity in Conjoint Estimation , 2007 .

[15]  Moshe Ben-Akiva,et al.  Discrete Choice Analysis: Theory and Application to Travel Demand , 1985 .

[16]  G. Wahba Spline models for observational data , 1990 .

[17]  Peter E. Rossi,et al.  Marketing models of consumer heterogeneity , 1998 .

[18]  Joel A. Tropp,et al.  Just relax: convex programming methods for identifying sparse signals in noise , 2006, IEEE Transactions on Information Theory.

[19]  John R. Hauser,et al.  Conjoint Analysis, Related Modeling, and Applications , 2004 .

[20]  John R. Hauser,et al.  Optimization-Based and Machine-Learning Methods for Conjoint Analysis: Estimation and Question Design , 2007 .