Modeling Category Identification Using Sparse Instance Representation

We study the problem of category identification, which involves making inferences about category membership (e.g., a ‘cat’) given a set of features (e.g., has a tail, has four legs). We note that this problem is closely related to classification problems in machine learning, for which standard methods exist, and new methods continue to be developed. Using a large database of associations of features to animal stimuli, made by different people, we test several standard benchmark methods, including nearest neighbor, decision tree, and logistic regression methods. We also apply a new classifier, developed for image processing, which we call Sparse Instance Representation. We show that it is the best-performed, especially when constrained in a novel psychologically interpretable way. We conclude that our results argue for sparse exemplarbased representations of category structures.

[1]  Michael Elad,et al.  From Sparse Solutions of Systems of Equations to Sparse Modeling of Signals and Images , 2009, SIAM Rev..

[2]  Wolf Vanpaemel,et al.  Exemplar by feature applicability matrices and other Dutch normative data for semantic concepts , 2008, Behavior research methods.

[3]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[4]  M. Lee,et al.  Common and distinctive features in stimulus similarity: A modified version of the contrast model , 2004, Psychonomic bulletin & review.

[5]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Tom Goldstein,et al.  The Split Bregman Method for L1-Regularized Problems , 2009, SIAM J. Imaging Sci..

[7]  D. Medin,et al.  SUSTAIN: a network model of category learning. , 2004, Psychological review.

[8]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[9]  J. Bunge,et al.  Estimating the Number of Species: A Review , 1993 .

[10]  Charles Kemp,et al.  Category and feature identification. , 2010, Acta psychologica.

[11]  Daniel J Navarro,et al.  Introduction to the special issue on formal modeling of semantic concepts. , 2010, Acta psychologica.

[12]  W. Vanpaemel,et al.  In search of abstraction: The varying abstraction model of categorization , 2008, Psychonomic bulletin & review.

[13]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[14]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .

[15]  Agnar Aamodt,et al.  Case-Based Reasoning: Foundational Issues, Methodological Variations, and System Approaches , 1994, AI Commun..

[16]  David G. Stork,et al.  Pattern Classification , 1973 .

[17]  J. Kruschke,et al.  ALCOVE: an exemplar-based connectionist model of category learning. , 1992, Psychological review.

[18]  R. Nosofsky Attention, similarity, and the identification-categorization relationship. , 1986, Journal of experimental psychology. General.

[19]  Lawrence Carin,et al.  Sparse multinomial logistic regression: fast algorithms and generalization bounds , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.