Obtaining psychological embeddings through joint kernel and metric learning

Psychological embeddings provide a powerful formalism for characterizing human-perceived similarity among members of a stimulus set. Obtaining high-quality embeddings can be costly due to algorithm design, software deployment, and participant compensation. This work aims to advance state-of-the-art embedding techniques and provide a comprehensive software package that makes obtaining high-quality psychological embeddings both easy and relatively efficient. Contributions are made on four fronts. First, the embedding procedure allows multiple trial configurations (e.g., triplets) to be used for collecting similarity judgments from participants. For example, trials can be configured to collect triplet comparisons or to sort items into groups. Second, a likelihood model is provided for three classes of similarity kernels allowing users to easily infer the parameters of their preferred model using gradient descent. Third, an active selection algorithm is provided that makes data collection more efficient by proposing comparisons that provide the strongest constraints on the embedding. Fourth, the likelihood model allows the specification of group-specific attention weight parameters. A series of experiments are included to highlight each of these contributions and their impact on converging to a high-quality embedding. Collectively, these incremental improvements provide a powerful and complete set of tools for inferring psychological embeddings. The relevant tools are available as the Python package PsiZ, which can be cloned from GitHub (https://github.com/roads/psiz).

[1]  Yuchun Fang,et al.  Experiments in Mental Face Retrieval , 2005, AVBPA.

[2]  Pietro Perona,et al.  The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[3]  R. Shepard Stimulus and response generalization: A stochastic model relating generalization to distance in psychological space , 1957 .

[4]  Angela J. Yu,et al.  Extracting Human Face Similarity Judgments: Pairs or Triplets? , 2016, CogSci.

[5]  W. T. Maddox,et al.  Recency effects as a window to generalization: separating decisional and perceptual sequential effects in category learning. , 2006, Journal of experimental psychology. Learning, memory, and cognition.

[6]  W. Torgerson Multidimensional scaling: I. Theory and method , 1952 .

[7]  J. Tanaka,et al.  Object categories and expertise: Is the basic level in the eye of the beholder? , 1991, Cognitive Psychology.

[8]  R. Nosofsky Overall similarity and the identification of separable-dimension stimuli: A choice model analysis , 1985, Perception & psychophysics.

[9]  Brett D. Roads,et al.  The easy-to-hard training advantage with real-world medical images , 2018, Cognitive Research: Principles and Implications.

[10]  Marin Ferecatu,et al.  A Statistical Framework for Image Category Search from a Mental Picture , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Michael C. Mozer,et al.  Improving Human-Machine Cooperative Classification Via Cognitive Theories of Similarity , 2017, Cogn. Sci..

[12]  Shaogang Gong,et al.  Audio- and Video-based Biometric Person Authentication , 1997, Lecture Notes in Computer Science.

[13]  Kilian Q. Weinberger,et al.  Stochastic triplet embedding , 2012, 2012 IEEE International Workshop on Machine Learning for Signal Processing.

[14]  R. Nosofsky Attention, similarity, and the identification-categorization relationship. , 1986 .

[15]  Joshua B. Tenenbaum,et al.  Bayesian Modeling of Human Concept Learning , 1998, NIPS.

[16]  R. Nosofsky,et al.  Tests of an Exemplar-Memory Model of Classification Learning in a High-Dimensional Natural-Science Category Domain , 2017, Journal of experimental psychology. General.

[17]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. I. , 1962 .

[18]  Robert D. Nowak,et al.  How to Model Implicit Knowledge? Similarity Learning Methods to Assess Perceptions of Visual Representations , 2016, EDM.

[19]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[20]  R. Shepard,et al.  Toward a universal law of generalization for psychological science. , 1987, Science.

[21]  D. Medin,et al.  SUSTAIN: a network model of category learning. , 2004, Psychological review.

[22]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. II , 1962 .

[23]  R. Shepard Geometrical approximations to the structure of musical pitch. , 1982, Psychological review.

[24]  J. Gower Some distance properties of latent root and vector methods used in multivariate analysis , 1966 .

[25]  R. Nosofsky American Psychological Association, Inc. Choice, Similarity, and the Context Theory of Classification , 2022 .

[26]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[27]  R. Nosofsky Attention, similarity, and the identification-categorization relationship. , 1986, Journal of experimental psychology. General.

[28]  J. Kruskal Nonmetric multidimensional scaling: A numerical method , 1964 .

[29]  Subhransu Maji,et al.  Similarity Comparisons for Interactive Fine-Grained Categorization , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Michael S. Bernstein,et al.  Learning Perceptual Kernels for Visualization Design , 2014, IEEE Transactions on Visualization and Computer Graphics.

[31]  Adam Tauman Kalai,et al.  Adaptively Learning the Crowd Kernel , 2011, ICML.

[32]  Joseph L. Zinnes,et al.  Theory and Methods of Scaling. , 1958 .

[33]  R. Shepard Stimulus and response generalization: tests of a model relating generalization to distance in psychological space. , 1958, Journal of experimental psychology.

[34]  Lalit Jain,et al.  NEXT: A system to easily connect crowdsourcing and adaptive data collection , 2017 .

[35]  J. Kruschke,et al.  ALCOVE: an exemplar-based connectionist model of category learning. , 1992, Psychological review.

[36]  R. Luce,et al.  Individual Choice Behavior: A Theoretical Analysis. , 1960 .

[37]  Lalit Jain,et al.  NEXT: A System for Real-World Development, Evaluation, and Application of Active Learning , 2015, NIPS.

[38]  Ryan P. Adams,et al.  Elliptical slice sampling , 2009, AISTATS.

[39]  R. Duncan Luce,et al.  Individual Choice Behavior: A Theoretical Analysis , 1979 .

[40]  Serge J. Belongie,et al.  Cost-Effective HITs for Relative Similarity Comparisons , 2014, HCOMP.

[41]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[42]  W. T. Maddox,et al.  The Role of Similarity in Generalization , 2006 .

[43]  David J. Kriegman,et al.  Generalized Non-metric Multidimensional Scaling , 2007, AISTATS.