Simulating the Fidelity of Data for Large Stimulus Set Sizes and Variable Dimension Estimation in Multidimensional Scaling

Multidimensional scaling (MDS) is a statistical technique commonly used to model the psychological similarity among sets of stimulus items. Typically, MDS has been used with relatively small stimulus sets (30 items or fewer), in part due to the laborious nature of computational analysis and data collection. Modern computing power and newly advanced techniques for speeding data collection have made it possible to conduct MDS with many more stimuli. However, it is as yet unclear if MDS is as well-equipped to model the similarity of large stimulus sets as it is for more modest ones. Here, we conducted 337,500 simulation experiments, wherein hypothetical “true” MDS spaces were created, along with error-perturbed data from simulated “participants.” We examined the fidelity with which the spaces resulting from our “participants” captured the organization of the “true” spaces, as a function of item set size, amount of error in the data (i.e., noise), and dimensionality estimation. We found that although higher set sizes decrease model fit (i.e., they produce increased “stress”), they largely tended to increase determinacy of MDS spaces. These results are predicated, however, on the appropriate estimation of dimensionality of the MDS space. We argue that it is not only reasonable to adopt large stimulus set sizes but tends to be advantageous to do so. Applying MDS to larger sets is appealing, as it affords researchers greater flexibility in stimulus selection, more opportunity for exploration of their stimuli, and a higher likelihood that observed relationships are not due to stimulus-specific idiosyncrasies.

[1]  Marc Strickert,et al.  High-Throughput Multi-dimensional Scaling (HiT-MDS) for cDNA-Array Expression Data , 2005, ICANN.

[2]  Robert L. Goldstone,et al.  Time Course of Comparison , 1994 .

[3]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[4]  I Spence,et al.  A TABLE OF EXPECTED STRESS VALUES FOR RANDOM RANKINGS IN NONMETRIC MULTIDIMENSIONAL SCALING. , 1973, Multivariate behavioral research.

[5]  Jengnan Tzeng,et al.  Multidimensional scaling for large genomic data sets , 2008, BMC Bioinformatics.

[6]  Michael C Hout,et al.  The Novel Object and Unusual Name (NOUN) Database: A collection of novel images for use in experimental research , 2016, Behavior research methods.

[7]  Tamaryn Menneer,et al.  Using multidimensional scaling to quantify similarity in visual search and beyond , 2015, Attention, Perception, & Psychophysics.

[8]  Paul E. Green,et al.  Multidimensional Scaling: Concepts and Applications , 1989 .

[9]  Stephen D. Goldinger,et al.  MM-MDS: A Multidimensional Scaling Database with Similarity Ratings for 240 Object Categories from the Massive Memory Picture Database , 2014, PloS one.

[10]  R. Shepard Stimulus and response generalization: A stochastic model relating generalization to distance in psychological space , 1957 .

[11]  R. Shepard Metric structures in ordinal data , 1966 .

[12]  Gyslain Giguère,et al.  Collecting and analyzing data in multidimensional scaling experiments: A guide for psychologists using SPSS , 2006 .

[13]  R. Shepard,et al.  Stimulus generalization in the learning of classifications. , 1963, Journal of experimental psychology.

[14]  Trevor F. Cox,et al.  Nonmetric multidimensional scaling , 2000 .

[15]  N. Jaworska,et al.  A Review of Multidimensional Scaling (MDS) and its Utility in Various Psychological Domains , 2009 .

[16]  Joseph L. Zinnes,et al.  Theory and Methods of Scaling. , 1958 .

[17]  Herbert H. Stenson,et al.  GOODNESS OF FIT FOR RANDOM RANKINGS IN KRUSKAL'S NONMETRIC SCALING PROCEDURE * , 1969 .

[18]  R. Shepard Stimulus and response generalization: tests of a model relating generalization to distance in psychological space. , 1958, Journal of experimental psychology.

[19]  Aude Oliva,et al.  Visual long-term memory has a massive storage capacity for object details , 2008, Proceedings of the National Academy of Sciences.

[20]  J. Leeuw Applications of Convex Analysis to Multidimensional Scaling , 2000 .

[21]  Forrest W. Young,et al.  Introduction to Multidimensional Scaling: Theory, Methods, and Applications , 1981 .

[22]  Ian Spence,et al.  Monte Carlo studies in nonmetric scaling , 1978 .

[23]  Paul D. Isaac,et al.  On the determination of appropriate dimensionality in data with error , 1974 .

[24]  Richard C. W. Kao,et al.  On a connection between factor analysis and multidimensional unfolding , 1960 .

[25]  C. R. Sherman,et al.  Nonmetric multidimensional scaling: A monte carlo study of the basic parameters , 1972 .

[26]  Matthew Chalmers,et al.  Fast Multidimensional Scaling Through Sampling, Springs and Interpolation , 2003, Inf. Vis..

[27]  R N Shepard,et al.  Multidimensional Scaling, Tree-Fitting, and Clustering , 1980, Science.

[28]  Michael C. Hout,et al.  SpAM is convenient but also satisfying: Reply to Verheyen et al. (2016). , 2016, Journal of experimental psychology. General.

[29]  Grigori Yourganov,et al.  The Perception of Naturalness Correlates with Low-Level Visual Features of Environmental Scenes , 2014, PloS one.

[30]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[31]  Lawrence E. Jones,et al.  The effects of random error and subsampling of dimensions on recovery of configurations by non-metric multidimensional scaling , 1974 .

[32]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[33]  M. Lee Determining the Dimensionality of Multidimensional Scaling Representations for Cognitive Modeling. , 2001, Journal of mathematical psychology.

[34]  Wolf Vanpaemel,et al.  Caveats for the spatial arrangement method: Comment on Hout, Goldinger, and Ferguson (2013). , 2016, Journal of experimental psychology. General.

[35]  Gerrit Storms,et al.  Predicting Lexical Norms Using a Word Association Corpus , 2015, CogSci.

[36]  R. Shepard,et al.  How a cognitive psychologist came to seek universal laws , 2004, Psychonomic bulletin & review.

[37]  R. Nosofsky Attention, similarity, and the identification-categorization relationship. , 1986, Journal of experimental psychology. General.

[38]  R. Haber,et al.  Perception and memory for pictures: Single-trial learning of 2500 visual stimuli , 1970 .

[39]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[40]  Michael C. Hout,et al.  The versatility of SpAM: a fast, efficient, spatial method of data collection for multidimensional scaling. , 2013, Journal of experimental psychology. General.

[41]  R. Shepard,et al.  Toward a universal law of generalization for psychological science. , 1987, Science.

[42]  H. Egeth,et al.  Massive memory revisited: Limitations on storage capacity for object details in visual long-term memory , 2015, Learning & memory.

[43]  Patrick Mair,et al.  Goodness-of-Fit Assessment in Multidimensional Scaling and Unfolding , 2016, Multivariate behavioral research.

[44]  Ron Kimmel,et al.  Spectral multidimensional scaling , 2013, Proceedings of the National Academy of Sciences.

[45]  Forrest W. Young Nonmetric multidimensional scaling: Recovery of metric information , 1970 .

[46]  Dimitris K. Agrafiotis,et al.  Multidimensional scaling and visualization of large molecular similarity tables , 2001, J. Comput. Chem..

[47]  N. Kriegeskorte,et al.  Inverse MDS: Inferring Dissimilarity Structure from Multiple Item Arrangements , 2012, Front. Psychology.

[48]  R. Goldstone An efficient method for obtaining similarity data , 1994 .

[49]  Man-Suk Oh,et al.  A simple and efficient Bayesian procedure for selecting dimensionality in multidimensional scaling , 2012, J. Multivar. Anal..