A tutorial on cue combination and Signal Detection Theory: Using changes in sensitivity to evaluate how observers integrate sensory information

Many sensory inputs contain multiple sources of information (‘cues’), such as two sounds of different frequencies, or a voice heard in unison with moving lips. Often, each cue provides a separate estimate of the same physical attribute, such as the size or location of an object. An ideal observer can exploit such redundant sensory information to improve the accuracy of their perceptual judgments. For example, if each cue is modeled as an independent, Gaussian, random variable, then combining Ncues should provide up to a √N improvement in detection/discrimination sensitivity. Alternatively, a less efficient observer may base their decision on only a subset of the available information, and so gain little or no benefit from having access to multiple sources of information. Here we use Signal Detection Theory to formulate and compare various models of cue-combination, many of which are commonly used to explain empirical data. We alert the reader to the key assumptions inherent in each model, and provide formulas for deriving quantitative predictions. Code is also provided for simulating each model, allowing expected levels of measurement error to be quantified. Based on these results, it is shown that predicted sensitivity often differs surprisingly little between qualitatively distinct models of combination. This means that sensitivity alone is not sufficient for understanding decision efficiency, and the implications of this are discussed.

[1]  Thomas H. Weisswange,et al.  Bayesian Cue Integration as a Developmental Outcome of Reward Mediated Learning , 2011, PloS one.

[2]  Alexander S. Ecker Data analysis code for the paper "State dependence of noise correlations in macaque primary visual cortex" , 2014 .

[3]  James M. Hillis,et al.  Slant from texture and disparity cues: optimal cue combination. , 2004, Journal of vision.

[4]  Samuel Kotz,et al.  Exact Distribution of the Max/Min of Two Gaussian Random Variables , 2008, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[5]  P. Cavanagh,et al.  Different processing strategies underlie voluntary averaging in low and high noise. , 2012, Journal of vision.

[6]  James M. Hillis,et al.  Combining Sensory Information: Mandatory Fusion Within, but Not Between, Senses , 2002, Science.

[7]  Robert A Jacobs,et al.  Bayesian integration of visual and auditory signals for spatial localization. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[8]  J. Guillemot,et al.  Temporary Deafness Can Impair Multisensory Integration , 2013, Psychological science.

[9]  Marko Nardini,et al.  Late Development of Cue Integration Is Linked to Sensory Fusion in Cortex , 2015, Current Biology.

[10]  T. Maung on in C , 2010 .

[11]  E. Viding,et al.  Load theory of selective attention and cognitive control. , 2004, Journal of experimental psychology. General.

[12]  Lawrence G. McDade,et al.  Behavioral Indices of Multisensory Integration: Orientation to Visual Cues is Affected by Auditory Stimuli , 1989, Journal of Cognitive Neuroscience.

[13]  Gregory C. DeAngelis,et al.  Bridging the gap between theories of sensory cue integration and the physiology of multisensory neurons , 2013, Nature Reviews Neuroscience.

[14]  J. Solomon Intrinsic uncertainty explains second responses. , 2007, Spatial vision.

[15]  Roger S. Anderson,et al.  Changes in Ricco’s Area with Background Luminance in the S-Cone Pathway , 2013, Optometry and vision science : official publication of the American Academy of Optometry.

[16]  Marius Usher,et al.  The Timescale of Perceptual Evidence Integration Can Be Adapted to the Environment , 2013, Current Biology.

[17]  Christopher R Fetsch,et al.  Neural correlates of reliability-based cue weighting during multisensory integration , 2011, Nature Neuroscience.

[18]  A. Baddeley,et al.  The multi-component model of working memory: Explorations in experimental cognitive psychology , 2006, Neuroscience.

[19]  Neil A. Macmillan,et al.  Detection Theory: A User's Guide , 1991 .

[20]  Sygal Amitay,et al.  Learning to detect a tone in unpredictable noise. , 2014, The Journal of the Acoustical Society of America.

[21]  G. Meinhardt,et al.  Cue combination anisotropies in contour integration: The role of lower spatial frequencies. , 2015, Journal of vision.

[22]  J. C. Middlebrooks,et al.  Listener weighting of cues for lateral angle: the duplex theory of sound localization revisited. , 2002, The Journal of the Acoustical Society of America.

[23]  Steven C. Dakin,et al.  Local and global limitations on direction integration assessed using equivalent noise analysis , 2005, Vision Research.

[24]  M. Shadlen,et al.  The effect of stimulus strength on the speed and accuracy of a perceptual decision. , 2005, Journal of vision.

[25]  Joshua A Solomon,et al.  Visual discrimination of orientation statistics in crowded and uncrowded arrays. , 2010, Journal of vision.

[26]  Bradley C. Love,et al.  Learning in Noise: Dynamic Decision-Making in a Variable Environment. , 2009, Journal of mathematical psychology.

[27]  Eero P. Simoncelli,et al.  Partitioning neuronal variability , 2014, Nature Neuroscience.

[28]  Neil W. Roach,et al.  Resolving multisensory conflict: a strategy for balancing the costs and benefits of audio-visual integration , 2006, Proceedings of the Royal Society B: Biological Sciences.

[29]  M. Nardini,et al.  Fusion of visual cues is not mandatory in children , 2010, Proceedings of the National Academy of Sciences.

[30]  Charles S. Watson,et al.  Receiver‐Operating Characteristics Determined by a Mechanical Analog to the Rating Scale , 1964 .

[31]  T. Wolbers,et al.  How cognitive aging affects multisensory integration of navigational cues , 2014, Neurobiology of Aging.

[32]  P. Bertelson,et al.  Multisensory integration, perception and ecological validity , 2003, Trends in Cognitive Sciences.

[33]  Patrick Cavanagh,et al.  Crowding in a detection task: External noise triggers change in processing strategy , 2011, Vision Research.

[34]  A R Palmer,et al.  Intensity coding in low-frequency auditory-nerve fibers of the guinea pig. , 1991, The Journal of the Acoustical Society of America.

[35]  Christopher R Fetsch,et al.  Dynamic Reweighting of Visual and Vestibular Cues during Self-Motion Perception , 2009, The Journal of Neuroscience.

[36]  Robert A Lutfi,et al.  Informational masking in hearing-impaired and normal-hearing listeners: sensation level and decision weights. , 2004, The Journal of the Acoustical Society of America.

[37]  M C Teich,et al.  Pulse-number distribution for the neural spike train in the cat's auditory nerve. , 1983, The Journal of the Acoustical Society of America.

[38]  C D Creelman,et al.  Triangles in ROC space: History and theory of “nonparametric” measures of sensitivity and response bias , 1996, Psychonomic bulletin & review.

[39]  Walt Jesteadt,et al.  A measure of internal noise based on sample discrimination. , 2003, The Journal of the Acoustical Society of America.

[40]  J. Saunders,et al.  Do humans optimally integrate stereo and texture information for judgments of surface slant? , 2003, Vision Research.

[41]  Johan Wagemans,et al.  Integration of contour and surface information in shape detection , 2011, Vision Research.

[42]  F. A. Geldard,et al.  MULTIPLE CUTANEOUS STIMULATION: THE DISCRIMINATION OF VIBRATORY PATTERNS. , 1965, The Journal of the Acoustical Society of America.

[43]  M. Wallace,et al.  Superior colliculus neurons use distinct operational modes in the integration of multisensory stimuli. , 2005, Journal of neurophysiology.

[44]  T. Stanford,et al.  Evaluating the Operations Underlying Multisensory Integration in the Cat Superior Colliculus , 2005, The Journal of Neuroscience.

[45]  Pete R. Jones,et al.  Development of Cue Integration in Human Navigation , 2008, Current Biology.

[46]  G. Grimmett,et al.  Probability and random processes , 2002 .

[47]  Robert A. Jacobs,et al.  Comparing perceptual learning across tasks: A review , 2002 .

[48]  A. Pouget,et al.  Neural correlations, population coding and computation , 2006, Nature Reviews Neuroscience.

[49]  Kurt Wiesenfeld,et al.  Mechanoelectrical transduction assisted by Brownian motion: a role for noise in the auditory system , 1998, Nature Neuroscience.

[50]  C. Nelson,et al.  The functional emergence of prefrontally-guided working memory systems in four- to eight-year-old children , 1998, Neuropsychologia.

[51]  Steven C. Dakin,et al.  Averaging, not internal noise, limits the development of coherent motion processing , 2014, Developmental Cognitive Neuroscience.

[52]  Daniel E. Shub,et al.  Reduction of internal noise in auditory perceptual learning. , 2013, The Journal of the Acoustical Society of America.

[53]  T. Wickens Elementary Signal Detection Theory , 2001 .

[54]  Jeff Miller,et al.  Divided attention: Evidence for coactivation with redundant signals , 1982, Cognitive Psychology.

[55]  Lynn Hasher,et al.  Working Memory, Comprehension, and Aging: A Review and a New View , 1988 .

[56]  T. Sejnowski,et al.  Correlated neuronal activity and the flow of neural information , 2001, Nature Reviews Neuroscience.

[57]  Rémy Allard,et al.  Zero-dimensional noise is not suitable for characterizing processing properties of detection mechanisms. , 2013, Journal of vision.

[58]  F. Rieke,et al.  Mechanisms Regulating Variability of the Single Photon Responses of Mammalian Rod Photoreceptors , 2002, Neuron.

[59]  N. Macmillan,et al.  Response bias : characteristics of detection theory, threshold theory, and nonparametric indexes , 1990 .

[60]  David Whitney,et al.  Perceiving Crowd Attention , 2014, Psychological science.

[61]  C. Tyler,et al.  Signal detection theory in the 2AFC paradigm: attention, channel uncertainty and probability summation , 2000, Vision Research.

[62]  Huanping Dai,et al.  Psychophysical reverse correlation with multiple response alternatives. , 2010, Journal of experimental psychology. Human perception and performance.

[63]  E D Young,et al.  Rate responses of auditory nerve fibers to tones in noise near masked threshold. , 1986, The Journal of the Acoustical Society of America.

[64]  Marko Nardini,et al.  Visual and Non-Visual Navigation in Blind Patients with a Retinal Prosthesis , 2015, PloS one.

[65]  James L. McClelland,et al.  The time course of perceptual choice: the leaky, competing accumulator model. , 2001, Psychological review.

[66]  Robert A Lutfi,et al.  Sample discrimination of frequency by hearing-impaired and normal-hearing listeners. , 2008, The Journal of the Acoustical Society of America.

[67]  Hiroshi Ban,et al.  The integration of motion and disparity cues to depth in dorsal visual cortex , 2012, Nature Neuroscience.

[68]  R. Blake,et al.  Further developments in binocular summation , 1981, Perception & psychophysics.

[69]  A. Pouget,et al.  Efficient computation and cue integration with noisy population codes , 2001, Nature Neuroscience.

[70]  L. R. Young,et al.  Influence of combined visual and vestibular cues on human perception and control of horizontal rotation , 2004, Experimental Brain Research.

[71]  N. Cowan,et al.  The Magical Mystery Four , 2010, Current directions in psychological science.

[72]  Hans Colonius,et al.  The race model inequality for censored reaction time distributions , 2010, Attention, perception & psychophysics.

[73]  Fernando R Nodal,et al.  Functional topography of converging visual and auditory inputs to neurons in the rat superior colliculus. , 2004, Journal of Neurophysiology.

[74]  M. Ernst,et al.  Humans integrate visual and haptic information in a statistically optimal fashion , 2002, Nature.

[75]  Laurence T Maloney,et al.  Effective integration of serially presented stochastic cues. , 2012, Journal of vision.

[76]  Alexander S. Ecker,et al.  State Dependence of Noise Correlations in Macaque Primary Visual Cortex , 2014, Neuron.

[77]  J. Swets The Relative Operating Characteristic in Psychology , 1973, Science.

[78]  L. F Abbott,et al.  Lapicque’s introduction of the integrate-and-fire model neuron (1907) , 1999, Brain Research Bulletin.

[79]  Joshua A Solomon,et al.  Efficiencies for the statistics of size discrimination. , 2011, Journal of vision.

[80]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[81]  Alice F. Healy,et al.  The decision rule in probabilistic categorization: What it is and how it is learned. , 1977 .

[82]  A. Gorea,et al.  Summary statistics for size over space and time. , 2014, Journal of vision.

[83]  M. Wallace,et al.  Enhanced multisensory integration in older adults , 2006, Neurobiology of Aging.

[84]  Alexandre Pouget,et al.  Optimal multisensory decision-making in a reaction-time task , 2014, eLife.

[85]  D. Knill,et al.  The Bayesian brain: the role of uncertainty in neural coding and computation , 2004, Trends in Neurosciences.

[86]  Philip L. Smith,et al.  A comparison of sequential sampling models for two-choice reaction time. , 2004, Psychological review.

[87]  A Kohlrausch,et al.  Differences in auditory performance between monaural and dichotic conditions. I: masking thresholds in frozen noise. , 1992, The Journal of the Acoustical Society of America.

[88]  E. Schröger,et al.  Speeded responses to audiovisual signal changes result from bimodal integration. , 1998, Psychophysiology.

[89]  Paul B Hibbard,et al.  Statistically optimal integration of biased sensory estimates. , 2011, Journal of vision.

[90]  Pete R. Jones,et al.  Development of Auditory Selective Attention: Why Children Struggle to Hear in Noisy Environments , 2015, Developmental psychology.

[91]  V. Richards,et al.  Relative estimates of combination weights, decision criteria, and internal noise based on correlation coefficients. , 1994, The Journal of the Acoustical Society of America.

[92]  Christian Wallraven,et al.  Serial exploration of faces: comparing vision and touch. , 2012, Journal of vision.

[93]  Konrad Paul Kording,et al.  Sensory Cue Integration , 2011 .

[94]  Pascal Mamassian,et al.  Noise and Correlations in Parallel Perceptual Decision Making , 2011 .

[95]  E. Javel,et al.  Stochastic properties of cat auditory nerve responses to electric and acoustic stimuli and application to intensity discrimination. , 2000, The Journal of the Acoustical Society of America.

[96]  D. Wolpert,et al.  When Feeling Is More Important Than Seeing in Sensorimotor Adaptation , 2002, Current Biology.

[97]  M. Landy,et al.  Weighted linear cue combination with possibly correlated error , 2003, Vision Research.

[98]  Jeffrey D. Schall,et al.  Neural basis of deciding, choosing and acting , 2001, Nature Reviews Neuroscience.

[99]  S. Dakin Information limit on the spatial integration of local orientation signals. , 2001, Journal of the Optical Society of America. A, Optics, image science, and vision.

[100]  J. Townsend,et al.  An accuracy-response time capacity assessment function that measures performance against standard parallel predictions. , 2012, Psychological review.

[101]  T. Stanford,et al.  Multisensory integration: current issues from the perspective of the single neuron , 2008, Nature Reviews Neuroscience.

[102]  R. Zemel,et al.  Inference and computation with population codes. , 2003, Annual review of neuroscience.

[103]  Roger W Li,et al.  Perceptual learning improves efficiency by re-tuning the decision 'template' for position discrimination , 2004, Nature Neuroscience.

[104]  Daniel E. Shub,et al.  The Role of Response Bias in Perceptual Learning , 2015, Journal of experimental psychology. Learning, memory, and cognition.

[105]  Wei Ji Ma,et al.  Bayesian inference with probabilistic population codes , 2006, Nature Neuroscience.

[106]  L. Harris,et al.  Optimal audiovisual integration in people with one eye. , 2014, Multisensory research.

[107]  Nicholas Altieri,et al.  A measure for assessing the effects of audiovisual speech integration , 2013, Behavior Research Methods.

[108]  W. Newsome,et al.  The Variable Discharge of Cortical Neurons: Implications for Connectivity, Computation, and Information Coding , 1998, The Journal of Neuroscience.

[109]  Theodore G. Birdsall,et al.  Definitions of d′ and η as Psychophysical Measures , 1958 .

[110]  D. M. Green,et al.  Multiple Observations of Signals in Noise , 1959 .

[111]  D. Burr,et al.  The Ventriloquist Effect Results from Near-Optimal Bimodal Integration , 2004, Current Biology.

[112]  J. Swets Indices of discrimination or diagnostic accuracy: their ROCs and implied models. , 1986, Psychological bulletin.

[113]  Michel Treisman,et al.  Combining Information: Probability Summation and Probability Averaging in Detection and Discrimination , 1998 .

[114]  U. Noppeney,et al.  Superadditive responses in superior temporal sulcus predict audiovisual benefits in object categorization. , 2010, Cerebral cortex.

[115]  M. Landy,et al.  Measurement and modeling of depth cue combination: in defense of weak fusion , 1995, Vision Research.

[116]  J. A. Movshon,et al.  The dependence of response amplitude and variance of cat visual cortical neurones on stimulus contrast , 1981, Experimental Brain Research.

[117]  D. Tank,et al.  Imaging Large-Scale Neural Activity with Cellular Resolution in Awake, Mobile Mice , 2007, Neuron.

[118]  Rolf Ulrich,et al.  Testing the race model inequality: An algorithm and computer programs , 2007, Behavior research methods.

[119]  G. DeAngelis,et al.  Multisensory Integration in Macaque Visual Cortex Depends on Cue Reliability , 2008, Neuron.

[120]  Christopher Summerfield,et al.  Near-optimal Integration of Magnitude in the Human Parietal Cortex , 2016, Journal of Cognitive Neuroscience.

[121]  M. Nardini,et al.  When vision is not an option: children’s integration of auditory and haptic information is suboptimal , 2014, Developmental science.

[122]  S. DeVries,et al.  Bipolar Cells Use Kainate and AMPA Receptors to Filter Visual Information into Separate Channels , 2000, Neuron.

[123]  Paul J. Laurienti,et al.  Applying capacity analyses to psychophysical evaluation of multisensory interactions , 2010, Inf. Fusion.

[124]  G. DeAngelis,et al.  Multisensory integration: psychophysics, neurophysiology, and computation , 2009, Current Opinion in Neurobiology.

[125]  M. Landy,et al.  Combination of texture and color cues in visual segmentation , 2012, Vision Research.

[126]  Jason M. Gold,et al.  Characterizing perceptual learning with external noise , 2004, Cogn. Sci..

[127]  Roger Ratcliff,et al.  The Diffusion Decision Model: Theory and Data for Two-Choice Decision Tasks , 2008, Neural Computation.

[128]  M. Landy,et al.  Ideal-Observer Models of Cue Integration , 2012 .

[129]  C. Spence,et al.  Multisensory Integration: Space, Time and Superadditivity , 2005, Current Biology.

[130]  B. Berg,et al.  Observer efficiency and weights in a multiple observation task. , 1990, The Journal of the Acoustical Society of America.

[131]  G. DeAngelis,et al.  Neural correlates of multisensory cue integration in macaque MSTd , 2008, Nature Neuroscience.

[132]  Robert Fox,et al.  The psychophysical inquiry into binocular summation , 1973 .

[133]  Ryan A. Stevenson,et al.  Audiovisual integration in human superior temporal sulcus: Inverse effectiveness and the neural processing of speech and object recognition , 2009, NeuroImage.

[134]  Robert A. Lutfi,et al.  Correlation coefficients and correlation ratios as estimates of observer weights in multiple‐observation tasks , 1995 .

[135]  David C. Burr,et al.  Young Children Do Not Integrate Visual and Haptic Form Information , 2008, Current Biology.

[136]  G R Grice,et al.  Combination rule for redundant information in reaction time tasks with divided attention , 1984, Perception & psychophysics.

[137]  W. J. McGill,et al.  A study of the near-miss involving Weber’s law and pure-tone intensity discrimination , 1968 .

[138]  J SWETS,et al.  Decision processes in perception. , 1961, Psychological review.

[139]  Colin Blakemore,et al.  Vision: Coding and Efficiency , 1991 .

[140]  J. Gold,et al.  The neural basis of decision making. , 2007, Annual review of neuroscience.

[141]  P. Matthews,et al.  Independent anatomical and functional measures of the V1/V2 boundary in human visual cortex. , 2005, Journal of vision.

[142]  B. Röder,et al.  Basic Multisensory Functions Can Be Acquired After Congenital Visual Pattern Deprivation in Humans , 2012, Developmental neuropsychology.

[143]  B. Wright,et al.  Different patterns of human discrimination learning for two interaural cues to sound-source location , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[144]  B. Breitmeyer,et al.  Recent models and findings in visual backward masking: A comparison, review, and update , 2000, Perception & psychophysics.

[145]  M HERSHENSON,et al.  Reaction time as a measure of intersensory facilitation. , 1962, Journal of experimental psychology.

[146]  Alexandre Pouget,et al.  A computational perspective on the neural basis of multisensory spatial representations , 2002, Nature Reviews Neuroscience.

[147]  R. Jacobs What determines visual cue reliability? , 2002, Trends in Cognitive Sciences.