Large-Scale Analysis of Auditory Segregation Behavior Crowdsourced via a Smartphone App

The human auditory system is adept at detecting sound sources of interest in a complex mixture of simultaneous sounds. The ability to selectively attend to the speech of one speaker whilst ignoring other speakers and background noise is of vital biological significance, and the capacity to make sense of complex ‘auditory scenes’ is significantly impaired in aging populations as well as in those with hearing loss. We investigated this problem by designing a synthetic signal, termed the ‘stochastic figure-ground’ stimulus, that captures essential aspects of complex sounds in the natural environment. Previously, we showed that under controlled laboratory conditions, young listeners sampled from the university subject pool (n = 10) performed very well in detecting targets embedded in the stochastic figure-ground signal. Here, we presented a modified version of this cocktail party paradigm as a ‘game’ featured in a smartphone app (The Great Brain Experiment) and obtained data from a large, demographically diverse population (n = 5148). Despite differences in paradigms and experimental settings, the target-detection performance of app users was robust and consistent with our previous results from the psychophysical study. Our results highlight the potential of smartphone apps for capturing robust large-scale auditory behavioral data from normal healthy volunteers, an approach that can also be extended to study auditory deficits in clinical populations with hearing impairments and central auditory disorders.
