Efficient codes for memory determine pitch representations

Abstract Perceptual systems have finite memory resources and must store incoming signals in compressed formats. To test whether representations of a sound’s pitch might derive from this need for compression, we compared discrimination of harmonic and inharmonic sounds across delays. In contrast to inharmonic spectra, harmonic spectra can be summarized, and thus compressed, using their fundamental frequency (f0). Despite being comparable for sounds presented back-to-back, discrimination was better for harmonic than inharmonic stimuli when sounds were separated in time, implicating memory representations unique to harmonic sounds. Patterns of individual differences indicated that listeners use different representations depending on the time delay between sounds, directly comparing the spectra of temporally adjacent sounds, but transitioning to comparing f0s across delays. The need to compress sound into memory appears to determine reliance on f0-based pitch, and may explain its importance in music, in which listeners must extract relationships between notes separated in time.

[1]  L. Demany,et al.  Dissociation of pitch from timbre in auditory short-term memory. , 1991, The Journal of the Acoustical Society of America.

[2]  Christopher Plack Comprar Oxford Handbook of Auditory Science Hearing | Christopher Plack | 978-0-19-923355-7 | Oxford University Press , 2010 .

[3]  Jonathan W. Pillow,et al.  Error-correcting dynamics in visual working memory , 2018 .

[4]  HIDEKI KAWAHARA,et al.  Technical foundations of TANDEM-STRAIGHT, a speech analysis, modification and synthesis framework , 2011 .

[5]  Josh H McDermott,et al.  Ecological origins of perceptual grouping principles in the auditory system , 2019, Proceedings of the National Academy of Sciences.

[6]  Kerry M. M. Walker,et al.  Across-species differences in pitch perception are consistent with differences in cochlear filtering , 2019, eLife.

[7]  N. Cowan On short and long auditory stores. , 1984, Psychological bulletin.

[8]  Albert S. Bregman,et al.  The Auditory Scene. (Book Reviews: Auditory Scene Analysis. The Perceptual Organization of Sound.) , 1990 .

[9]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[10]  Sarah J Wilson,et al.  The musical environment and auditory plasticity: hearing the pitch of percussion , 2013, Front. Psychol..

[11]  B. Repp Categorical Perception: Issues, Methods, Findings , 1984 .

[12]  R. Patterson,et al.  The Processing of Temporal Pitch and Melody Information in Auditory Cortex , 2002, Neuron.

[13]  Josh H. McDermott,et al.  Adaptive and Selective Time Averaging of Auditory Scenes , 2018, Current Biology.

[14]  N. Cowan,et al.  The role of absolute and relative amounts of time in forgetting within immediate memory: The case of tone-pitch comparisons , 1997 .

[15]  Paul Iverson,et al.  Name that tune: Identifying popular recordings from brief excerpts , 1999, Psychonomic bulletin & review.

[16]  Daniel Pressnitzer,et al.  Rapid Formation of Robust Auditory Memories: Insights from Noise , 2010, Neuron.

[17]  W. Dowling,et al.  Contour, interval, and pitch recognition in memory for melodies. , 1971, The Journal of the Acoustical Society of America.

[18]  M. Corballis Wandering tales: evolutionary origins of mental time travel and language , 2013, Front. Psychol..

[19]  G. Bower,et al.  THE PSYCHOLOGY OF LEARNING AND M·OTIVATION , 2001 .

[20]  M. Posner,et al.  Retention of Abstract Ideas. , 1970 .

[21]  Roy D. Patterson,et al.  Distortion products and the perceived pitch of harmonic complex tones , 2001 .

[22]  D. Pisoni Auditory and phonetic memory codes in the discrimination of consonants and vowels , 1973, Perception & psychophysics.

[23]  C. Plack Oxford Handbook of Auditory Science: Hearing , 2010 .

[24]  T Houtgast,et al.  Subharmonic pitches of a pure tone at low S/N ratio. , 1976, The Journal of the Acoustical Society of America.

[25]  Timothy F. Brady,et al.  A probabilistic model of visual working memory: Incorporating higher order regularities into working memory capacity estimates. , 2013, Psychological review.

[26]  L. Demany,et al.  Auditory Change Detection: Simple Sounds Are Not Memorized Better Than Complex Sounds , 2008, Psychological science.

[27]  L. Kishon-Rabin,et al.  Pitch Discrimination: Are Professional Musicians Better than Non-Musicians? , 2001, Journal of basic and clinical physiology and pharmacology.

[28]  Barbara Tillmann,et al.  Impaired short-term memory for pitch in congenital amusia , 2016, Brain Research.

[29]  B. Delgutte,et al.  Pitch of complex tones: rate-place and interspike interval representations in the auditory nerve. , 2005, Journal of neurophysiology.

[30]  A. Oxenham Pitch Perception , 2012, The Journal of Neuroscience.

[31]  Hideki Kawahara,et al.  Inharmonic speech reveals the role of harmonicity in the cocktail party problem , 2018, Nature Communications.

[32]  Terrence J. Sejnowski,et al.  Unsupervised Learning , 2018, Encyclopedia of GIS.

[33]  Josh H McDermott,et al.  Schema learning for the cocktail party problem , 2018, Proceedings of the National Academy of Sciences.

[34]  Yaoda Xu Understanding the object benefit in visual short-term memory: The roles of feature proximity and connectedness , 2006, Perception & Psychophysics.

[35]  D W Massaro,et al.  Retroactive interference in short-term recognition memory for pitch. , 1970, Journal of experimental psychology.

[36]  A. Yuille,et al.  Object perception as Bayesian inference. , 2004, Annual review of psychology.

[37]  Julie M. Harris,et al.  Optimal integration of shading and binocular disparity for depth perception. , 2012, Journal of vision.

[38]  Josh H. McDermott,et al.  Cortical Pitch Regions in Humans Respond Primarily to Resolved Harmonics and Are Located in Specific Tonotopic Regions of Anterior Auditory Cortex , 2013, The Journal of Neuroscience.

[39]  Bevil R. Conway,et al.  Divergence in the Functional Organization of Human and Macaque Auditory Cortex Revealed by fMRI Responses to Harmonic Tones , 2019, Nature Neuroscience.

[40]  Emily J. Allen,et al.  Representations of Pitch and Timbre Variation in Human Auditory Cortex , 2017, The Journal of Neuroscience.

[41]  W A Wickelgren,et al.  Distinctive features and errors in short-term memory for English vowels. , 1965, The Journal of the Acoustical Society of America.

[42]  R. Carlyon,et al.  The role of resolved and unresolved harmonics in pitch perception and frequency modulation discrimination. , 1994, The Journal of the Acoustical Society of America.

[43]  Wayne A. Wickelgren,et al.  Associative strength theory of recognition memory for pitch , 1969 .

[44]  R. G. Crowder,et al.  Decay of auditory memory in vowel discrimination. , 1982, Journal of experimental psychology. Learning, memory, and cognition.

[45]  T. Pasternak,et al.  Working memory in primate sensory systems , 2005, Nature Reviews Neuroscience.

[46]  L. Demany,et al.  Speech versus nonspeech in pitch memory. , 1996, The Journal of the Acoustical Society of America.

[47]  George Sperling,et al.  The information available in brief visual presentations. , 1960 .

[48]  Masataka Goto,et al.  RWC Music Database: Music genre database and musical instrument sound database , 2003, ISMIR.

[49]  A. Oxenham,et al.  Influence of musical and psychoacoustical training on pitch discrimination , 2006, Hearing Research.

[50]  Edgar Erdfelder,et al.  G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences , 2007, Behavior research methods.

[51]  Josh H. McDermott,et al.  Indifference to dissonance in native Amazonians reveals cultural variation in music perception , 2016, Nature.

[52]  D. Bendor,et al.  The neuronal representation of pitch in primate auditory cortex , 2005, Nature.

[53]  D. Pisoni Auditory short-term memory and vowel perception , 1975, Memory & cognition.

[54]  Aniruddh D. Patel,et al.  Songbirds use spectral shape, not pitch, for sound pattern recognition , 2016, Proceedings of the National Academy of Sciences.

[55]  J. Harris,et al.  The decline of pitch discrimination with time. , 1952, Journal of experimental psychology.

[56]  Edward K. Vogel,et al.  The capacity of visual working memory for features and conjunctions , 1997, Nature.

[57]  Andrew J Oxenham,et al.  Does fundamental-frequency discrimination measure virtual pitch discrimination? , 2010, The Journal of the Acoustical Society of America.

[58]  M. Ernst,et al.  Humans integrate visual and haptic information in a statistically optimal fashion , 2002, Nature.

[59]  Timothy F. Brady,et al.  Compression in visual working memory: using statistical regularities to form more efficient memory representations. , 2009, Journal of experimental psychology. General.

[60]  Michael S. Lewicki,et al.  Efficient coding of natural sounds , 2002, Nature Neuroscience.

[61]  David Whitaker,et al.  The retention and disruption of color information in human short-term visual memory. , 2012, Journal of vision.

[62]  H. B. Barlow,et al.  Possible Principles Underlying the Transformations of Sensory Messages , 2012 .

[63]  Josh H. McDermott,et al.  Distortion products in auditory fMRI research: Measurements and solutions , 2016, NeuroImage.

[64]  Michael I. Posner,et al.  Abstraction and The Process of Recognition , 1970 .

[65]  Laurent Demany,et al.  On the binding of successive sounds: perceiving shifts in nonperceived pitches. , 2005, The Journal of the Acoustical Society of America.

[66]  Andrew J Oxenham,et al.  An autocorrelation model with place dependence to account for the effect of harmonic number on fundamental frequency discrimination. , 2005, The Journal of the Acoustical Society of America.

[67]  Nori Jacoby,et al.  Perceptual fusion of musical notes by native Amazonians suggests universal representations of musical intervals , 2020, Nature Communications.

[68]  Daniel P. W. Ellis,et al.  Inharmonic speech: a tool for the study of speech perception and separation , 2012, SAPA@INTERSPEECH.

[69]  Pascal Belin,et al.  Human voice perception , 2011, Current Biology.

[70]  Denis G. Pelli,et al.  ECVP '07 Abstracts , 2007, Perception.

[71]  F. Attneave Some informational aspects of visual perception. , 1954, Psychological review.

[72]  Mark W. Greenlee,et al.  Stimulus-specific mechanisms of visual short-term memory , 1991, Vision Research.

[73]  Kerry M. M. Walker,et al.  Pitch perception is adapted to species-specific cochlear filtering , 2018, bioRxiv.

[74]  C. Spearman The proof and measurement of association between two things. , 2015, International journal of epidemiology.

[75]  Andrew J Oxenham,et al.  A Neural Representation of Pitch Salience in Nonprimary Human Auditory Cortex Revealed with Functional Magnetic Resonance Imaging , 2004, The Journal of Neuroscience.

[76]  G. Fairweather,et al.  Relative effectiveness of psychotherapeutic programs: A multicriteria comparison of four programs for three different patient groups. , 1960 .

[77]  Andrew J Oxenham,et al.  Perceptual grouping affects pitch judgments across time and frequency. , 2011, Journal of experimental psychology. Human perception and performance.

[78]  L. Demany,et al.  Memory for pitch versus memory for loudness. , 1999, The Journal of the Acoustical Society of America.

[79]  Carl F. Falk,et al.  The Relationship Between Unstandardized and Standardized Alpha, True Reliability, and the Underlying Measurement Model , 2011, Journal of personality assessment.

[80]  Daniel Pressnitzer,et al.  Fundamental differences in change detection between vision and audition , 2010, Experimental Brain Research.

[81]  Hideki Kawahara,et al.  Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity estimation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[82]  Josh H. McDermott,et al.  Individual Differences Reveal the Basis of Consonance , 2010, Current Biology.

[83]  W. A. Phillips On the distinction between sensory storage and short-term visual memory , 1974 .

[84]  L. Cronbach Coefficient alpha and the internal structure of tests , 1951 .

[85]  Yueqi Guo,et al.  Complex pitch perception mechanisms are shared by humans and a New World monkey , 2015, Proceedings of the National Academy of Sciences.

[86]  R. Peters,et al.  Pitch for nonsimultaneous successive harmonics in quiet and noise. , 1981, The Journal of the Acoustical Society of America.

[87]  •'AYNE A. •'ICKELGREN Distinctive Features and Errors in Short-Term Memory for English Consonants , 2004 .

[88]  P. Sachs,et al.  SMARCAD1 ATPase activity is required to silence endogenous retroviruses in embryonic stem cells , 2019, Nature Communications.

[89]  J. Wilmer How to use individual differences to isolate functional organization, biology, and utility of visual functions; with illustrative proposals for stereopsis. , 2008, Spatial Vision.

[90]  Michael W. Weiss,et al.  Something in the Way She Sings , 2012, Psychological science.

[91]  Josh H. McDermott,et al.  Illusory sound texture reveals multi-second statistical completion in auditory scene analysis , 2019, Nature Communications.

[92]  Josh H. McDermott,et al.  Universal and Non-universal Features of Musical Pitch Perception Revealed by Singing , 2019, Current Biology.

[93]  R. Fay,et al.  Pitch : neural coding and perception , 2005 .

[94]  Josh H. McDermott,et al.  Diversity in pitch perception revealed by task dependence , 2017, Nature Human Behaviour.

[95]  Christian Kaernbach,et al.  The decay of pitch memory during rehearsal. , 2008, The Journal of the Acoustical Society of America.

[96]  Eero P. Simoncelli,et al.  Summary statistics in auditory perception , 2013, Nature Neuroscience.

[97]  Kerry M. M. Walker,et al.  Neural Ensemble Codes for Stimulus Periodicity in Auditory Cortex , 2010, The Journal of Neuroscience.

[98]  Josh H McDermott,et al.  Headphone screening to facilitate web-based auditory experiments , 2017, Attention, Perception, & Psychophysics.

[99]  Wiktor Mlynarski,et al.  Learning Midlevel Auditory Codes from Natural Sound Statistics , 2017, Neural Computation.

[100]  M. Chun,et al.  Organization of visual short-term memory. , 2000, Journal of experimental psychology. Learning, memory, and cognition.

[101]  M. Motagh,et al.  Complex hazard cascade culminating in the Anak Krakatau sector collapse , 2019, Nature Communications.

[102]  Liberty S. Hamilton,et al.  Intonational speech prosody encoding in the human auditory cortex , 2017, Science.

[103]  Carol L. Krumhansl,et al.  Plink: "Thin Slices" of Music , 2010 .

[104]  Hideki Kawahara,et al.  STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds , 2006 .

[105]  Torsten Dau,et al.  Pitch Discrimination in Musicians and Non-Musicians: Effects of Harmonic Resolvability and Processing Effort , 2015, Journal of the Association for Research in Otolaryngology.

[106]  A. Faulkner Pitch discrimination of harmonic complex signals: residue pitch or multiple component discriminations? , 1985, The Journal of the Acoustical Society of America.

[107]  Michael T. Turvey,et al.  An auditory analogue of the sperling partial report procedure: Evidence for brief auditory storage , 1972 .

[108]  Christian E Stilp,et al.  Rapid efficient coding of correlated complex acoustic properties , 2010, Proceedings of the National Academy of Sciences.

[109]  A. Oxenham,et al.  The psychophysics of pitch , 2005 .

[110]  A. Bachem,et al.  Time Factors in Relative and Absolute Pitch Determination , 1954 .

[111]  D. J. McKeefry,et al.  Speed selectivity in visual short term memory for motion , 2007, Vision Research.

[112]  E. Erdfelder,et al.  Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses , 2009, Behavior research methods.

[113]  Matthias Abend Cognitive Foundations Of Musical Pitch , 2016 .

[114]  R. Ruibal,et al.  Mapping of Interactions in the Pitch Memory Store , 2004 .

[115]  F. Bartlett,et al.  Remembering: A Study in Experimental and Social Psychology , 1932 .

[116]  B C Moore,et al.  Frequency discrimination of complex tones with overlapping and non-overlapping harmonics. , 1990, The Journal of the Acoustical Society of America.

[117]  R. Shepard Circularity in Judgments of Relative Pitch , 1964 .

[118]  Andrew Faulkner,et al.  Vividness of Visual Imagery and Incidental Recall of Verbal Cues, When Phenomenological Availability Reflects Long-Term Memory Accessibility , 2013, Front. Psychology.

[119]  C. Micheyl,et al.  Neural Representation of Harmonic Complex Tones in Primary Auditory Cortex of the Awake Monkey , 2013, The Journal of Neuroscience.