Modeling Pitch Perception With an Active Auditory Model Extended by Octopus Cells

Pitch is an essential category for musical sensations. Models of pitch perception are vividly discussed up to date. Most of them rely on definitions of mathematical methods in the spectral or temporal domain. Our proposed pitch perception model is composed of an active auditory model extended by octopus cells. The active auditory model is the same as used in the Stimulation based on Auditory Modeling (SAM), a successful cochlear implant sound processing strategy extended here by modeling the functional behavior of the octopus cells in the ventral cochlear nucleus and by modeling their connections to the auditory nerve fibers (ANFs). The neurophysiological parameterization of the extended model is fully described in the time domain. The model is based on latency-phase en- and decoding as octopus cells are latency-phase rectifiers in their local receptive fields. Pitch is ubiquitously represented by cascaded firing sweeps of octopus cells. Based on the firing patterns of octopus cells, inter-spike interval histograms can be aggregated, in which the place of the global maximum is assumed to encode the pitch.

[1]  Miriam Furst,et al.  A New Approach to Model Pitch Perception Using Sparse Coding , 2017, PLoS Comput. Biol..

[2]  F. Klefenz,et al.  A parallel systolic array ASIC for real-time execution of the Hough transform , 2002 .

[3]  J. Stephen Downie,et al.  The music information retrieval evaluation exchange (2005-2007): A window into music information retrieval research , 2008 .

[4]  Kaushik Roy,et al.  Convolutional Spike Timing Dependent Plasticity based Feature Learning in Spiking Neural Networks , 2017, ArXiv.

[5]  Jürgen Adamy,et al.  A brain-like neural network for periodicity analysis , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[6]  Frank Klefenz,et al.  A neurobiologically inspired vowel recognizer using hough-transform - a novel approach to auditory image processing , 2006, VISAPP.

[7]  Zhiyuan Yan,et al.  A Supervised Stdp-Based Training Algorithm for Living Neural Networks , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Alireza Bagheri,et al.  Training Probabilistic Spiking Neural Networks with First- To-Spike Decoding , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Gholamreza Karimi,et al.  Triplet-based spike timing dependent plasticity (TSTDP) modeling using VHDL-AMS , 2015, Neurocomputing.

[10]  E. George,et al.  Revisiting Place-Pitch Match in CI Recipients Using 3D Imaging Analysis , 2016, The Annals of otology, rhinology, and laryngology.

[11]  Frank Klefenz,et al.  Making use of auditory models for better mimicking of normal hearing processes with cochlear implants: first results with the SAM coding strategy , 2013 .

[12]  Takashi Matsubara,et al.  Conduction Delay Learning Model for Unsupervised and Supervised Classification of Spatio-Temporal Spike Patterns , 2017, Front. Comput. Neurosci..

[13]  Jutta Kretzberg,et al.  A comparative study of seven human cochlear filter models. , 2016, The Journal of the Acoustical Society of America.

[14]  Ammar Belatreche,et al.  EDL: An Extended Delay Learning Based Remote Supervised Method for Spiking Neurons , 2015, ICONIP.

[15]  Angel Jiménez-Fernandez,et al.  A spiking neural network for real-time Spanish vowel phonemes recognition , 2017, Neurocomputing.

[16]  David B. Grayden,et al.  An integrated model of pitch perception incorporating place and temporal pitch codes with application to cochlear implant research , 2017, Hearing Research.

[17]  Brett Anthony Swanson,et al.  Cochlear Implant Rate Pitch and Melody Perception as a Function of Place and Number of Electrodes , 2016, Trends in hearing.

[18]  S. Neely,et al.  Distortion product emissions from a cochlear model with nonlinear mechanoelectrical transduction in outer hair cells. , 2010, The Journal of the Acoustical Society of America.

[19]  U. Baumann,et al.  Place dependent stimulation rates improve pitch perception in cochlear implantees with single-sided deafness , 2016, Hearing Research.

[20]  Peter Husar,et al.  Making Use of Auditory Models for Better Mimicking of Normal Hearing Processes With Cochlear Implants: The SAM Coding Strategy , 2013, IEEE Transactions on Biomedical Circuits and Systems.

[21]  A. Hudspeth,et al.  Vibrational modes and damping in the cochlear partition , 2015 .

[22]  Peter Birkholz,et al.  A Time-Warping Pitch Tracking Algorithm Considering Fast f0 Changes , 2017, INTERSPEECH.

[23]  Peter A. Tass,et al.  Dendritic and Axonal Propagation Delays Determine Emergent Structures of Neuronal Networks with Plastic Synapses , 2017, Scientific Reports.

[24]  Richard F Lyon,et al.  Cascades of two-pole-two-zero asymmetric resonators are good models of peripheral auditory function. , 2011, The Journal of the Acoustical Society of America.

[25]  M. Domínguez-Morales,et al.  Musical notes classification with neuromorphic auditory system using FPGA and a convolutional spiking network , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[26]  Andreas Jakobsson,et al.  An adaptive penalty multi-pitch estimator with self-regularization , 2016, Signal Process..

[27]  Sofia Strömbergsson Today's Most Frequently Used F0 Estimation Methods, and Their Accuracy in Estimating Male and Female Pitch in Clean Speech , 2016, INTERSPEECH.

[28]  Peter Nopp,et al.  Deep electrode insertion and sound coding in cochlear implants , 2015, Hearing Research.

[29]  Tobi Delbruck,et al.  Real-time classification and sensor fusion with a spiking deep belief network , 2013, Front. Neurosci..

[30]  David B. Grayden,et al.  Learning Pitch with STDP: A Computational Model of Place and Temporal Pitch Perception Using Spiking Neural Networks , 2016, PLoS Comput. Biol..

[31]  David M. Landsberger,et al.  Encoding a Melody Using Only Temporal Information for Cochlear-Implant and Normal-Hearing Listeners , 2017, Trends in hearing.

[32]  Yaochu Jin,et al.  An efficient method for online detection of polychronous patterns in spiking neural networks , 2017, Neurocomputing.

[33]  Malu Zhang,et al.  Efficient training of supervised spiking neural networks via the normalized perceptron based learning rule , 2017, Neurocomputing.

[34]  Speech intelligibility is best predicted by intensity, not cochlea-scaled entropy. , 2017, The Journal of the Acoustical Society of America.

[35]  K. Grosh,et al.  Response to a pure tone in a nonlinear mechanical-electrical-acoustical model of the cochlea. , 2012, Biophysical journal.

[36]  Kerry M. M. Walker,et al.  Harmonic Training and the Formation of Pitch Representation in a Neural Network Model of the Auditory Brain , 2016, Front. Comput. Neurosci..

[37]  Johan H. M. Frijns,et al.  Place pitch versus electrode location in a realistic computational model of the implanted human cochlea , 2014, Hearing Research.

[38]  J. Marozeau,et al.  Cochlear Implants Can Talk but Cannot Sing in Tune , 2014 .

[39]  Nace L. Golding,et al.  Synaptic integration in dendrites: exceptional need for speed , 2012, The Journal of physiology.

[40]  D. Mountain,et al.  A piezoelectric model of outer hair cell function. , 1994, The Journal of the Acoustical Society of America.

[41]  J. Ison,et al.  Cellular Computations Underlying Detection of Gaps in Sounds and Lateralizing Sound Sources , 2017, Trends in Neurosciences.

[42]  Rainer Martin,et al.  A computational study of auditory models in music recognition tasks for normal-hearing and hearing-impaired listeners , 2017, EURASIP J. Audio Speech Music. Process..

[43]  Gerald Langner,et al.  The Neural Code of Pitch and Harmony , 2015 .

[44]  Audrey K. Ellerbee,et al.  Noninvasive in vivo imaging reveals differences between tectorial membrane and basilar membrane traveling waves in the mouse cochlea , 2015, Proceedings of the National Academy of Sciences.

[45]  Ying Xu,et al.  A FPGA Implementation of the CAR-FAC Cochlear Model , 2018, Front. Neurosci..

[46]  Emmanouil Benetos,et al.  Polyphonic Music Sequence Transduction with Meter-Constrained LSTM Networks , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[47]  B. Moore Frequency difference limens for short-duration tones. , 1973, The Journal of the Acoustical Society of America.

[48]  Boris Gourévitch,et al.  Subcortical pathways: Towards a better understanding of auditory disorders , 2018, Hearing Research.

[49]  Gianluca Susi,et al.  Bio-Inspired Temporal-Decoding Network Topologies for the Accurate Recognition of Spike Patterns , 2015 .

[50]  Elena Cerezuela-Escudero,et al.  A Binaural Neuromorphic Auditory Sensor for FPGA: A Spike Signal Processing Approach , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[51]  Clemens Zierhofer,et al.  Electric-acoustic pitch comparisons in single-sided-deaf cochlear implant users: Frequency-place functions and rate pitch , 2014, Hearing Research.

[52]  Majid Ahmadi,et al.  STDP-based unsupervised learning of memristive spiking neural network by Morris-Lecar model , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[53]  Liberty S. Hamilton,et al.  Intonational speech prosody encoding in the human auditory cortex , 2017, Science.

[54]  Qiang Fu,et al.  Improving learning algorithm performance for spiking neural networks , 2017, 2017 IEEE 17th International Conference on Communication Technology (ICCT).

[55]  K. Brandenburg,et al.  Overview of numerical models of cell types in the cochlear nucleus , 2009 .

[56]  Anthony S. Maida,et al.  A spiking network that learns to extract spike signatures from speech signals , 2016, Neurocomputing.

[57]  Olga Sourina,et al.  Learning Polychronous Neuronal Groups Using Joint Weight-Delay Spike-Timing-Dependent Plasticity , 2016, Neural Computation.

[58]  Denis Jouvet,et al.  Performance analysis of several pitch detection algorithms on simulated and real noisy speech data , 2017, 2017 25th European Signal Processing Conference (EUSIPCO).

[59]  F. Klefenz,et al.  A NEURAL NET FOR 2D-SLOPE AND SINUSOIDAL SHAPE DETECTION , 2014 .

[60]  Frank Baumgarte A Physiological Ear Model for Auditory Masking Applicable to Perceptual Coding , 1997 .

[61]  Chaitali Chakrabarti,et al.  Algorithm and hardware design of discrete-time spiking neural networks based on back propagation with binary activations , 2017, 2017 IEEE Biomedical Circuits and Systems Conference (BioCAS).

[62]  A. Oxenham How We Hear: The Perception and Neural Coding of Sound , 2018, Annual review of psychology.

[63]  John G. Harris,et al.  Periodicity detection and localization using spike timing from the AER EAR , 2009, 2009 IEEE International Symposium on Circuits and Systems.

[64]  Hideki Kawahara,et al.  YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[65]  David Talkin,et al.  A Robust Algorithm for Pitch Tracking ( RAPT ) , 2005 .

[66]  Andrew J Oxenham,et al.  Revisiting place and temporal theories of pitch. , 2013, Acoustical science and technology.

[67]  Philip X Joris Entracking as a Brain Stem Code for Pitch: The Butte Hypothesis. , 2016, Advances in experimental medicine and biology.

[68]  Torsten Dau,et al.  Nonlinear time-domain cochlear model for transient stimulation and human otoacoustic emission. , 2012, The Journal of the Acoustical Society of America.

[69]  Gian Carlo Cardarilli,et al.  Hardware design of LIF with Latency neuron model with memristive STDP synapses , 2017, Integr..

[70]  Meredith T Caldwell,et al.  What Does Music Sound Like for a Cochlear Implant User? , 2017, Otology & neurotology : official publication of the American Otological Society, American Neurotology Society [and] European Academy of Otology and Neurotology.

[71]  David M Landsberger,et al.  The Relationship Between Insertion Angles, Default Frequency Allocations, and Spiral Ganglion Place Pitch in Cochlear Implants , 2015, Ear and hearing.

[72]  John Rinzel,et al.  A Neuronal Network Model for Pitch Selectivity and Representation , 2016, Front. Comput. Neurosci..

[73]  Subhrajit Roy,et al.  Spiking Neural Classifier with Lumped Dendritic Nonlinearity and Binary Synapses: A Current Mode VLSI Implementation and Analysis , 2018, Neural Computation.

[74]  Multi methods pitch tracking , 2012 .

[75]  P. Boyle,et al.  Temporal Fine Structure Processing, Pitch, and Speech Perception in Adult Cochlear Implant Recipients , 2017, Ear and hearing.

[76]  David B. Grayden,et al.  An investigation of dendritic delay in octopus cells of the mammalian cochlear nucleus , 2012, Front. Comput. Neurosci..

[77]  Ian C Bruce,et al.  The history and future of neural modeling for cochlear implants , 2016, Network.

[78]  Laurel H Carney,et al.  Updated parameters and expanded simulation options for a model of the auditory periphery. , 2014, The Journal of the Acoustical Society of America.

[79]  Katrin Krumbholz,et al.  Understanding Pitch Perception as a Hierarchical Process with Top-Down Modulation , 2009, PLoS Comput. Biol..

[80]  Tim Jürgens,et al.  The effects of electrical field spatial spread and some cognitive factors on speech-in-noise performance of individual cochlear implant users—A computer model study , 2018, PloS one.

[81]  Ray Meddis,et al.  A revised model of the inner-hair cell and auditory-nerve complex. , 2002, The Journal of the Acoustical Society of America.

[82]  Dalius Krunglevicius Modified STDP Triplet Rule Significantly Increases Neuron Training Stability in the Learning of Spatial Patterns , 2016, Adv. Artif. Neural Syst..

[83]  Yingxue Wang,et al.  Active Processing of Spatio-Temporal Input Patterns in Silicon Dendrites , 2013, IEEE Transactions on Biomedical Circuits and Systems.

[84]  Jeroen J Briaire,et al.  A Novel Algorithm to Derive Spread of Excitation Based on Deconvolution , 2016, Ear and hearing.

[85]  M. Liberman,et al.  Generating Synchrony from the Asynchronous: Compensation for Cochlear Traveling Wave Delays by the Dendrites of Individual Brainstem Neurons , 2012, The Journal of Neuroscience.

[86]  Frieder Stolzenburg,et al.  Harmony perception by periodicity detection , 2013, ArXiv.

[87]  A. Saremi,et al.  Effect of metabolic presbyacusis on cochlear responses: a simulation approach using a physiologically-based model. , 2013, The Journal of the Acoustical Society of America.