A machine for neural computation of acoustical patterns with application to real time speech recognition
暂无分享,去创建一个
400 analog electronic neurons have been assembled and connected for the analysis and recognition of acoustical patterns, including speech. Input to the net comes from a set of 18 band pass filters (Qmax 300 dB/octave; 180 to 6000 Hz, log scale). The net is organized into two parts, the first performs in real time the decomposition of the input patterns into their primitives of energy, space (frequency) and time relations. The other part decodes the set of primitives.216 neurons are dedicated to pattern decomposition. The output of the individual filters is rectified and fed to two sets of 18 neurons in an opponent center‐surround organization of synaptic connections (‘‘on center’’ and (‘‘off center’’). These units compute maxima and minima of energy at different frequencies.The next two sets of neutrons compute the temporal boundaries (‘‘on’’) and ‘‘off’’) and the following two the movement of the energy maxima (formants) up or down the frequency axis. There are in addition ‘‘hyperacuity’’ units which exp...
[1] S. Blumstein,et al. Perceptual invariance and onset spectra for stop consonants in different vowel environments. , 1980, The Journal of the Acoustical Society of America.
[2] P. Mueller,et al. General Principles of Operations in Neuron Nets with Application to Acoustical Pattern Recognition , 1962 .