Semantic-Physically Conflicting Speech Perception and Human Cognitive Principle Inspired ASR System Design

Human exhibits great capacity in processing various kinds of complex and conflicting speeches in high interference acoustic scenes like cocktail party and many other public environments. Discovery of the auditory cognitive conflict control mechanism has great inspiration to automatic speech recognition (ASR) system design. The present research investigated the auditory cognitive conflict control mechanism in human brain through the event-related potential (ERP) technique under auditory perception conditions. Based on the wildly accepted Stroop paradigm, two types of auditory cognition experiments were designed to cover intra-channel and inter-channel conflicting speech perception. The ERP investigations indicated that a hierarchical conflict control mechanism plays an indispensable role in the both types of conflicting speech perception. A hierarchical feedback-controlled speech recognition system framework was thus proposed as an engineering model of the observations in this study. The framework offers a novel mechanism with multi-level feedback loops to modify the feature extraction algorithm and a multi-granularity feature fusion criteria to identify the speeches. This ASR architecture would greatly improve online speech recognition performance.

[1]  W. Klimesch,et al.  P1 and traveling alpha waves: evidence for evoked oscillations. , 2007, Journal of neurophysiology.

[2]  Marie T. Banich,et al.  Neural Mechanisms of Cognitive Control: An Integrative Model of Stroop Task Performance and fMRI Data , 2006, Journal of Cognitive Neuroscience.

[3]  C. C. Duncan,et al.  Event-related potentials in clinical research: Guidelines for eliciting, recording, and quantifying mismatch negativity, P300, and N400 , 2009, Clinical Neurophysiology.

[4]  Qinglin Zhang,et al.  Brain mechanism of Stroop interference effect in Chinese characters , 2006, Brain Research.

[5]  Simon Finnigan,et al.  ERP measures indicate both attention and working memory encoding decrements in aging. , 2011, Psychophysiology.

[6]  Matthias Weisbrod,et al.  Prefrontal-cingulate activation during executive control: which comes first? , 2004, Brain research. Cognitive brain research.

[7]  Steven B. Most,et al.  Auditory Stroop reveals implicit gender associations in adults and children , 2007 .

[8]  J. Ridley Studies of Interference in Serial Verbal Reactions , 2001 .

[9]  Mario Liotti,et al.  Is conflict monitoring supramodal? Spatiotemporal dynamics of cognitive control processes in an auditory Stroop task , 2012, Cognitive, affective & behavioral neuroscience.

[10]  H. Müller,et al.  Conflict-specific effects of accessory stimuli on cognitive control in the Stroop task and the Simon task. , 2013, Experimental psychology.

[11]  C. Muchnik,et al.  Auditory conflict processing: behavioral and electrophysiologic manifestations of the stroop effect. , 2010, Journal of the American Academy of Audiology.

[12]  Jonathan D. Cohen,et al.  Conflict monitoring and anterior cingulate cortex: an update , 2004, Trends in Cognitive Sciences.

[13]  H. Tiitinen,et al.  Mismatch negativity (MMN), the deviance-elicited auditory deflection, explained. , 2010, Psychophysiology.

[14]  F. Barbosa,et al.  The development of the N1 and N2 components in auditory oddball paradigms: a systematic review with narrative analysis and suggested normative values , 2015, Journal of Neural Transmission.

[15]  I. Winkler Interpreting the Mismatch Negativity , 2007 .

[16]  T. Stafford,et al.  Modelling Natural Action Selection: Biologically constrained action selection improves cognitive control in a model of the Stroop task , 2011 .

[17]  Javad Salehi Fadardi,et al.  The addiction-stroop test: Theoretical considerations and procedural recommendations. , 2006, Psychological bulletin.

[18]  M. Botvinick,et al.  Conflict monitoring and cognitive control. , 2001, Psychological review.

[19]  R. West,et al.  The Effects of Aging on Controlled Attention and Conflict Processing in the Stroop Task , 2004, Journal of Cognitive Neuroscience.