The Use of Spectral Information in the Development of Novel Techniques for Speech- Based Cognitive Load Classification

[1]  Jiming Liu,et al.  An Adaptive User Interface Based On Personalized Learning , 2003, IEEE Intell. Syst..

[2]  Susanto Rahardja,et al.  Kalman filtering speech enhancement incorporating masking properties for mobile communication in a car environment , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[3]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[4]  Eliathamby Ambikairajah,et al.  An investigation of formant frequencies for cognitive load classification , 2010, INTERSPEECH.

[5]  Susanto Rahardja,et al.  Perceptual Kalman Filtering Speech Enhancement , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[6]  John H. L. Hansen,et al.  Speech Under Stress: Analysis, Modeling and Recognition , 2007, Speaker Classification.

[7]  E. Ambikairajah,et al.  An improved soft threshold method for DCT speech enhancement , 2008, 2008 Second International Conference on Communications and Electronics.

[8]  Sharon K Tindall-Ford,et al.  When two sensory modes are better than one , 1997 .

[9]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[10]  F.R.H. Zijlstra,et al.  Efficiency in work behaviour: A design approach for modern tools , 1993 .

[11]  K. Hendy,et al.  Measuring Subjective Workload: When Is One Scale Better Than Many? , 1993 .

[12]  E. Ambikairajah,et al.  Extraction of FM components from speech signals using all-pole model , 2008 .

[13]  Soo Ngee Koh,et al.  Noisy speech enhancement using discrete cosine transform , 1998, Speech Commun..

[14]  Sharon L. Oviatt,et al.  When do we interact multimodally?: cognitive load and multimodal communication patterns , 2004, ICMI '04.

[15]  Rubo Zhang,et al.  Speech Enhancement Based on Hilbert-Huang Transform Theory , 2006, First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS'06).

[16]  J. Harrington,et al.  Techniques in Speech Acoustics , 1999, Computational Linguistics.

[17]  S. Hart,et al.  Development of NASA-TLX (Task Load Index): Results of Empirical and Theoretical Research , 1988 .

[18]  Douglas A. Reynolds,et al.  Approaches to language identification using Gaussian mixture models and shifted delta cepstral features , 2002, INTERSPEECH.

[19]  Paavo Alku,et al.  On separating glottal source and vocal tract information in telephony speaker verification , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[20]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[21]  Christian A. Müller,et al.  Assessment of a User's Time Pressure and Cognitive Load on the Basis of Features of Speech , 2011, Resource-Adaptive Cognitive Processes.

[22]  J.H.L. Hansen,et al.  Speech enhancement for crosstalk interference , 1997, IEEE Signal Processing Letters.

[23]  John H. L. Hansen,et al.  Speech under stress conditions: overview of the effect on speech production and on system performance , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[24]  Rosalind W. Picard,et al.  Modeling drivers' speech under stress , 2003, Speech Commun..

[25]  Eliathamby Ambikairajah,et al.  A Study of Voice Source and Vocal Tract Filter Based Features in Cognitive Load Classification , 2010, 2010 20th International Conference on Pattern Recognition.

[26]  Schuyler Quackenbush,et al.  Objective measures of speech quality , 1995 .

[27]  J. C. Byers,et al.  Comparison of Four Subjective Workload Rating Scales , 1992 .

[28]  Bernd Freisleben,et al.  Fast and Robust Speaker Clustering Using the Earth Mover'S Distance and Mixmax Models , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[29]  Bayya Yegnanarayana,et al.  Speech processing using group delay functions , 1991, Signal Process..

[30]  F. Paas,et al.  Variability of Worked Examples and Transfer of Geometrical Problem-Solving Skills: A Cognitive-Load Approach , 1994 .

[31]  Alan V. Oppenheim,et al.  All-pole modeling of degraded speech , 1978 .

[32]  Kuldip K. Paliwal,et al.  Spectral subband centroid features for speech recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[33]  Haizhou Li,et al.  Speech enhancement for telephony name speech recognition , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[34]  F. Paas,et al.  Instructional control of cognitive load in the training of complex cognitive tasks , 1994 .

[35]  Ching Y. Suen,et al.  A generative-discriminative hybrid for sequential data classification [image classification example] , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[36]  Jonas Beskow,et al.  Wavesurfer - an open source speech tool , 2000, INTERSPEECH.

[37]  Hoirin Kim,et al.  Noise Robust Speaker Identification Using Sub-Band Weighting in Multi-Band Approach , 2007, IEICE Trans. Inf. Syst..

[38]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[39]  Kuldip K. Paliwal,et al.  A speech enhancement method based on Kalman filtering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[40]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[41]  Fang Chen,et al.  Galvanic skin response (GSR) as an index of cognitive load , 2007, CHI Extended Abstracts.

[42]  John Sweller,et al.  Cognitive Load During Problem Solving: Effects on Learning , 1988, Cogn. Sci..

[43]  R. Martin,et al.  Speech enhancement in hearing aids - from noise suppression to rendering of auditory scenes , 2008, 2008 IEEE 25th Convention of Electrical and Electronics Engineers in Israel.

[44]  Kuldip K. Paliwal,et al.  A Comparative Study of Filter Bank Spacing for Speech Recognition , 2003 .

[45]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[46]  Eliathamby Ambikairajah,et al.  Speech enhancement for nonstationary noise environment , 2002, Asia-Pacific Conference on Circuits and Systems.

[47]  Dick de Waard,et al.  The measurement of drivers' mental workload , 1996 .

[48]  J. Stroop Studies of interference in serial verbal reactions. , 1992 .

[49]  Eliathamby Ambikairajah,et al.  Glottal features for speech-based cognitive load classification , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[50]  Fang Chen,et al.  Speech-based cognitive load monitoring system , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[51]  George R. Doddington,et al.  Recognition of speech under stress and in noise , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[52]  M. A. Kohler,et al.  Language identification using shifted delta cepstra , 2002, The 2002 45th Midwest Symposium on Circuits and Systems, 2002. MWSCAS-2002..

[53]  F. Paas,et al.  Cognitive Architecture and Instructional Design , 1998 .

[54]  Haizhou Li,et al.  Language Identification: A Tutorial , 2011, IEEE Circuits and Systems Magazine.

[55]  Fang Chen,et al.  Cognitive Load Measurement from User's Linguistic Speech Features for Adaptive Interaction Design , 2009, INTERACT.

[56]  Keikichi Hirose,et al.  EMD based soft-thresholding for speech enhancement , 2007, INTERSPEECH.

[57]  Eliathamby Ambikairajah,et al.  FM features for automatic forensic speaker recognition , 2008, INTERSPEECH.

[58]  Paavo Alku,et al.  Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering , 1991, Speech Commun..

[59]  Eliathamby Ambikairajah,et al.  Formant Frequencies under Cognitive Load: Effects and Classification , 2011, EURASIP J. Adv. Signal Process..

[60]  B. Kerr,et al.  Processing demands during mental operations , 1973, Memory & cognition.

[61]  Alex Acero Source-filter models for time-scale pitch-scale modification of speech , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[62]  Paul Chandler,et al.  Levels of Expertise and Instructional Design , 1998, Hum. Factors.

[63]  Valerie J. Gawron,et al.  Human performance measures handbook , 2000 .

[64]  Hynek Hermansky,et al.  Sub-band based recognition of noisy speech , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[65]  G. Mirka,et al.  Cervicobrachial muscle response to cognitive load in a dual-task scenario , 2004, Ergonomics.

[66]  Eliathamby Ambikairajah,et al.  Cognitive load classification using formant features , 2010, 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010).

[67]  Andrew R. Webb,et al.  Statistical Pattern Recognition , 1999 .

[68]  Masaaki Honda,et al.  Human Speech Production Mechanisms , 2003 .

[69]  Philipos C. Loizou,et al.  Speech enhancement based on perceptually motivated bayesian estimators of the magnitude spectrum , 2005, IEEE Transactions on Speech and Audio Processing.

[70]  Eliathamby Ambikairajah,et al.  Investigation of Spectral Centroid Magnitude and Frequency for Speaker Recognition , 2010, Odyssey.

[71]  E. Mendoza,et al.  Acoustic analysis of induced vocal stress by means of cognitive workload tasks. , 1998, Journal of voice : official journal of the Voice Foundation.

[72]  Fang Chen,et al.  Investigating speech features and automatic measurement of cognitive load , 2008, 2008 IEEE 10th Workshop on Multimedia Signal Processing.

[73]  F. Paas,et al.  Cognitive Load Measurement as a Means to Advance Cognitive Load Theory , 2003 .

[74]  P. Chandler,et al.  Cognitive Load Theory and the Format of Instruction , 1991 .

[75]  Vidhyasaharan Sethu,et al.  Investigation of the robustness of a non-uniform filterbank for cognitive load classification , 2011, 2011 8th International Conference on Information, Communications & Signal Processing.

[76]  Fang Chen,et al.  Automatic cognitive load detection from speech features , 2007, OZCHI '07.

[77]  Jessica Villing,et al.  Dialogue Behaviour under High Cognitive Load , 2009, SIGDIAL Conference.

[78]  N. Cowan The magical number 4 in short-term memory: A reconsideration of mental storage capacity , 2001, Behavioral and Brain Sciences.

[79]  Ronald W. Schafer,et al.  Introduction to Digital Speech Processing , 2007, Found. Trends Signal Process..

[80]  Fang Chen,et al.  Exploring classification techniques in speech based cognitive load monitoring , 2008, INTERSPEECH.

[81]  F. Paas Training strategies for attaining transfer of problem-solving skill in statistics: A cognitive-load approach. , 1992 .

[82]  Eliathamby Ambikairajah,et al.  Speech enhancement based on a perceptual modification of wiener filtering , 2002, INTERSPEECH.

[83]  Christian Gütl,et al.  AdeLE (Adaptive e-Learning with Eye-Tracking): Theoretical Background, System Architecture and Application Scenarios , 2005 .

[84]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[85]  Hagai Aronowitz,et al.  A distance measure between GMMs based on the unscented transform and its application to speaker recognition , 2005, INTERSPEECH.

[86]  Two times three little pigs: Dysfluency, cognitive complexity and autism , 2007 .

[87]  K S Rao,et al.  Emotion recognition from speech signal using epoch parameters , 2010, 2010 International Conference on Signal Processing and Communications (SPCOM).

[88]  Roland Brünken,et al.  Cognitive Load Theory: THEORY , 2010 .

[89]  Fang Chen,et al.  Phase based features for cognitive load measurement system , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[90]  N. Huang,et al.  The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis , 1998, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[91]  A. Cuhadar,et al.  Evaluation of Speech Enhancement Techniques for Speaker Identification in Noisy Environments , 2008 .

[92]  W. B. Knowles,et al.  Operator Loading Tasks , 1963, Human factors.

[93]  Vidhyasaharan Sethu,et al.  Robust Speech-Based Cognitive Load Classification Using a Multi-band Approach , 2010 .

[94]  F. Paas,et al.  Memory load and the cognitive pupillary response in aging. , 2004, Psychophysiology.

[95]  John Sweller,et al.  Cognitive Load Theory: Instructional Implications of the Interaction between Information Structures and Cognitive Architecture , 2004 .

[96]  Klaus R. Scherer,et al.  Acoustic correlates of task load and stress , 2002, INTERSPEECH.

[97]  Eliathamby Ambikairajah,et al.  Improvement of Vietnamese Tone Classification using FM and MFCC Features , 2009, 2009 IEEE-RIVF International Conference on Computing and Communication Technologies.

[98]  Sridha Sridharan,et al.  Feature warping for robust speaker verification , 2001, Odyssey.

[99]  Christian A. Müller,et al.  Recognizing Time Pressure and Cognitive Load on the Basis of Speech: An Experimental Study , 2001, User Modeling.

[100]  Md. Kamrul Hasan,et al.  Soft thresholding for DCT speech enhancement , 2002 .

[101]  Alexandros Potamianos,et al.  Multi-band speech recognition in noisy environments , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[102]  Shun'ichi Tano,et al.  Information presentation based on estimation of human multimodal cognitive load , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[103]  Vidhyasaharan Sethu,et al.  Speech enhancement based on empirical mode decomposition , 2008 .

[104]  F. Thomas Eggemeier,et al.  Workload assessment methodology. , 1986 .

[105]  Wendy S. Ark,et al.  The Emotion Mouse , 1999, HCI.

[106]  D B Pisoni,et al.  Effects of cognitive workload on speech production: acoustic analyses and perceptual consequences. , 1993, The Journal of the Acoustical Society of America.

[107]  Joost Schilperoord,et al.  On the Cognitive Status of Pauses in Discourse Production , 2002 .

[108]  E. Ambikairajah,et al.  Group delay features for speaker recognition , 2007, 2007 6th International Conference on Information, Communications & Signal Processing.

[109]  S. Miyake Multivariate workload evaluation combining physiological and subjective measures. , 2001, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[110]  Vidhyasaharan Sethu,et al.  Speaker dependency of spectral features and speech production cues for automatic emotion classification , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[111]  H.K. Kwan,et al.  Adaptive subband Wiener filtering for speech enhancement using critical-band gammatone filterbank , 2005, 48th Midwest Symposium on Circuits and Systems, 2005..

[112]  Petros Maragos,et al.  Energy separation in signal modulations with application to speech analysis , 1993, IEEE Trans. Signal Process..

[113]  Sharon L. Oviatt,et al.  Human-centered design meets cognitive load theory: designing interfaces that help people think , 2006, MM '06.

[114]  Williams Ce,et al.  The effects of different levels of task complexity on three vocal measures. , 1987 .

[115]  Toshio Irino,et al.  Noise suppression using a time-varying, analysis/synthesis gamma chirp filterbank , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[116]  Eliathamby Ambikairajah,et al.  A non-uniform subband approach to speech-based cognitive load classification , 2009, 2009 7th International Conference on Information, Communications and Signal Processing (ICICS).

[117]  Eliathamby Ambikairajah,et al.  Perceptual speech enhancement exploiting temporal masking properties of human auditory system , 2010, Speech Commun..

[118]  Vidhyasaharan Sethu,et al.  Group delay features for emotion detection , 2007, INTERSPEECH.

[119]  John H. L. Hansen,et al.  Analysis and detection of cognitive load and frustration in drivers' speech , 2010, INTERSPEECH.

[120]  Jianwu Dang,et al.  An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification , 2008, Speech Commun..

[121]  Daniel P. W. Ellis,et al.  Evaluation of Distance Measures Between Gaussian Mixture Models of MFCCs , 2007, ISMIR.

[122]  Sexton Jb,et al.  Analyzing cockpit communications: the links between language, performance, error, and workload. , 2000 .

[123]  Thippur V. Sreenivas,et al.  Codebook constrained Wiener filtering for speech enhancement , 1996, IEEE Trans. Speech Audio Process..

[124]  D. Leutner,et al.  Direct Measurement of Cognitive Load in Multimedia Learning , 2003 .