Innovative speech processing for mobile terminals: an annotated bibliography

Abstract This paper gives an overview of recent bibliographic references dealing with speech processing in mobile terminals. Its purpose is to point out state-of-the-art issues in the area; to that end, a fairly large list of references drawn from many conference proceedings and journals is given and commented on. General considerations about speech processing in mobile communications are introduced first; we then deal with audio processing for speech enhancement in mobile terminals and with low bit-rate speech coding. Speech recognition is addressed with an emphasis on mobile applications. A short overview of implementation aspects of speech processing algorithms in mobile terminals is also given. Finally, open issues and problems are listed.

[1]  Eberhard Hänsler,et al.  The hands-free telephone problem- An annotated bibliography , 1992, Signal Process..

[2]  Peter Händel,et al.  Low-distortion spectral subtraction for speech enhancement , 1995, EUROSPEECH.

[3]  Sven Nordholm,et al.  Adaptive array noise suppression of handsfree speaker input in cars , 1993 .

[4]  Jae S. Lim,et al.  Adaptive noise cancellation in a fighter cockpit environment , 1984, ICASSP.

[5]  Chin-Hui Lee,et al.  On stochastic feature and model compensation approaches to robust speech recognition , 1998, Speech Commun..

[6]  Pascal Scalart,et al.  Speech enhancement based on a priori signal to noise estimation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[7]  Eric Moulines,et al.  Frequency-domain adaptive filtering with applications to acoustic echo cancellation , 1994 .

[8]  Anthony G. Constantinides,et al.  Residual signal in sub-band acoustic echo cancellers , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[9]  Rainer Martin,et al.  Coupled adaptive filters for acoustic echo control and noise reduction , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[10]  Karl Hellwig,et al.  Speech codec for the European mobile radio system , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[11]  A. Gilloire,et al.  The fast Newton transversal filter: an efficient scheme for acoustic echo cancellation in mobile radio , 1994, IEEE Trans. Signal Process..

[12]  J. S. Bird,et al.  Speech enhancement for mobile telephony , 1990 .

[13]  Chafic Mokbel,et al.  Deconvolution of telephone line effects for speech recognition , 1996, Speech Commun..

[14]  Régine Le Bouquin-Jeannès,et al.  Joint system for acoustic echo cancellation and noise reduction , 1995, EUROSPEECH.

[15]  François Capman,et al.  Acoustic echo cancellation and noise reduction in the frequency-domain: A global optimisation , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[16]  Yves Grenier.  A microphone array for car environments , 1993, Speech Commun..

[17]  H.M. Hafez,et al.  Background acoustic noise reduction in mobile telephony , 1986, 36th IEEE Vehicular Technology Conference.

[18]  Olivier Cappé,et al.  Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor , 1994, IEEE Trans. Speech Audio Process..

[19]  Matthew Lennig,et al.  Directory assistance automation in Bell Canada: Trial results , 1995, Speech Commun..

[20]  B. S. Atal,et al.  Predictive coding of speech using analysis-by-synthesis techniques , 1990, 1990 Conference Record Twenty-Fourth Asilomar Conference on Signals, Systems and Computers.

[21]  Martin Vetterli,et al.  Adaptive filtering in subbands with critical sampling: analysis, experiments, and application to acoustic echo cancellation , 1992, IEEE Trans. Signal Process..

[22]  Peter Heitkämper,et al.  Adaptive gain control and echo cancellation for hands-free telephone systems , 1993, EUROSPEECH.

[23]  Pierre Duhamel,et al.  State of the Art in Acoustic Echo Cancellation , 1996 .

[24]  Redwan Salami,et al.  GSM enhanced full rate speech codec , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[25]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[26]  Philip Lockwood,et al.  Evaluation of root-normalised front-end (RN LFCC) for speech recognition in wireless GSM network environments , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[27]  Rainer Martin,et al.  Spectral Subtraction Based on Minimum Statistics , 2001 .

[28]  Jan M. Rabaey,et al.  Reconfigurable processing: the solution to low-power programmable DSP , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[29]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[30]  Pavel Sovka,et al.  The study of speech/pause detectors for speech enhancement methods , 1995, EUROSPEECH.

[31]  C. García-Mateo,et al.  Modeling Techniques for Speech Coding: A Selected Survey , 1996 .

[32]  Patrick A. Naylor,et al.  Enhancement of hands-free telecommunications , 1994 .

[33]  Eberhard Hänsler,et al.  The hands-free telephone problem: an annotated bibliography update , 1994 .

[34]  Alan McCree,et al.  New methods for adaptive noise suppression , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[35]  Pascal Scalart,et al.  A system for speech enhancement in the context of hands-free radiotelephony with combined noise reduction and acoustic echo cancellation , 1996, Speech Commun..

[36]  M. Koya,et al.  A Hands-Free Mobile Telephone Using Echo Canceller Technique , 1986, ICC.

[37]  Chafic Mokbel,et al.  Towards improving ASR robustness for PSN and GSM telephone applications , 1997, Speech Commun..

[38]  Rainer Martin,et al.  Combined acoustic echo control and noise reduction for hands-free telephony — State of the art and perspectives , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[39]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[40]  Denis Jouvet.  Reconnaissance de mots connectés indépendamment du locuteur par des méthodes statistiques , 1988 .

[41]  Chin-Hui Lee,et al.  Utterance verification of keyword strings using word-based minimum verification error (WB-MVE) training , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[42]  André Gilloire.  Performance evaluation of acoustic echo control: required values and measurement procedures , 1994 .

[43]  Jean Rouat,et al.  A new algorithm for double talk detection and separation in the context of digital mobile radio telephone , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[44]  Denis Jouvet,et al.  Operational and experimental French telecommunication services using CNET speech recognition and text-to-speech synthesis , 1994, Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications.

[45]  Jont B. Allen,et al.  Multimicrophone signal‐processing technique to remove room reverberation from speech signals , 1977 .

[46]  Rafik Goubran,et al.  Parallel adaptive filter structures for acoustic noise cancellation , 1992, [Proceedings] 1992 IEEE International Symposium on Circuits and Systems.

[47]  F. Westall,et al.  Digital signal processing in telecommunications , 1993 .

[48]  Jean-Pierre Adoul,et al.  Description of ITU-T Recommendation G.729 Annex A: reduced complexity 8 kbit/s CS-ACELP codec , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[49]  Jin Yang.  Frequency domain noise suppression approaches in mobile telephone systems , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[51]  Rainer Martin,et al.  Combined acoustic echo control and noise reduction based on residual echo estimation .

[52]  B. Widrow,et al.  Adaptive noise cancelling: Principles and applications , 1975 .

[53]  A.V. Oppenheim,et al.  Enhancement and bandwidth compression of noisy speech , 1979, Proceedings of the IEEE.

[54]  Frederick Jelinek,et al.  Up from trigrams! - the struggle for improved language models , 1991, EUROSPEECH.

[55]  Gerhard Doblinger,et al.  Computationally efficient speech enhancement by spectral minima tracking in subbands , 1995, EUROSPEECH.

[56]  Beghdad Ayad.  Acoustic echo and noise reduction: a novel approach .

[57]  Koichi Shinoda,et al.  High speed speech recognition using tree-structured probability density function , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[58]  George Vysotsky,et al.  VoiceDialingSM - The first speech recognition based service delivered to customer's home from the telephone network , 1995, Speech Commun..

[59]  Peter Kroon,et al.  A High-Quality Multirate Real-Time CELP Coder , 1992, IEEE J. Sel. Areas Commun..

[60]  Ira A. Gerson,et al.  A 5600 bps VSELP speech coder candidate for half-rate GSM , 1993, Proceedings, IEEE Workshop on Speech Coding for Telecommunications.

[61]  I. Boyd,et al.  The voice activity detector for the Pan-European digital cellular mobile telephone service , 1988, International Conference on Acoustics, Speech, and Signal Processing.

[62]  Petri Haavisto,et al.  An improved noise compensation algorithm for speech recognition in noise , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[63]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[64]  R. McAulay,et al.  Speech enhancement using a soft-decision noise suppression filter , 1980 .

[65]  Christophe Beaugeant,et al.  Using psychoacoustic criteria in acoustic echo cancellation algorithms .

[66]  David Malah,et al.  Speech enhancement using optimal non-linear spectral amplitude estimation , 1983, ICASSP.

[67]  Gérard Faucon,et al.  Using the coherence function for noise reduction , 1992 .

[68]  Tyseer Aboulnasr,et al.  A robust variable step-size LMS-type algorithm: analysis and simulations , 1997, IEEE Trans. Signal Process..

[69]  C. Pelaez,et al.  Experiments on noise reduction techniques with robust voice detector in car environment , 1993, EUROSPEECH.

[70]  A. Hirano,et al.  A noise-robust stochastic gradient algorithm with an adaptive step-size suitable for mobile hands-free telephones , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[71]  Peter Vary,et al.  Noise suppression by spectral magnitude estimation —mechanism and theoretical limits— , 1985 .

[72]  Kristian Kroschel,et al.  Subband array processing for speech enhancement , 1993, EUROSPEECH.

[73]  Chafic Mokbel,et al.  Bayesian adaptation of speech recognizers to field speech data , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[74]  Søren Holdt Jensen.  Acoustic Echo Canceller for Hands-free Mobile Radiotelephony , 1992 .

[75]  Neviano Dal Degan,et al.  Acoustic noise analysis and speech enhancement techniques for mobile radio applications , 1988 .

[76]  Y. Ephraim,et al.  Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[77]  François Capman,et al.  Acoustic echo cancellation using a fast QR-RLS algorithm and multirate schemes , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[78]  Eric Moulines,et al.  Low-delay frequency domain LMS algorithm , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[79]  Rafik A. Goubran,et al.  Improved tracking adaptive noise canceler for nonstationary environments , 1992, IEEE Trans. Signal Process..

[80]  M. Sambur,et al.  Adaptive noise canceling for speech signals , 1978 .

[81]  Régine Le Bouquin-Jeannès,et al.  Speech enhancement using a Wiener filtering under signal presence uncertainty , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[82]  Pascal Scalart,et al.  Analysis of two structures for combined acoustic echo cancellation and noise reduction , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[83]  Rainer Martin.  Design and optimization of a two microphone speech enhancement system , 1995, EUROSPEECH.

[84]  ETSI Secretariat,et al.  Digital cellular telecommunications system (Phase 2+); Enhanced Full Rate (EFR) speech transcoding , 1998 .

[85]  S. Hayashi,et al.  Description of the proposed ITU-T 8 kb/s speech coding standard , 1995, Proceedings. IEEE Workshop on Speech Coding for Telecommunications.

[86]  Pavel Sovka,et al.  Noise suppression system for a car , 1993, EUROSPEECH.

[87]  Ed F. Deprettere,et al.  Regular-pulse excitation-A novel approach to effective and efficient multipulse coding of speech , 1986, IEEE Trans. Acoust. Speech Signal Process..

[88]  Jae S. Lim,et al.  The unimportance of phase in speech enhancement , 1982 .

[89]  Tetsuo Kosaka,et al.  Fast output probability computation using scalar quantization and independent dimension multi-mixture , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[90]  R. L. Bouquin.  Enhancement of noisy speech signals: application to mobile radio communications , 1996 .

[91]  Chafic Mokbel,et al.  Adapting PSN recognition models to the GSM environment by using spectral transformation , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[92]  D. Jouvet,et al.  Towards improving ASR robustness for PSN & GSM telephone applications , 1996, Proceedings of IVTTA '96. Workshop on Interactive Voice Technology for Telecommunications Applications.

[93]  Rainer Martin,et al.  Combined acoustic echo cancellation, dereverberation and noise reduction: a two microphone approach , 1994 .

[94]  Gerhard Fettweis.  DSP cores for mobile communications: where are we going? , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.