Automatic Detection, Indexing, and Retrieval of Multiple Attributes from Cross-lingual Multimedia Data

This chapter contains sections titled: Introduction; Detecting and Using Multiple Attributes from the Audio; Keyword Retrieval Using Word-Based and Phoneme-Based Recognition Engines; Query Expansion; AHS Research Prototype; Conclusion.
