论文信息 - Automatic Detection, Indexing, and Retrieval of Multiple Attributes from Cross-lingual Multimedia Data

Automatic Detection, Indexing, and Retrieval of Multiple Attributes from Cross-lingual Multimedia Data

This chapter contains sections titled: Introduction Detecting and Using Multiple Attributes from the Audio Keyword Retrieval Using Word -Based and Phoneme-Based Recognition Engines Query Expansion AHS Research Prototype Conclusion

[1] Eric Fosler-Lussier,et al. Towards robustness to fast speech in ASR , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[2] Andreas Stolcke,et al. Combining Words and Speech Prosody for Automatic Topic Segmentation , 2007 .

[3] Gökhan Tür,et al. Automatic detection of sentence boundaries and disfluencies based on recognized words , 1998, ICSLP.

[4] Konstantinos Koumpis,et al. Extractive summarization of voicemail using lexical and prosodic feature subset selection , 2001, INTERSPEECH.

[5] Stanley Boykin,et al. Audio Hot Spotting and Retrieval using Multiple Features , 2004, HLT-NAACL 2004.

[6] Mark A. Clements,et al. Phonetic Searching vs. LVCSR: How to Find What You Really Want in Audio Archives , 2002, Int. J. Speech Technol..

[7] John H. L. Hansen,et al. Getting started with SUSAS: a speech under simulated and actual stress database , 1997, EUROSPEECH.

[8] Douglas A. Reynolds,et al. Modeling of the glottal flow derivative waveform with application to speaker identification , 1999, IEEE Trans. Speech Audio Process..

[9] Herbert Gish,et al. The 2000 BBN Byblos LVCSR system , 2000, INTERSPEECH.

[10] Steve Renals,et al. The THISL SDR System At TREC-8 , 1999, TREC.

[11] Douglas A. Reynolds,et al. Speaker identification and verification using Gaussian mixture speaker models , 1995, Speech Commun..

[12] Eric Fosler-Lussier,et al. Combining multiple estimators of speaking rate , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[13] Gökhan Tür,et al. Prosody-based automatic segmentation of speech into sentences and topics , 2000, Speech Commun..

[14] James Allan. Knowledge Management and Speech Recognition , 2001 .

[15] Ellen M. Voorhees,et al. The TREC Spoken Document Retrieval Track: A Success Story , 2000, TREC.

[16] Amit Srivastava,et al. Integrated technologies for indexing spoken language , 2000, CACM.

[17] Gökhan Tür,et al. Combining words and prosody for information extraction from speech , 1999, EUROSPEECH.

[18] Jing Zheng,et al. Word-level rate of speech modeling using rate-specific phones and pronunciations , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[19] Gökhan Tür,et al. Integrating Prosodic and Lexical Cues for Automatic Topic Segmentation , 2001, CL.

[20] K. Stevens,et al. Classification of glottal vibration from acoustic measurements , 1995 .