Prosody-based automatic segmentation of speech into sentences and topics
暂无分享,去创建一个
Gökhan Tür | Dilek Z. Hakkani-Tür | Andreas Stolcke | Elizabeth Shriberg | A. Stolcke | Elizabeth Shriberg | Gökhan Tür
[1] George K. Kokkinakis,et al. Automatic Stochastic Tagging of Natural Language Texts , 1995, Comput. Linguistics.
[2] Jody Kreiman,et al. Perception of Sentence and Paragraph Bound-aries in Natural Conversation , 1982 .
[3] Julia Hirschberg,et al. Some intonational characteristics of discourse structure , 1992, ICSLP.
[4] Van Nostrand,et al. Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm , 1967 .
[5] Jacqueline Vaissière,et al. Language-Independent Prosodic Features , 1983 .
[6] Marti A. Hearst. Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.
[7] M. Swerts. Prosodic features at discourse boundaries of different strength. , 1997, The Journal of the Acoustical Society of America.
[8] Ralph Weischedel,et al. NAMED ENTITY EXTRACTION FROM SPEECH , 1998 .
[9] Slava M. Katz,et al. Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..
[10] Gillian Brown,et al. Questions of intonation , 1980 .
[11] Gökhan Tür,et al. Automatic detection of sentence boundaries and disfluencies based on recognized words , 1998, ICSLP.
[12] Larry P. Heck,et al. Speaker tracking and detection with multiple speakers , 1999, EUROSPEECH.
[13] Andreas Stolcke,et al. Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech? , 1998, Language and speech.
[14] Alvin F. Martin,et al. The 1999 NIST speaker recognition evaluation, using summed two-channel telephone data for speaker detection and speaker tracking , 1999, EUROSPEECH.
[15] Gökhan Tür,et al. Combining words and prosody for information extraction from speech , 1999, EUROSPEECH.
[16] Larry Gillick,et al. A hidden Markov model approach to text segmentation and event tracking , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[17] Mari Ostendorf,et al. Prosodic and lexical indications of discourse structure in human-machine interactions , 1997, Speech Commun..
[18] Mark Liberman,et al. THE TDT-2 TEXT AND SPEECH CORPUS , 1999 .
[19] Marti A. Hearst,et al. Adaptive Multilingual Sentence Boundary Disambiguation , 1997, CL.
[20] Leo Breiman,et al. Classification and Regression Trees , 1984 .
[21] Anne Cutler,et al. Prosody: Models and measurements , 1983 .
[22] Rich Caruana,et al. Introduction to IND and recursive partitioning, version 1.0 , 1991 .
[23] Daben Liu,et al. Fast speaker change detection for broadcast news transcription and indexing , 1999, EUROSPEECH.
[24] Larry P. Heck,et al. Modeling dynamic prosodic variation for speaker verification , 1998, ICSLP.
[25] J M Terken,et al. Beyond Sentence Prosody: Paragraph Intonation in Dutch , 1993, Phonetica.
[26] A. B.,et al. SPEECH COMMUNICATION , 2001 .
[27] George Doddington. The Topic Detection and Tracking Phase 2 (TDT2) evaluation plan , 1998 .
[28] Lalit R. Bahl,et al. A tree-based statistical language model for natural language speech recognition , 1989, IEEE Trans. Acoust. Speech Signal Process..
[29] Andreas Stolcke,et al. Automatic linguistic segmentation of conversational speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[30] L. Baum,et al. A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .
[31] Peter A. Heeman,et al. Intonational boundaries, speech repairs and discourse markers: modeling spoken dialog , 1997 .
[32] Yiming Yang,et al. Topic Detection and Tracking Pilot Study Final Report , 1998 .
[33] F.J. Koopmans-van Beinum,et al. Relationship between discourse structure and dynamic speech rate , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[34] Hideki Kozima,et al. Text Segmentation Based on Similarity between Words , 1993, ACL.
[35] John J. Godfrey,et al. SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[36] Vassilios Digalakis,et al. Genones: optimizing the degree of mixture tying in a large vocabulary hidden Markov model based speech recognizer , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.
[37] Julia Hirschberg,et al. A Prosodic Analysis of Discourse Segments in Direction-Giving Monologues , 1996, ACL.
[38] Nina Gro nnum Thorsen,et al. Intonation and text in Standard Danish , 1985 .
[39] G. Bruce,et al. Textual Aspects of Prosody in Swedish , 1982, Phonetica.
[40] Andreas Stolcke,et al. A prosody only decision-tree model for disfluency detection , 1997, EUROSPEECH.
[41] Elizabeth Shriberg,et al. Phonetic Consequences of Speech Disfluency , 1999 .
[42] Gökhan Tür,et al. Integrating Prosodic and Lexical Cues for Automatic Topic Segmentation , 2001, CL.
[43] I. Lehiste. The Phonetic Structure of Paragraphs , 1975 .
[44] M. Swerts,et al. Prosody as a Marker of Information Flow in Spoken Discourse , 1994 .
[45] Hajime Tsukada,et al. Prosodic Features of Utterances in Task-Oriented Dialogues , 1997, Computing Prosody.