Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News

We propose Laplacian Eigenmaps (LE)-based approaches to automatic story segmentation on speech recognition transcripts of broadcast news. We reinforce story boundaries by applying LE analysis to sentence connective strength matrix and reveal the intrinsic geometric structure of stories. Specifically, we construct a Euclidean space in which each sentence is mapped to a vector. As a result, the original inter-sentence connective strength is reflected by the Euclidean distances between the corresponding vectors and cohesive relations between sentences become geometrically evident. Taking advantage of LE, we present three story segmentation approaches: LE-TextTiling, spectral clustering and LE-DP. In LE-DP, we formalize story segmentation as a straightforward criterion minimization problem and give a fast dynamic programming solution to it. Extensive story segmentation experiments on three corpora demonstrate that the proposed LE-based approaches achieve superior performances and significantly outperform several state-of-the-art methods. For instance, LE-TextTiling obtains a relative F1-measure increase of 17.8% on CCTV Mandarin BN corpus as compared to conventional TextTiling; LE-DP achieves a high F1-measure of 0.7460, which significantly outperforms a recent CRF-prosody approach with an F1-measure of 0.6783 on TDT2 Mandarin BN corpus.

[1]  Chiu-yu Tseng,et al.  Fluent speech prosody: Framework and modeling , 2005, Speech Commun..

[2]  Shih-Fu Chang,et al.  Discovery and fusion of salient multimodal features toward news story segmentation , 2003, IS&T/SPIE Electronic Imaging.

[3]  Andreas Stolcke,et al.  Enriching speech recognition with automatic detection of sentence boundaries and disfluencies , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Julia Hirschberg,et al.  Story Segmentation of Broadcast News in English, Mandarin and Arabic , 2006, NAACL.

[5]  L. Xie,et al.  On the effectiveness of subwords for lexical cohesion based story segmentation of Chinese broadcast news , 2011, Inf. Sci..

[6]  Larry Gillick,et al.  A hidden Markov model approach to text segmentation and event tracking , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[7]  Wai Kit Lo,et al.  Automatic Story Segmentation using a Bayesian Decision Framework for Statistical Models of Lexical Chain Features , 2009, ACL/IJCNLP.

[8]  Shih-Fu Chang,et al.  COLUMBIA-IBM NEWS VIDEO STORY SEGMENTATION IN TRECVID 2004 , 2005 .

[9]  W. Kahan,et al.  The Rotation of Eigenvectors by a Perturbation. III , 1970 .

[10]  P. Maher,et al.  Handbook of Matrices , 1999, The Mathematical Gazette.

[11]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[12]  Chuan Liu,et al.  Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News , 2007, NAACL.

[13]  Lin-shan Lee,et al.  Spoken document understanding and organization , 2005, IEEE Signal Processing Magazine.

[14]  Athanasios Kehagias,et al.  A Dynamic Programming Algorithm for Linear Text Segmentation , 2004, Journal of Intelligent Information Systems.

[15]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[16]  Lei Xie,et al.  Multi-Scale TextTiling for Automatic Story Segmentation in Chinese Broadcast News , 2008, AIRS.

[17]  Lei Xie,et al.  Modeling the statistical behavior of lexical chains to capture word cohesiveness for automatic story segmentation , 2007, INTERSPEECH.

[18]  Lei Xie,et al.  Subword Lexical Chaining for Automatic Story Segmentation in Chinese Broadcast News , 2008, PCM.

[19]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[20]  W. Bruce Croft,et al.  Text Segmentation by Topic , 1997, ECDL.

[21]  Kenney Ng,et al.  Subword-based approaches for spoken document retrieval , 2000, Speech Commun..

[22]  Michael Halliday,et al.  Cohesion in English , 1976 .

[23]  Freddy Y. Y. Choi Advances in domain independent linear text segmentation , 2000, ANLP.

[24]  John D. Lafferty,et al.  Statistical Models for Text Segmentation , 1999, Machine Learning.

[25]  Oskari Heinonen,et al.  Optimal Multi-Paragraph Text Segmentation by Dynamic Programming , 1998, ACL.

[26]  Haizhou Li,et al.  Modeling Broadcast News Prosody Using Conditional Random Fields for Story Segmentation , 2010 .

[27]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28]  Gökhan Tür,et al.  Integrating Prosodic and Lexical Cues for Automatic Topic Segmentation , 2001, CL.

[29]  Lei Xie,et al.  Subword Latent Semantic Analysis for Texttiling-Based Automatic Story Segmentation of Chinese Broadcast News , 2008, 2008 6th International Symposium on Chinese Spoken Language Processing.

[30]  Alan F. Smeaton,et al.  SeLeCT: a lexical cohesion based news story segmentation system , 2004, AI Commun..

[31]  Gina-Anne Levow,et al.  Prosody-based Topic Segmentation for Mandarin Broadcast News , 2004, NAACL.

[32]  Alexander I. Rudnicky,et al.  A texttiling based approach to topic boundary detection in meetings , 2006, INTERSPEECH.

[33]  Igor Malioutov,et al.  Minimum Cut Model for Spoken Lecture Segmentation , 2006, ACL.

[34]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[35]  Zhi-Qiang Liu,et al.  Self-Validated Labeling of Markov Random Fields for Image Segmentation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Lei Xie,et al.  Discovering salient prosodic cues and their interactions for automatic story segmentation in Mandarin broadcast news , 2008, Multimedia Systems.

[37]  Salim Roukos,et al.  Story Segmentation and Topic Detection in the Broadcast News Domain , 1999 .

[38]  Mikhail Belkin,et al.  Towards a theoretical foundation for Laplacian-based manifold methods , 2005, J. Comput. Syst. Sci..

[39]  Chung-Hsien Wu,et al.  Story Segmentation and Topic Classification of Broadcast News via a Topic-Based Segmental Model and a Genetic Algorithm , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[40]  James Allan,et al.  Topic detection and tracking: event-based information organization , 2002 .

[41]  Mary P. Harper,et al.  Structural event detection for rich transcription of speech , 2004 .

[42]  Gökhan Tür,et al.  Prosody-based automatic segmentation of speech into sentences and topics , 2000, Speech Commun..

[43]  Chin-Hui Lee,et al.  A detection-based approach to broadcast news video story segmentation , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.