Applications of Lexical Cohesion Analysis in the Topic Detection and Tracking Domain

