Duluth : Word Sense Induction Applied to Web Page Clustering

The Duluth systems that participated in task 11 of SemEval‐2013 carried out word sense induction (WSI) in order to cluster Web search results. They relied on an approach that represented Web snippets using second‐order co‐ occurrences. These systems were all implemented using SenseClusters, a freely available open source software package.

[1]  Ted Pedersen,et al.  Name Discrimination by Clustering Similar Contexts , 2005, CICLing.

[2]  Roberto Navigli,et al.  SemEval-2013 Task 11: Word Sense Induction and Disambiguation within an End-User Application , 2013, SemEval@NAACL-HLT.

[3]  Suresh Manandhar,et al.  SemEval-2010 Task 14: Word Sense Induction &Disambiguation , 2010, SemEval@ACL.

[4]  Ted Pedersen,et al.  Discovering identities in web contexts with unsupervised clustering , 2007 .

[5]  Ted Pedersen,et al.  Word Sense Discrimination by Clustering Contexts in Vector and Similarity Spaces , 2004, CoNLL.

[6]  Julio Gonzalo,et al.  The role of named entities in Web People Search , 2009, EMNLP.

[7]  Ted Pedersen UMND2 : SenseClusters Applied to the Sense Induction Task of Senseval-4 , 2007, SemEval@ACL.

[8]  Ted Pedersen,et al.  Selecting the “Right” Number of Senses Based on Clustering Criterion Functions , 2006, EACL.

[9]  Ted Pedersen,et al.  Automatic Cluster Stopping with Criterion Functions and the Gap Statistic , 2006, NAACL.

[10]  Julia Hirschberg,et al.  V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure , 2007, EMNLP.

[11]  Ted Pedersen Determining Smoker Status using Supervised and Unsupervised Learning with Lexical Features , 2006 .

[12]  Roberto Navigli,et al.  Clustering and Diversifying Web Search Results with Graph-Based Word Sense Induction , 2013, CL.

[13]  Ted Pedersen Duluth-WSI: SenseClusters Applied to the Sense Induction Task of SemEval-2 , 2010, SemEval@ACL.

[14]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[15]  Ted Pedersen Computational Approaches to Measuring the Similarity of Short Contexts : A Review of Applications and Methods , 2008, ArXiv.

[16]  George Karypis,et al.  Empirical and Theoretical Comparisons of Selected Criterion Functions for Document Clustering , 2004, Machine Learning.