Incorporating Multi-partite Networks and Expertise to Construct Related-Term Graphs

Term suggestion techniques recommend query terms to a user based on his initial query. Providing adequate term suggestions is a challenging task. Most existing commercial search engines suggest search terms based on the frequency of prior used terms that match the first few letters typed by the user. We present a novel mechanism to construct semantic term-relation graphs to suggest semantically relevant search terms. We build term relation graphs based on multi-partite networks of existing social media. These linkage networks are extracted from Wikipedia to eventually form term relation graphs. We propose incorporating contributor-category networks to model the contributor expertise. This step has been shown to significantly enhance the accuracy of the inferred relatedness of the term-semantic graphs. Experiments showed the obvious advantage of our algorithms over existing approaches

[1]  Tse Chung Building Term Suggestion Relational Graphs from Collective Intelligence , 2009 .

[2]  Frank van Harmelen,et al.  Using Google distance to weight approximate ontology matches , 2007, WWW '07.

[3]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[4]  D. Watts,et al.  Small Worlds: The Dynamics of Networks between Order and Randomness , 2001 .

[5]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[6]  Ulrik Brandes,et al.  Network analysis of collaboration structure in Wikipedia , 2009, WWW '09.

[7]  Evgeniy Gabrilovich,et al.  Feature Generation for Text Categorization Using World Knowledge , 2005, IJCAI.

[8]  Doug Beeferman,et al.  Agglomerative clustering of a search engine query log , 2000, KDD '00.

[9]  M Girvan,et al.  Structure of growing social networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Evgeniy Gabrilovich,et al.  Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge , 2006, AAAI.

[11]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[12]  Mark T. Maybury Human Language Technologies for Knowledge Management , 2001, HTLKM@ACL.

[13]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[14]  Craig Silverstein,et al.  Analysis of a Very Large Altavista Query Log" SRC Technical note #1998-14 , 1998 .

[15]  Eduardo Mena,et al.  Web-Based Measure of Semantic Relatedness , 2008, WISE.

[16]  Ricardo A. Baeza-Yates,et al.  Extracting semantic relations from query logs , 2007, KDD '07.

[17]  Padhraic Smyth,et al.  Algorithms for estimating relative importance in networks , 2003, KDD '03.

[18]  Volker Tresp,et al.  Soft Clustering on Graphs , 2005, NIPS.

[19]  Gordon I. McCalla,et al.  User Modelling in I-Help: What, Why, When and How , 2001, User Modeling.

[20]  Yi Zhang,et al.  Novelty and redundancy detection in adaptive filtering , 2002, SIGIR '02.

[21]  Pavel Serdyukov,et al.  Enterprise and desktop search , 2010, WWW '10.

[22]  Juan-Zi Li,et al.  Expert Finding in a Social Network , 2007, DASFAA.

[23]  Péter Schönhofen,et al.  Identifying Document Topics Using the Wikipedia Category Network , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[24]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[25]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[26]  Monika Henzinger,et al.  Hyperlink Analysis for the Web , 2001, IEEE Internet Comput..

[27]  Ian H. Witten,et al.  A knowledge-based search engine powered by wikipedia , 2007, CIKM '07.