Using Personalized Web Search for Enhancing Common Sense and Folksonomy Based Intelligent Search Systems

A large part of the modern web is characterized by usergenerated content categorized using collaborative tagging or folksonomy. It becomes difficult to search for relevant content because of ambiguity in lexical representation of concepts and variances in preferences of users. With more and more services relying on tags for content categorization, it is important that search techniques evolve to better suit the scenario. A promising approach towards solving these problems is to use machine common sense in conjunction with folksonomy. A past attempt to use this approach has shown positive results in finding relevant content but it does not address the issue of noise in search results. In this paper, we use the personalized web search technique of traditional web search systems to address the issue of irrelevant search results in common sense and folksonomy based search systems. In personalized web search, results are reflective of user's preferences, which are decided by search history and categories of interest. We propose modifications to personalized web search technique. Using this modified approach, we extend the basic common sense and folksonomy based search systems to address the issue of noise in search results.

[1]  Christoph Lindemann,et al.  Coarse-grained classification of web sites by their structural properties , 2006, WIDM '06.

[2]  Jianchang Mao,et al.  Towards the Semantic Web: Collaborative Tag Suggestions , 2006 .

[3]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[4]  Luis Gravano,et al.  Probe, count, and classify: categorizing hidden web databases , 2001, SIGMOD '01.

[5]  J. Giles Internet encyclopaedias go head to head , 2005, Nature.

[6]  Ravi Kumar,et al.  Self-similarity in the web , 2001, TOIT.

[7]  Tao Qin,et al.  Subsite Retrieval: A Novel Concept for Topic Distillation , 2005, AIRS.

[8]  Andrew Tomkins,et al.  The volume and evolution of web page templates , 2005, WWW '05.

[9]  Hong Zhu,et al.  Web site complexity metrics for measuring navigability , 2004 .

[10]  Ingemar J. Cox,et al.  The web structure of e-government - developing a methodology for quantitative evaluation , 2006, WWW '06.

[11]  Adele E. Howe,et al.  SAVVYSEARCH: A Metasearch Engine That Learns Which Search Engines to Query , 1997, AI Mag..

[12]  Natasa Milic-Frayling,et al.  SmartView: Enhanced Document Viewer for Mobile Devices , 2002 .

[13]  Eelco Herder,et al.  Off the beaten tracks: exploring three aspects of web navigation , 2006, WWW '06.

[14]  Sandip Debnath,et al.  Automatic extraction of informative blocks from webpages , 2005, SAC '05.

[15]  Robert E. Tarjan,et al.  Depth-First Search and Linear Graph Algorithms , 1972, SIAM J. Comput..

[16]  Erik T. Mueller,et al.  Open Mind Common Sense: Knowledge Acquisition from the General Public , 2002, OTM.

[17]  David Carmel,et al.  The connectivity sonar: detecting site functionality by structural patterns , 2003, HYPERTEXT '03.

[18]  Andreas Hotho,et al.  Information Retrieval in Folksonomies: Search and Ranking , 2006, ESWC.

[19]  Bernardo A. Huberman,et al.  The Structure of Collaborative Tagging Systems , 2005, ArXiv.

[20]  Ravi Kumar,et al.  Hierarchical topic segmentation of websites , 2006, KDD '06.

[21]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[22]  Henry Lieberman,et al.  Adaptive Linking between Text and Photos Using Common Sense Reasoning , 2002, AH.

[23]  David M. Pennock,et al.  Using web structure for classifying and describing web pages , 2002, WWW.

[24]  Torsten Suel,et al.  Compressing the graph structure of the Web , 2001, Proceedings DCC 2001. Data Compression Conference.

[25]  Christopher Olston,et al.  What's new on the web?: the evolution of the web from a search engine perspective , 2004, WWW '04.

[26]  Luis Gravano,et al.  Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies , 1995, VLDB.

[27]  Andreas Hotho,et al.  Mining Association Rules in Folksonomies , 2006, Data Science and Classification.

[28]  Wei-Ying Ma,et al.  Learning important models for web page blocks based on layout and content analysis , 2004, SKDD.

[29]  Andrei Z. Broder,et al.  Graph structure in the Web , 2000, Comput. Networks.

[30]  Adam Mathes,et al.  Folksonomies-Cooperative Classification and Communication Through Shared Metadata , 2004 .

[31]  M. Nauman,et al.  Common Sense and Folksonomy: Engineering an Intelligent Search System , 2007, 2007 International Conference on Information and Emerging Technologies.

[32]  Hugo Liu,et al.  ConceptNet — A Practical Commonsense Reasoning Tool-Kit , 2004 .

[33]  Martin Casdagli,et al.  Nonlinear prediction of chaotic time series , 1989 .

[34]  Amit P. Sheth,et al.  A Framework for Schema-Driven Relationship Discovery from Unstructured Text , 2006, SEMWEB.

[35]  Clement T. Yu,et al.  Personalized web search by mapping user queries to categories , 2002, CIKM '02.

[36]  Ziv Bar-Yossef,et al.  Template detection via data mining and its applications , 2002, WWW.

[37]  Sriram Raghavan,et al.  Representing Web graphs , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[38]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1998, SODA '98.

[39]  Eli Upfal,et al.  Stochastic models for the Web graph , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[40]  Xiaoli Li,et al.  Eliminating noisy information in Web pages for data mining , 2003, KDD '03.