论文信息 - Recent results in automatic Web resource discovery

Recent results in automatic Web resource discovery

Classical information retrieval (IR) is concerned with indexing a collection of documents and answering queries by returning a ranked list of relevant documents [14, 21, 24]. With the growth of the web, the problems of ambiguity, context sensitivity, synonymy (two terms with the same meaning) and polysemy (one term with different meanings) that are inherent in natural languages, together with the abundance of web pages related to prominent topics, have exacerbated the difficulty of fulfilling the user’s information need. Most search sites have added directory-based topic browsing. The web is organized as a tree of topics, similar to the Dewey decimal system, the Library of Congress catalog, or the US Patent and Trademarks Office subject codes. Tree nodes are maintained by paid ontologists and/or specialist volunteers, such as at Yahoo!, The Mining Co., WWW Virtual Library, and Open Directory Project. This strategy may be biased because of sparsity of experts; at any rate it is biased away from the most accomplished and busiest people.

Soumen Chakrabarti | Soumen Chakrabarti

[1] Craig Silverstein,et al. Analysis of a Very Large Altavista Query Log" SRC Technical note #1998-14 , 1998 .

[2] Jon M. Kleinberg,et al. Mining the Web's Link Structure , 1999, Computer.

[3] Jon M. Kleinberg,et al. Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text , 1998, Comput. Networks.

[4] Prabhakar Raghavan,et al. Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies , 1998, The VLDB Journal.

[5] S. Wasserman,et al. Social Network Analysis: Computer Programs , 1994 .

[6] Stanley Wasserman,et al. Social Network Analysis: Methods and Applications , 1994 .

[7] Andrei Z. Broder,et al. A Technique for Measuring the Relative Size and Overlap of Public Web Search Engines , 1998, Comput. Networks.

[8] Eric W. Brown,et al. Execution performance issues in full-text information retrieval , 1995 .

[9] L. R. Rasmussen,et al. In information retrieval: data structures and algorithms , 1992 .

[10] Sergey Brin,et al. The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[11] Michael McGill,et al. Introduction to Modern Information Retrieval , 1983 .