Paper information - Web searching and information retrieval

Web searching and information retrieval

The first Web information services were based on traditional information retrieval (IR) algorithms and techniques. However, IR algorithms were developed for smaller and more coherent collections than the Web. Web searching therefore requires new techniques, such as exploiting the linkage among Web pages, or extensions of the old ones. This article offers an overview of today's search engine architectures and techniques in the context of IR. It introduces three such architectures and describes their basic components, then discusses the most important feature of each Web search process: page importance and its use in retrieval. It also summarizes some issues and challenges in Web search engines and considers the future of Web searching in terms of the so-called semantic Web.

J. Pokorný
