Monitoring browsing behaviour and search services evolution adaptation with a capture-recapture Internet-based programming technique: A case-study over medical portals

Nowadays, web search services are considered as a valuable tool for information exploration. However, due to the extremely rapid growth of the web, search engines cannot index the new information at the same time or with the same priority and thus their performance is hampered in some extend. Besides, web search services index the disseminated information with different algorithms, having as result different response time in updating their directories. In this paper, we propose a meta-search algorithm, which is capable of self-adapting over the continuous changes that occur on the indexed web, using a web evolution adaptation mechanism. This mechanism is put in the context of real capture-recapture experiments conducted in wildlife biological studies. The meta-search algorithm also supports a monitor mechanism, which records the user's browsing behavior during his search sessions. The paper provides the implementation details of the proposed meta-search ranking algorithm along with its initial assessment over health-related information using three medical web search services. We prove in our approach, that when user's browsing behavior is jointly used with a dynamic survey mechanism, which scores the ability of each search service to adapt in the incessant evolution of the web, a more effective meta-search is provided. Experimental results showed that the precision of the third-party results, were increased in several recall levels, for a tested period between September of 2006 and April of 2007.

[1]  C. Schwarz,et al.  ESTIMATING TEMPORARY MIGRATION USING THE ROBUST DESIGN , 1997 .

[2]  Ioannis Anagnostopoulos,et al.  Implementing a customised meta-search interface for user query personalisation , 2002, ITI 2002. Proceedings of the 24th International Conference on Information Technology Interfaces (IEEE Cat. No.02EX534).

[3]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[4]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[5]  Clement T. Yu,et al.  Personalized Web search for improving retrieval effectiveness , 2004, IEEE Transactions on Knowledge and Data Engineering.

[6]  G. Seber The estimation of animal abundance and related parameters , 1974 .

[7]  Masatoshi Yoshikawa,et al.  Adaptive web search based on user profile constructed without any effort from users , 2004, WWW '04.

[8]  J. Nichols,et al.  Statistical inference for capture-recapture experiments , 1992 .

[9]  Hinrich Schütze,et al.  Personalized search , 2002, CACM.

[10]  Larry Fitzpatrick,et al.  Automatic feedback using past queries: social searching? , 1997, SIGIR '97.

[11]  Toru Ishida,et al.  Domain-specific Web search with keyword spices , 2004, IEEE Transactions on Knowledge and Data Engineering.

[12]  J. Pokorny,et al.  Web searching and information retrieval , 2004, Computing in Science & Engineering.

[13]  Seiji Yamada,et al.  Behavior-based web page evaluation , 2006, WWW.

[14]  M. Conroy,et al.  Analysis and Management of Animal Populations , 2002 .

[15]  Mary Beth Rosson,et al.  Paradox of the active user , 1987 .

[16]  Jolly Gm Mark-recapture models with parameters constant in time. , 1982 .

[17]  G. Jolly EXPLICIT ESTIMATES FROM CAPTURE-RECAPTURE DATA WITH BOTH DEATH AND IMMIGRATION-STOCHASTIC MODEL. , 1965, Biometrika.

[18]  Virgílio A. F. Almeida,et al.  A community-aware search engine , 2004, WWW '04.