Instant Web Retrieval for Instance-Attribute Queries

As the Web becomes the major information source of our daily activities, tools for finding various information on it are indispensable. This paper addresses theWeb retrieval of instance-attribute information, e.g., the contact addresses and research interests (attributes) of faculty and students (instances). This kind of information need is very common but cannot be directly supported by current keywordmatching-based search engines. People commonly use a two-phase search: First, locate the candidate pages, e.g., a faculty page, and then search within them for the desired information, e.g., contact information. Based on the stimulation of such human search behavior, we design a retrieval engine, upon general search engines, to help find the instance-attribute information from the Web. The experiment on several faculty members has shown the feasibility of the approach.

[1]  Tad Hogg,et al.  Spawn: A Distributed Computational Economy , 1992, IEEE Trans. Software Eng..

[2]  James P. Callan,et al.  Passage-level evidence in document retrieval , 1994, SIGIR '94.

[3]  Sergey Brin,et al.  Extracting Patterns and Relations from the World Wide Web , 1998, WebDB.

[4]  Michael P. Wellman,et al.  Market-aware agents for a multiagent world , 1998, Robotics Auton. Syst..

[5]  Li Zhang,et al.  Tycoon: An implementation of a distributed, market-based resource allocation system , 2004, Multiagent Grid Syst..

[6]  H. Howie Huang,et al.  A Feasibility Study of a Virtual Storage System for Large Organizations , 2006, First International Workshop on Virtualization Technology in Distributed Computing (VTDC 2006).

[7]  Wei-Ying Ma,et al.  Block-based web search , 2004, SIGIR '04.

[8]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[9]  Torsten Eymann,et al.  Decentralized resource allocation in application layer networks , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[10]  Cyrus Harrison,et al.  OCEAN: the open computation exchange and arbitration network, a market approach to meta computing , 2003, Second International Symposium on Parallel and Distributed Computing, 2003. Proceedings..

[11]  J. Kephart,et al.  Price dynamics of vertically differentiated information markets , 1998, ICE '98.

[12]  Roy Goldman,et al.  WSQ/DSQ: a practical approach for combined querying of databases and the Web , 2000, SIGMOD '00.

[13]  Tom M. Mitchell Extracting targeted data from the web , 2001, KDD '01.

[14]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[15]  David Abramson,et al.  A Computational Economy for Grid Computing and its Implementation in the Nimrod-G Resource Brok , 2001, Future Gener. Comput. Syst..

[16]  Douglas E. Appelt,et al.  Introduction to Information Extraction Technology , 1999, IJCAI 1999.

[17]  Thomas Sandholm,et al.  Making Markets and Democracy Work: A Story of Incentives and Computing , 2003, IJCAI.

[18]  John D. Lafferty,et al.  Model-based feedback in the language modeling approach to information retrieval , 2001, CIKM '01.

[19]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[20]  David E. Culler,et al.  Market-based Proportional Resource Sharing for Clusters , 2000 .

[21]  Richard Wolski,et al.  G-commerce: market formulations controlling resource allocation on the computational grid , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[22]  Doug Downey,et al.  Web-scale information extraction in knowitall: (preliminary results) , 2004, WWW '04.

[23]  Andrew McCallum,et al.  Disambiguating Web appearances of people in a social network , 2005, WWW '05.

[24]  James Allan,et al.  Automatic Retrieval With Locality Information Using SMART , 1992, TREC.

[25]  Julian Satran,et al.  Internet Small Computer Systems Interface (iSCSI) , 2004, RFC.

[26]  SaltonGerard,et al.  Term-weighting approaches in automatic text retrieval , 1988 .

[27]  Alexiei Dingli,et al.  Integrating Information to Bootstrap Information Extraction from Web Sites , 2003, IIWeb.

[28]  Amin Vahdat,et al.  Resource Allocation in Federated Distributed Computing Infrastructures , 2004 .

[29]  Azadeh Shakery,et al.  Toward Entity Retrieval over Structured and Text Data , 2004 .

[30]  Michael P. Wellman A Market-Oriented Programming Environment and its Application to Distributed Multicommodity Flow Problems , 1993, J. Artif. Intell. Res..

[31]  Jeffrey O. Kephart,et al.  Dynamic pricing by software agents , 2000, Comput. Networks.

[32]  Andrew McCallum,et al.  Extracting social networks and contact information from email and the Web , 2004, CEAS.

[33]  W. Bruce Croft,et al.  Query expansion using local and global document analysis , 1996, SIGIR '96.