Next Generation Web Search

In this chapter we provide our personal vision of what could be the next generation of Web search engines, outlining the main research challenges that derive from it. This vision is based on a single premise: people do not really want to search, they want to get tasks done. We motivate our work by the current trends in the Web and, in particular, Web search.

[1]  Raghu Ramakrishnan,et al.  Source-aware Entity Matching: A Compositional Approach , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[2]  Pauline Atherton,et al.  An Analysis of Controlled Vocabulary and Free Text Search Statements in Online Searches , 1980 .

[3]  John Domingue,et al.  Near-Term Prospects for Semantic Technologies , 2008, IEEE Intelligent Systems.

[4]  Cong Yu,et al.  Purple SOX extraction management system , 2009, SGMD.

[5]  Ricardo A. Baeza-Yates,et al.  Graphs from Search Engine Queries , 2007, SOFSEM.

[6]  Adam Kilgarriff,et al.  Introduction to the Special Issue on the Web as Corpus , 2003, CL.

[7]  George W. Furnas,et al.  Effective view navigation , 1997, CHI.

[8]  Ravi Kumar,et al.  A web of concepts , 2009, PODS.

[9]  Peter Mika,et al.  Learning to Tag and Tagging to Learn: A Case Study on Wikipedia , 2008, IEEE Intelligent Systems.

[10]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[11]  Ricardo A. Baeza-Yates,et al.  The Intention Behind Web Queries , 2006, SPIRE.

[12]  Peter Pirolli,et al.  Computational models of information scent-following in a very large browsable text collection , 1997, CHI.

[13]  Giuseppe Attardi,et al.  Ranking very many typed entities on wikipedia , 2007, CIKM '07.

[14]  Carolyn Snyder,et al.  Web Site Usability: A Designer's Guide , 1997 .

[15]  Aristides Gionis,et al.  On the feasibility of multi-site web search engines , 2009, CIKM.

[16]  Fredric C. Gey,et al.  Advanced Search Technologies for Unfamiliar Metadata , 1999, MD.

[17]  Henry Lieberman,et al.  Letizia: An Agent That Assists Web Browsing , 1995, IJCAI.

[18]  Amanda Spink,et al.  Determining the user intent of web search engine queries , 2007, WWW '07.

[19]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[20]  Andrei Z. Broder,et al.  Anatomy of the long tail: ordinary people with extraordinary tastes , 2010, WSDM '10.

[21]  Soumen Chakrabarti,et al.  Mining the web - discovering knowledge from hypertext data , 2002 .

[22]  Stephen Cranefield UML and the Semantic Web , 2001, SWWS.

[23]  Peter Mika,et al.  Microsearch: An Interface for Semantic Search , 2008, SemSearch.

[24]  Alissa Cooper,et al.  A survey of query log privacy-enhancing techniques from a policy perspective , 2008, TWEB.

[25]  James J. Kistler,et al.  Challenges, Techniques and Directions in Building XSeek: an XML Search Engine. , 2009 .

[26]  Ricardo A. Baeza-Yates,et al.  Extracting semantic relations from query logs , 2007, KDD '07.

[27]  Andrei Z. Broder The Future of Web Search: From Information Retrieval to Information Supply , 2006, NGITS.

[28]  Ricardo Baeza-Yates,et al.  Genealogical trees on the web: a search engine user perspective , 2008, WWW.

[29]  David R. Karger,et al.  Scatter/Gather as a Tool for the Navigation of Retrieval Results , 1995 .

[30]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.

[31]  Michael Gertz,et al.  On the value of temporal information in information retrieval , 2007, SIGF.

[32]  Jun Yang,et al.  Efficient Information Extraction over Evolving Text Data , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[33]  Sanda M. Harabagiu,et al.  Experiments with Open-Domain Textual Question Answering , 2000, COLING.

[34]  Peter Ingwersen,et al.  Developing a Test Collection for the Evaluation of Integrated Search , 2010, ECIR.

[35]  Mihai Surdeanu,et al.  Learning to Rank Answers on Large Online QA Collections , 2008, ACL.

[36]  C. Lee Giles,et al.  CiteSeer: an automatic citation indexing system , 1998, DL '98.

[37]  Chris Anderson,et al.  The Long Tail: Why the Future of Business is Selling Less of More , 2006 .

[38]  Andrew Tomkins,et al.  Toward a PeopleWeb , 2007, Computer.

[39]  Luis Gravano,et al.  Computing Geographical Scopes of Web Resources , 2000, VLDB.

[40]  Wojciech Rytter,et al.  Extracting Powers and Periods in a String from Its Runs Structure , 2010, SPIRE.

[41]  Ben Shneiderman,et al.  Interface and data architecture for query preview in networked information systems , 1999, TOIS.

[42]  Gary Marchionini,et al.  Information Seeking in Electronic Environments , 1995 .

[43]  Jeffrey F. Naughton,et al.  Information extraction challenges in managing unstructured data , 2009, SGMD.

[44]  Giuseppe Attardi,et al.  Semantically Annotated Snapshot of the English Wikipedia , 2008, LREC.

[45]  Ricardo A. Baeza-Yates,et al.  Applications of Web Query Mining , 2005, ECIR.

[46]  Wiebe van der Hoek,et al.  SOFSEM 2007: Theory and Practice of Computer Science , 2007 .