Concerted research effort since the nineteen fifties has lead to effective methods for retrieval of relevant documents from homogeneous collections of text, such as newspaper archives, scientific abstracts and CD-ROM encyclopaedias. However, the triumph of the Web in the nineteen nineties forced a significant paradigm shift in the Information Retrieval field because of the need to address the issues of enormous scale, fluid collection definition, great heterogeneity, unfettered interlinking, democratic publishing, the presence of adversaries and most of all the diversity of purposes for which Web search may be used. Now, the IR field is confronted with a challenge of similarly daunting dimensions -- how to bring highly effective search to the complex information spaces within enterprises. Overcoming the challenge would bring massive economic benefit, but victory is far from assured. The present work characterises enterprise search, hints at its economic magnitude, states some of the unsolved research questions in the domain of enterprise search need, proposes an enterprise search test collection and presents results for a small but interesting sub-problem.
[1]
Stephen E. Robertson,et al.
Okapi at TREC-3
,
1994,
TREC.
[2]
Stephen E. Robertson,et al.
GatfordCentre for Interactive Systems ResearchDepartment of Information
,
1996
.
[3]
David Hawking,et al.
How Valuable is External Link Evidence When Searching Enterprise Webs?
,
2004,
ADC.
[4]
David Hawking,et al.
Query-independent evidence in home page finding
,
2003,
TOIS.
[5]
Dick Stenmark.
A Method for Intranet Search Engine Evaluations
,
1999
.
[6]
Ellen M. Voorhees,et al.
Learning collection fusion strategies
,
1995,
SIGIR '95.
[7]
Andrei Broder,et al.
A taxonomy of web search
,
2002,
SIGF.
[8]
David Hawking,et al.
Panoptic Expert: Searching for experts not just for documents
,
2001
.
[9]
Prabhakar Raghavan,et al.
Navigating large-scale semi-structured data in business portals
,
2001,
VLDB.
[10]
Stephen E. Robertson,et al.
Effective site finding using link anchor information
,
2001,
SIGIR '01.
[11]
Ronald Fagin,et al.
Searching the workplace web
,
2003,
WWW '03.