论文信息 - Sampling Search-Engine Results

Sampling Search-Engine Results

We consider the problem of efficiently sampling Web search engine query results. In turn, using a small random sample instead of the full set of results leads to efficient approximate algorithms fo...

AnagnostopoulosAris | CarmelDavid | Z BroderAndrei

[1] Philip S. Yu,et al. On using partial supervision for text categorization , 2004, IEEE Transactions on Knowledge and Data Engineering.

[2] Andrei Z. Broder,et al. Efficient query evaluation using a two-level retrieval process , 2003, CIKM '03.

[3] Marcus Fontoura,et al. High Performance Index Build Algorithms for Intranet Search Engines , 2004, VLDB.

[4] Jeffrey F. Naughton,et al. On the relative cost of sampling for join selectivity estimation , 1994, PODS '94.

[5] Andrei Z. Broder,et al. Sampling Search-Engine Results , 2005, WWW '05.

[6] Andrei Broder,et al. A taxonomy of web search , 2002, SIGF.

[7] Antonio Gulli,et al. The indexable web is more than 11.5 billion pages , 2005, WWW '05.

[8] Kevin Li,et al. Faceted metadata for image search and browsing , 2003, CHI '03.

[9] Andrew Tomkins,et al. How to build a WebFountain: An architecture for very large-scale text analytics , 2004, IBM Syst. J..

[10] Monika Henzinger,et al. Analysis of a very large web search engine query log , 1999, SIGF.

[11] Jeffrey Scott Vitter,et al. Random sampling with a reservoir , 1985, TOMS.

[12] Howard R. Turtle,et al. Query Evaluation: Strategies and Optimizations , 1995, Inf. Process. Manag..

[13] Dragomir R. Radev,et al. Mining the web for answers to natural language questions , 2001, CIKM '01.

[14] James P. Bagrow,et al. On the Google‐fame of scientists and other populations , 2005 .

[15] Sergey Brin,et al. The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[16] Kim-Hung Li,et al. Reservoir-sampling algorithms of time complexity O(n(1 + log(N/n))) , 1994, TOMS.

[17] David Carmel,et al. Scaling IR-system evaluation using term relevance sets , 2004, SIGIR '04.

[18] Srikanta Tirthapura,et al. Estimating simple functions on the union of data streams , 2001, SPAA '01.