Size estimation of non-cooperative data collections
暂无分享,去创建一个
Djoerd Hiemstra | Maurice van Keulen | Mohammadreza Khelghati | D. Hiemstra | M. V. Keulen | Mohammadreza Khelghati
[1] Ziv Bar-Yossef,et al. Efficient search engine measurements , 2007, WWW '07.
[2] Xin Jin,et al. Unbiased estimation of size and other aggregates over hidden web databases , 2010, SIGMOD Conference.
[3] Jianguo Lu,et al. Ranking bias in deep web size estimation using capture recapture method , 2010, Data Knowl. Eng..
[4] David J. C. Mackay,et al. Introduction to Monte Carlo Methods , 1998, Learning in Graphical Models.
[5] Andrei Z. Broder,et al. A Technique for Measuring the Relative Size and Overlap of Public Web Search Engines , 1998, Comput. Networks.
[6] James P. Callan,et al. Query-based sampling of text databases , 2001, TOIS.
[7] John C. Kern,et al. Introduction to Regression Analysis , 2007 .
[8] Andrei Z. Broder,et al. Sampling Search-Engine Results , 2005, WWW '05.
[9] Bryan F. J. Manly,et al. Handbook of Capture-Recapture Analysis , 2010 .
[10] Paul Thomas. Generalising multiple capture-recapture to non-uniform sample sizes , 2008, SIGIR '08.
[11] Jianguo Lu,et al. Estimating deep web data source size by capture–recapture method , 2010, Information Retrieval.
[12] Milad Shokouhi,et al. Capturing collection size for distributed non-cooperative retrieval , 2006, SIGIR.
[13] Sheng Wu,et al. Estimating collection size with logistic regression , 2007, SIGIR.
[14] H. Katzgraber. Introduction to Monte Carlo Methods , 2009, 0905.1629.
[15] Antonio Gulli,et al. The indexable web is more than 11.5 billion pages , 2005, WWW '05.
[16] Andrei Z. Broder,et al. Estimating corpus size via queries , 2006, CIKM '06.
[17] Ziv Bar-Yossef,et al. Random sampling from a search engine's index , 2006, WWW '06.
[18] David J. Olive,et al. Introduction to Regression Analysis , 2007 .