Webometric research with the Bing Search API 2.0

In May 2011 the Bing Search API 2.0 had become the only major international web search engine data source available for automatic offline processing for webometric research. This article describes its key features, contrasting them with previous web search data sources, and discussing implications for webometric research. Overall, it seems that large-scale quantitative web research is possible with the Bing Search API 2.0, including query splitting, but that legal issues require the redesign of webometric software to ensure that all results obtained from Bing are displayed directly to the user.

[1]  Howard Rosenbaum,et al.  Can search engines be used as tools for web-link analysis? A critical view , 1999, J. Documentation.

[2]  Helen Ashman,et al.  The effect of user intent on the stability of search engine results , 2011, J. Assoc. Inf. Sci. Technol..

[3]  Mike Thelwall,et al.  Webometrics: emergent or doomed? , 2010, Inf. Res..

[4]  Liwen Vaughan,et al.  Word co-occurrences on Webpages as a measure of the relatedness of organizations: A new Webometrics concept , 2010, J. Informetrics.

[5]  Soon Ae Chun,et al.  Predicting Web Search Hit Counts , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[6]  Judit Bar-Ilan,et al.  The use of web search engines in information science research , 2005, Annu. Rev. Inf. Sci. Technol..

[7]  Michael L. Nelson,et al.  Agreeing to disagree: search engines and their public interfaces , 2007, JCDL '07.

[8]  Maite Taboada,et al.  Methods for Creating Semantic Orientation Dictionaries , 2006, LREC.

[9]  Philipp Mayr,et al.  Google Web APIs - an Instrument for Webometric Analyses? , 2006, ArXiv.

[10]  Mike Thelwall Quantitative comparisons of search engine results , 2008 .

[11]  Daren C. Brabham Crowdsourcing as a Model for Problem Solving , 2008 .

[12]  Michael Gamon,et al.  Search right and thou shalt find ... Using Web Queries for Learner Error Detection , 2010 .

[13]  Rudy Prabowo,et al.  Identifying and characterizing public science-related fears from RSS feeds , 2007, J. Assoc. Inf. Sci. Technol..

[14]  Judit Bar-Ilan,et al.  A method for measuring the evolution of a topic on the Web: The case of “informetrics” , 2009 .

[15]  Ahmet Uyar Google stemming mechanisms , 2009 .

[16]  Peter Ingwersen,et al.  Perspective of webometrics , 2004, Scientometrics.

[17]  Mike Thelwall,et al.  Search engine coverage bias: evidence and possible causes , 2004, Inf. Process. Manag..

[18]  Özgür Ulusoy,et al.  Evolution of web search results within years , 2011, SIGIR '11.

[19]  W. Bruce Croft,et al.  Evaluating verbose query processing techniques , 2010, SIGIR.

[20]  Oren Etzioni,et al.  A search engine for natural language applications , 2005, WWW '05.

[21]  Mike Thelwall,et al.  Extracting accurate and complete results from search engines: Case study windows live , 2008, J. Assoc. Inf. Sci. Technol..

[22]  Judit Bar-Ilan,et al.  A method for measuring the evolution of a topic on the Web: The case of "informetrics" , 2009, J. Assoc. Inf. Sci. Technol..

[23]  Mike Thelwall,et al.  Introduction to Webometrics: Quantitative Web Research for the Social Sciences , 2009, Introduction to Webometrics.

[24]  Ahmet Uyar,et al.  Investigation of the accuracy of search engine hit counts , 2009, J. Inf. Sci..

[25]  W. Bruce Croft,et al.  Structural annotation of search queries using pseudo-relevance feedback , 2010, CIKM.

[26]  Judit Bar-Ilan,et al.  Expectations versus reality – search engine features needed for Web research at mid 2005 , 2005 .

[27]  Elena Maceviciute Review of: Thelwall, Michael. Introduction to webometrics: quantitative web research for the social sciences. San Rafael, CA: Morgan & Claypool, 2009 , 2010, Inf. Res..

[28]  Dirk Lewandowski,et al.  A three-year study on the freshness of web search engine databases , 2008, J. Inf. Sci..

[29]  Michael L. Nelson,et al.  Search engines and their public interfaces: which apis are the most synchronized? , 2007, WWW '07.

[30]  Paul Nieuwenhuysen,et al.  Internet search engines - fluctuations in document accessibility , 2001, J. Documentation.