The Effect of Social and Physical Detachment on Information Need

The information need of users and the documents which answer this need are frequently contingent on the different characteristics of users. This is especially evident during natural disasters, such as earthquakes and violent weather incidents, which create a strong transient information need. In this article, we investigate how the information need of users, as expressed by their queries, is affected by their physical detachment, as estimated by their physical location in relation to that of the event, and by their social detachment, as quantified by the number of their acquaintances who may be affected by the event. Drawing on large-scale data from ten major events, we show that social and physical detachment levels of users are a major influence on their search engine queries. We demonstrate how knowing social and physical detachment levels can assist in improving retrieval for two applications: identifying search queries related to events and ranking results in response to event-related queries. We find that the average precision in identifying relevant search queries improves by approximately 18p, and that the average precision of ranking that uses detachment information improves by 10p. Using both types of detachment achieved a larger gain in performance than each of them separately.

[1]  Ramesh Nallapati,et al.  Discriminative models for information retrieval , 2004, SIGIR '04.

[2]  Fernando Diaz,et al.  Time is of the essence: improving recency ranking using Twitter data , 2010, WWW '10.

[3]  Pamela J. Shoemaker,et al.  Determinants of International News Coverage in the U.S. Media , 1987 .

[4]  Rich Caruana,et al.  Data mining in metric space: an empirical analysis of supervised learning performance criteria , 2004, ROCAI.

[5]  Yossi Richter,et al.  Predicting Customer Churn in Mobile Networks through Analysis of Social Groups , 2010, SDM.

[6]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[7]  Fernando Diaz,et al.  Integration of news content into web results , 2009, WSDM '09.

[8]  Barbara Poblete,et al.  Twitter under crisis: can we trust what we RT? , 2010, SOMA '10.

[9]  H. Russell Bernard,et al.  Estimating the Ripple Effect of a Disaster 1 , 2001 .

[10]  Meredith Ringel Morris,et al.  Discovering and using groups to improve personalized search , 2009, WSDM '09.

[11]  Amanda Lee Hughes,et al.  Crisis in a Networked World , 2009 .

[12]  Elad Yom-Tov,et al.  Enhancing digital libraries using missing content analysis , 2008, JCDL '08.

[13]  Fritz Drasgow,et al.  Polychoric and Polyserial Correlations , 2006 .

[14]  Susan T. Dumais,et al.  Similarity Measures for Short Segments of Text , 2007, ECIR.

[15]  Gilad Mishne,et al.  Towards recency ranking in web search , 2010, WSDM '10.

[16]  Jon M. Kleinberg,et al.  Spatial variation in search engine queries , 2008, WWW.

[17]  Konstantina Martzoukou,et al.  A review of Web information seeking research: considerations of method and foci of interest , 2005, Inf. Res..

[18]  H. Russell Bernard,et al.  Estimating the size of an average personal network and of an event subpopulation: Some empirical results☆ , 1991 .

[19]  Hao Wu,et al.  Geographic Distance and Us Newspaper Coverage of Canada and Mexico , 1998 .

[20]  Fernando Diaz,et al.  Geographic features in web search retrieval , 2008, GIR '08.

[21]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[22]  Xiao Li,et al.  Learning query intent from regularized click graphs , 2008, SIGIR '08.

[23]  Jeonghyun Kim Describing and predicting information-seeking behavior on the Web , 2009 .

[24]  Mehran Sahami,et al.  A web-based kernel function for measuring the similarity of short text snippets , 2006, WWW '06.

[25]  Elad Yom-Tov,et al.  What makes a query difficult? , 2006, SIGIR.

[26]  Fernando Diaz,et al.  A case study of using geographic cues to predict query news intent , 2009, GIS.

[27]  Ido Guy,et al.  Personalized social search based on the user's social network , 2009, CIKM.

[28]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.