Automatically Characterizing Salience Using Readers' Feedback

Salience is an important characteristic of information influencing users’ cognitive and emotional states. For example, salient parts of a document are those that readers will find moving or provoking. This article studies the salience concept and its meanings in linguistics and information retrieval. Then it analyses the main drawbacks of content-based techniques for automatic identification of salient passages in a document. A new context-based method for overcoming these difficulties is subsequently presented. Our method identifies passages that readers have reacted to by analyzing their textual feedback. Our experimentation with blog posts revealed that it is effective and can be on 90% of commented posts.

[1]  Ellen M. Voorhees,et al.  Query expansion using lexical-semantic relations , 1994, SIGIR '94.

[2]  Scott Weinstein,et al.  Centering: A Framework for Modeling the Local Coherence of Discourse , 1995, CL.

[3]  Stefano Mizzaro,et al.  How many relevances in information retrieval? , 1998, Interact. Comput..

[4]  Branimir K. Boguraev,et al.  Salience-based Content Characterisafion of Text Documents , 1997 .

[5]  K. Järvelin,et al.  EVALUATING INFORMATION RETRIEVAL SYSTEMS UNDER THE CHALLENGES OF INTERACTION AND MULTIDIMENSIONAL DYNAMIC RELEVANCE , 2002 .

[6]  Peter Ingwersen,et al.  Dimensions of relevance , 2000, Inf. Process. Manag..

[7]  Frédéric Landragin,et al.  Saillance physique et saillance cognitive , 2004 .

[8]  Barry Smyth,et al.  From social bookmarking to social summarization: an experiment in community-based summary generation , 2007, IUI '07.

[9]  Jean-Yves Delort,et al.  Identifying commented passages of documents using implicit hyperlinks , 2006, HYPERTEXT '06.

[10]  Frank M. Shipman,et al.  Identifying Useful Passages in Documents Based on Annotation Patterns , 2003, ECDL.

[11]  Goran Nenadic,et al.  Recognition and Acquisition of Compound Names from Corpora , 2000, Natural Language Processing.

[12]  Frances Kamm,et al.  Does Distance Matter Morally to the Duty to Rescue , 2000 .

[13]  Nick Cercone,et al.  Selection: Salience, Relevance and the Coupling between Domain-Level Tasks and Text Planning , 1990, INLG.

[14]  Dragomir R. Radev,et al.  LexRank: Graph-based Centrality as Salience in Text Summarization , 2004 .

[15]  E. Krahmer,et al.  Efficient Generation of Descriptions in Context , 1999 .

[16]  Branimir Boguraev,et al.  Anaphora for Everyone: Pronominal Anaphora Resolution without a Parser , 1996, COLING.

[17]  Shalom Lappin,et al.  An Algorithm for Pronominal Anaphora Resolution , 1994, CL.

[18]  Sang-goo Lee,et al.  Web content summarization using social bookmarks: a new approach for social summarization , 2008, WIDM '08.

[19]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[20]  Anne Treisman,et al.  Features and objects in visual processing , 1986 .

[21]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[22]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[23]  P. Niedenthal,et al.  The heart's eye: Emotional influences in perception and attention. , 1994 .

[24]  Max Chevalier,et al.  A Social Validation of Collaborative Annotations on Digital Documents , 2005, IWAC.

[25]  Daniel McIntyre Using Foregrounding Theory as a Teaching Methodology in a Stylistics Course , 2003 .

[26]  T. Saracevic,et al.  Relevance: A review of the literature and a framework for thinking on the notion in information science. Part II: nature and manifestations of relevance , 2007, J. Assoc. Inf. Sci. Technol..

[27]  Ee-Peng Lim,et al.  Comments-oriented blog summarization by sentence extraction , 2007, CIKM '07.

[28]  Barry Arons,et al.  A Review of The Cocktail Party Effect , 1992 .

[29]  Jade Goldstein-Stewart,et al.  Summarizing text documents: sentence selection and evaluation metrics , 1999, SIGIR '99.

[30]  G Sperling,et al.  Measuring the amplification of attention. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[31]  A. Tversky Features of Similarity , 1977 .

[32]  Andreas Dengel,et al.  Generating and using gaze-based document annotations , 2008, CHI Extended Abstracts.

[33]  Jake Harwood,et al.  The Family and Communication Dynamics of Group Salience , 2006 .

[34]  Owen Daly-Jones,et al.  Characterising the social salience of electronically mediated communication , 1994, CHI Conference Companion.

[35]  Fabio Rinaldi,et al.  Anaphora Resolution in ExtrAns , 2003 .

[36]  James Allan,et al.  Automatic Query Expansion Using SMART: TREC 3 , 1994, TREC.

[37]  Eva Hajicová,et al.  Topic-focus and Salience , 2001, ACL.