Web content summarization using social bookmarks: a new approach for social summarization

An increasing number of Web applications are allowing users to play more active roles for enriching the source content. The enriched data can be used for various applications such as text summarization, opinion mining and ontology creation. In this paper, we propose a novel Web content summarization method that creates a text summary by exploiting user feedback (comments and tags) in a social bookmarking service. We had manually analyzed user feedback in several representative social services including del.icio.us, Digg, YouTube, and Amazon.com. We found that (1) user comments in each social service have its own characteristics with respect to summarization, and (2) a tag frequency rank does not necessarily represent its usefulness for summarization. Based on these observations, we conjecture that user feedback in social bookmarking services is more suitable for summarization than other type of social services. We implemented prototype system called SSNote that analyzes tags and user comments in del.icio.us, and extracts summaries. Performance evaluations of the system were conducted by comparing its output summary with manual summaries generated by human evaluators. Experimental results show that our approach highlights the potential benefits of user feedback in social bookmarking services.

[1]  Xin Liu,et al.  Generic text summarization using relevance measure and latent semantic analysis , 2001, SIGIR '01.

[2]  Ee-Peng Lim,et al.  Comments-oriented blog summarization by sentence extraction , 2007, CIKM '07.

[3]  Mark T. Maybury,et al.  Advances in Automatic Text Summarization , 1999 .

[4]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[5]  Wai Lam,et al.  MEAD - A Platform for Multidocument Multilingual Text Summarization , 2004, LREC.

[6]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[7]  Evangelos E. Milios,et al.  Summarizing Web Sites Automatically , 2003, Canadian Conference on AI.

[8]  Shogo Nishida,et al.  Social summarization of text feedback for online auctions and interactive presentation of the summary , 2006, IUI '06.

[9]  Cécile Paris,et al.  Automatically summarising Web sites: is there a way around it? , 2000, CIKM '00.

[10]  Shay David,et al.  Six degrees of reputation: The use and abuse of online review and recommendation systems , 2006, First Monday.

[11]  Wei-Ying Ma,et al.  Web-page classification through summarization , 2004, SIGIR '04.

[12]  Bernadette Bouchon-Meunier,et al.  Enhanced web document summarization using hyperlinks , 2003, HYPERTEXT '03.

[13]  Janyce Wiebe,et al.  Learning Subjective Language , 2004, CL.

[14]  Qiang Yang,et al.  Web-page summarization using clickthrough data , 2005, SIGIR '05.

[15]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.

[16]  Tomohiro Fukuhara,et al.  Web Content Summarization Using Social Bookmarking Service , 2008 .

[17]  Barry Smyth,et al.  From social bookmarking to social summarization: an experiment in community-based summary generation , 2007, IUI '07.

[18]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.