Exploring the Relationship between Keywords and Feed Elements in Blog Post Search

Blogs are increasingly accepted as a useful means to proliferate a variety of information on the web. As the popularity of blogs grows rapidly, a number of blog search engines have appeared recently to help users access and discover blog posts efficiently. Nevertheless, existing approaches tend to focus on ranking the blog posts according to their recency or popularity only, leaving the problem of retrieving more topic relevant posts to a user’s query largely unexplored. In this paper, we present a novel blog ranking framework, called PTRank, that improves search quality by taking account of relevance feedback from users as well as various information available from RSS feeds. A neural network method is employed to learn ranking functions that provide a relevance score between a keyword and a blog post. Extensive experiments on real blog data have been conducted to validate the proposed ranking framework for blog post search, and the results indicate that PTRank performs significantly better than the existing popular approach.

[1]  Hyun-Kyu Cho,et al.  Efficient Monitoring Algorithm for Fast News Alerts , 2007, IEEE Transactions on Knowledge and Data Engineering.

[2]  Paolo Avesani,et al.  Using Tags and Clustering to Identify Topic-Relevant Blogs , 2007, ICWSM.

[3]  Joshua Goodman,et al.  Finding advertising keywords on web pages , 2006, WWW '06.

[4]  Lois Ann Scheidt,et al.  Bridging the gap: a genre analysis of Weblogs , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[5]  David G. Stork,et al.  Pattern Classification , 1973 .

[6]  K. Fujimura,et al.  BLOGRANGER – A Multi-faceted Blog Search Engine , 2006 .

[7]  Ravi Kumar,et al.  On the Bursty Evolution of Blogspace , 2003, WWW '03.

[8]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[9]  Iraklis Varlamis,et al.  BlogRank: ranking weblogs based on connectivity and similarity features , 2006, AAA-IDEA '06.

[10]  Nesar Ahmad,et al.  Web search enhancement by mining user actions , 2007, Inf. Sci..

[11]  Susan T. Dumais,et al.  Improving Web Search Ranking by Incorporating User Behavior Information , 2019, SIGIR Forum.

[12]  Christina K. Pikas Blog Searching for Competitive Intelligence, Brand Image, and Reputation Management , 2005 .

[13]  Ko Fujimura,et al.  The EigenRumor Algorithm for Ranking Blogs , 2005 .

[14]  Chen Wang,et al.  A Self-Organizing Search Engine for RSS Syndicated Web Contents , 2006, 22nd International Conference on Data Engineering Workshops (ICDEW'06).

[15]  Whitney Davison‐Turley Blogs and RSS: Powerful Information Management Tools , 2005 .

[16]  Mark Levene,et al.  Ranking Pages by Topology and Popularity within Web Sites , 2006, World Wide Web.

[17]  Filippo Menczer,et al.  Algorithmic Computation and Approximation of Semantic Similarity , 2006, World Wide Web.

[18]  Mike Thelwall,et al.  Blog search engines , 2007, Online Inf. Rev..

[19]  Christopher H. Brooks,et al.  Improved annotation of the blogosphere via autotagging and hierarchical clustering , 2006, WWW '06.

[20]  Filip Radlinski,et al.  Search Engines that Learn from Implicit Feedback , 2007, Computer.

[21]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[22]  Eytan Adar,et al.  Implicit Structure and the Dynamics of Blogspace , 2004 .

[23]  Halley Suitt A blogger in their midst. , 2003, Harvard business review.

[24]  Aaron Weiss Your blog?: who gives a @*#%! , 2004, NTWK.

[25]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[26]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[27]  Filip Radlinski,et al.  Query chains: learning to rank from implicit feedback , 2005, KDD '05.