Blog Distillation towards Linking Wikipedia Entries to Blog Feeds

This paper proposes an approach to blog distillation, i.e., searching for blog feeds that are principally devoted to a given topic. We study this task for the purpose of regarding each of Wikipedia entries as a topic and linking it blog feeds. First, in order to collect candidates of blog feeds for a given query, in this paper, we use existing Web search engine APIs, which return a ranked list of blog posts, given a topic keyword. Next, we re-rank the list of blog feeds according to the number of hits of the topic keyword in each blog feed. We also apply the proposed blog distillation framework to the task of cross-lingually analyze multilingual blogs collected with a topic keyword. Here, we cross-lingually and cross-culturally compare less well known facts and opinions that are closely related to a given topic. Preliminary evaluation results support the effectiveness of the proposed framework.