Instant Topic Extraction From a Text-Based Communication Channel for Seeing the World
暂无分享,去创建一个
This article proposes three topic extraction methods for a text-based communication channel of a message stream in bulletin board or chat services. A message is input into the channel, and these methods instantly select active and curious topics from the channel by using noun phrases, topic pressure at the latest message in the channel, topic unexpectedness, and partial matching. This serves as a module of a system that enables a user to follow unfolding world developments. An evaluative comparison of the performance of our methods and a conventional method using four data sets from two standpoints was performed. This is the first step in testing the performance of our methods.
[1] Shivakumar Vaithyanathan,et al. Exploiting clustering and phrases for context-based information retrieval , 1997, SIGIR '97.
[2] Yukio Ohsawa,et al. Influence Diffusion Model in Text-Based Communication , 2002 .
[3] Wei-Ying Ma,et al. Learning to cluster web search results , 2004, SIGIR '04.
[4] Shinichiro Takagi,et al. Japanese Morphological Analyzer using Word Co-occurence -JTAG , 1998, COLING-ACL.