Extracting Topics From Weblogs Through Frequency Segments
In this paper, we present an approach to extracting topics from weblogs using the terms that appear in them. We characterize each term by its frequency segments, i.e., sequential occurrences of the term over time, which serve as the unit of characterization. A notable feature of the model is that it approximates changes in the dynamics of term frequencies, capturing the granularity of frequencies from the very beginning of their occurrence. This approximation also makes comparing the frequency patterns of terms more effective. We report results obtained from weblogs covering an event of global significance, namely the London bombings of 2005.
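The abstract does not give a formal definition of a frequency segment, so the following is only a minimal sketch under one possible reading: a segment is a maximal run of consecutive time steps in which the term occurs at least once. The function name `frequency_segments`, the daily granularity, and the sample counts are all hypothetical and are not taken from the paper.

```python
from typing import List, Tuple


def frequency_segments(counts: List[int]) -> List[Tuple[int, List[int]]]:
    """Split a per-time-step count series into maximal runs of non-zero counts.

    Assumption (not from the paper): a "frequency segment" is read here as a
    run of consecutive time steps with at least one occurrence of the term.
    Returns (start_index, counts_in_segment) pairs.
    """
    segments: List[Tuple[int, List[int]]] = []
    start = None  # index where the current segment began, if any
    for step, count in enumerate(counts):
        if count > 0 and start is None:
            start = step  # a new segment begins
        elif count == 0 and start is not None:
            segments.append((start, counts[start:step]))  # segment ends
            start = None
    if start is not None:
        segments.append((start, counts[start:]))  # series ended mid-segment
    return segments


# Hypothetical daily counts for a single term.
daily_counts = [0, 2, 5, 3, 0, 0, 1, 4, 0]
segments = frequency_segments(daily_counts)
```

Under this reading, each term is represented by a list of segments rather than by one long sparse series, so two terms could be compared segment by segment from the first step of each occurrence run.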