Mining and visualising information from RSS feeds: a case study

Purpose – Recent years have seen “really simple syndication” or “rich site summary” (RSS) syndication of frequently updated content become ubiquitous across the internet. RSS's XML‐based format allows these data to be stored in a semi‐structured format but, despite the presence of online aggregators and readers, and the related work in clustering feeds and mining subjects by keywords, much potentially useful information present in RSS may remain undiscovered. This paper aims to address this issue in an experimental setting.

Design/methodology/approach – This paper presents two distinct technologies which employ the semi‐structured nature of RSS content to allow users to mine information directly from raw RSS feeds: occurrence mining counts occurrences of text strings in feeds, whilst value mining mines structured ticker tape numeric data. It describes both technologies and their implementation in an experiment, where 35 students mined small numbers of RSS feeds and visualised the data mined.

Findings – This ...
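The two techniques named in the abstract, occurrence mining and value mining, can be illustrated with a minimal Python sketch. This is not the authors' implementation, which the abstract does not describe in detail. The sample feed, the function names (`item_texts`, `count_occurrences`, `mine_values`) and the "symbol followed by a number" ticker layout are all assumptions made for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical RSS 2.0 feed, invented for this sketch.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example market feed</title>
    <item>
      <title>ACME shares rise</title>
      <description>ACME 101.5 up on strong ACME earnings</description>
    </item>
    <item>
      <title>Markets flat</title>
      <description>ACME 99.0 as trading slows</description>
    </item>
  </channel>
</rss>"""


def item_texts(feed_xml):
    """Return the title and description text of every <item> in the feed."""
    root = ET.fromstring(feed_xml)
    texts = []
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        description = item.findtext("description", default="")
        texts.append(f"{title} {description}")
    return texts


def count_occurrences(feed_xml, term):
    """Occurrence mining (sketch): case-insensitive count of a text string."""
    needle = term.lower()
    return sum(text.lower().count(needle) for text in item_texts(feed_xml))


def mine_values(feed_xml, symbol):
    """Value mining (sketch): collect numbers that directly follow a symbol.

    Assumes ticker-style text of the form "<symbol> <number>".
    """
    pattern = re.compile(rf"\b{re.escape(symbol)}\s+(-?\d+(?:\.\d+)?)")
    values = []
    for text in item_texts(feed_xml):
        values.extend(float(match) for match in pattern.findall(text))
    return values


if __name__ == "__main__":
    print(count_occurrences(SAMPLE_FEED, "acme"))
    print(mine_values(SAMPLE_FEED, "ACME"))
```

Restricting both functions to `<item>` elements is a design choice of this sketch: it uses the semi-structured layout of RSS so that channel-level metadata does not inflate the counts. The resulting counts or value series could then be passed to any charting tool for visualisation.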

Mark Levene | Martin O'Shea
