Web Page Segmentation for Small Screen Devices Using Tag Path Clustering Approach

Most web pages today are designed for display on desktop PCs, so viewing them in mobile web browsers is difficult. Because mobile devices have limited resources and small screens, their users must repeatedly scroll both vertically and horizontally through complex sites. To address the resource limitations of small screen devices, a web page segmentation method based on tag path clustering is proposed, which reduces the memory demand on small handheld devices. To segment web pages, both a technique for detecting reappearing key patterns and page layout information are used to improve segmentation accuracy.

Keywords: DOM (Document Object Model), key patterns, tag path clustering, web page segmentation.
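As a rough illustration of the idea of tag path clustering, the sketch below groups the text nodes of a page by their root-to-node tag path and keeps only paths that recur, treating each recurring path as a candidate repeated block. This is a minimal sketch under stated assumptions, not the method proposed here: the names `TagPathCollector`, `cluster_by_tag_path`, and `min_repeats` are hypothetical, exact path equality stands in for whatever similarity measure the actual clustering uses, and page layout information is not modeled at all.

```python
from collections import defaultdict
from html.parser import HTMLParser

# Elements that never have a closing tag, so they are not pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TagPathCollector(HTMLParser):
    """Record the root-to-node tag path of every non-empty text node."""

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open tags, outermost first
        self.paths = []   # list of (tag_path, text) pairs

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed children.
        if tag in self.stack:
            while self.stack and self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.paths.append(("/".join(self.stack), text))


def cluster_by_tag_path(html, min_repeats=2):
    """Group text nodes by identical tag path (assumption: exact match).

    Paths that occur fewer than `min_repeats` times are dropped, so the
    result holds only candidate repeated patterns.
    """
    collector = TagPathCollector()
    collector.feed(html)
    collector.close()

    clusters = defaultdict(list)
    for path, text in collector.paths:
        clusters[path].append(text)
    return {p: texts for p, texts in clusters.items() if len(texts) >= min_repeats}
```

Calling `cluster_by_tag_path` on a page with a list of items and a single paragraph would return one cluster for the list items' shared path and omit the paragraph, since its path occurs only once.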
