Web Pages Reordering and Clustering Based on Web Patterns

In this paper was proposed a method for the description of web pages using web patterns. We will explain what we mean by the term "web pattern". We will present a taxonomy web patterns and a description of some their types. In the description of web patterns we will focus on properties which are useful for automatic detection on web pages. As a result of the detection we get a description of a web page using found web patterns. The description can be used for reordering and clustering of a web page set.

[1]  Khaled Shaalan,et al.  A Survey of Web Information Extraction Systems , 2006, IEEE Transactions on Knowledge and Data Engineering.

[2]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[3]  Václav Snásel,et al.  Web Page Analysis: Experiments Based on Discussion and Purchase Web Patterns , 2007, 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops.

[4]  Wei-Ying Ma,et al.  Web object retrieval , 2007, WWW '07.

[5]  Václav Snásel,et al.  Semantic Analysis of Web Pages Using Web Patterns , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[6]  John Mylopoulos,et al.  Text Mining Through Semi Automatic Semantic Annotation , 2006, PAKM.

[7]  Alberto H. F. Laender,et al.  Automatic web news extraction using tree edit distance , 2004, WWW '04.

[8]  Alexander Chatzigeorgiou,et al.  Design Pattern Detection Using Similarity Scoring , 2006, IEEE Transactions on Software Engineering.

[9]  Wei-Ying Ma,et al.  Improving pseudo-relevance feedback in web information retrieval using web page segmentation , 2003, WWW '03.

[10]  Soumen Chakrabarti,et al.  Mining the web - discovering knowledge from hypertext data , 2002 .

[11]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[12]  Max Jacobson,et al.  A Pattern Language: Towns, Buildings, Construction , 1981 .

[13]  Melody Y. Ivory,et al.  Evolution of web site design patterns , 2005, TOIS.

[14]  Aleksander Pivk,et al.  Thesis: automatic ontology generation from web tabular structures , 2006 .

[15]  Václav Snásel,et al.  GUI Patterns and Web Semantics , 2007, 6th International Conference on Computer Information Systems and Industrial Management Applications (CISIM'07).

[16]  Wei-Ying Ma,et al.  Object-level Vertical Search , 2007, CIDR.

[17]  James A. Landay,et al.  The Design of Sites: Patterns, Principles, and Processes for Crafting a Customer-Centered Web Experience , 2002 .

[18]  Václav Snásel,et al.  Semantic Analysis of Web Pages Using Cluster Analysis and Nonnegative Matrix Factorization , 2007, AWIC.

[19]  Jenifer Tidwell Designing Interfaces , 2005 .

[20]  Andrew M. Dearden,et al.  Pattern Languages in HCI: A Critical Review , 2006, Hum. Comput. Interact..

[21]  Jing Dong,et al.  Experiments on Design Pattern Discovery , 2007, Third International Workshop on Predictor Models in Software Engineering (PROMISE'07: ICSE Workshops 2007).