Association based classification for relational data and its use in web mining

Classification based on mining association rules is a method with good accuracy and human readable classification model. The aim of this paper is to propose modification of the basic association based classification method, which can be used for the data extracted from web pages. In this paper, the modifications of the method and necessary discretization of numeric attributes will be described. Next, the experiments with various data will be presented, with emphasis on data obtained by extraction and segmentation of web pages.

[1]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[2]  Wei-Ying Ma,et al.  VIPS: a Vision-based Page Segmentation Algorithm , 2003 .

[3]  Hong Shen,et al.  An Adaptive Method of Numerical Attribute Merging for Quantitative Association Rule Mining , 1999, ICSC.

[4]  Xiaogang Peng,et al.  Automatic web page classification in a dynamic and hierarchical way , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[5]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[6]  Radek Burget Automatic Document Structure Detection for Data Integration , 2007, BIS.

[7]  Jaroslav Zendulka,et al.  Mining Association Rules from Relational Data - Average Distance Based Method , 2003, CoopIS/DOA/ODBASE.

[8]  Gerhard Widmer,et al.  Relative Unsupervised Discretization for Association Rule Mining , 2000, PKDD.

[9]  Osmar R. Zaïane,et al.  Text document categorization by term association , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[10]  Gregg C. Vanderheiden,et al.  Web content accessibility guidelines 1.0 , 2001, INTR.

[11]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[12]  Zahir Tari,et al.  On the Move to Meaningful Internet Systems. OTM 2018 Conferences , 2018, Lecture Notes in Computer Science.

[13]  Wee Sun Lee,et al.  Towards Understanding the Functions of Web Element , 2004, AIRS.