Web Text Clustering for Personalized E-learning Based on Maximal Frequent Itemsets

With the rapid development of the network technique and the prevalence of the Internet, e-learning has become the major trend of the development of international education since 1980s, and the important access for the internationalization and the information of education. To meet the personalized needs of learners in e-learning, a new Web text clustering method for personalized e-learning based on maximal frequent itemsets is proposed. Firstly, the Web documents are represented by vector space model. Then, maximal frequent word sets are discovered. Finally, based on a new similarity measure of itemsets, maximal itemsets are used for clustering documents. Experimental results show that the proposed method is effective.

[1]  Daniel Boley,et al.  Principal Direction Divisive Partitioning , 1998, Data Mining and Knowledge Discovery.

[2]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[3]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[4]  Jiawei Han,et al.  Frequent pattern mining: current status and future directions , 2007, Data Mining and Knowledge Discovery.

[5]  Sebastián Ventura,et al.  Data Mining in E-learning , 2006 .

[6]  Martin Ester,et al.  Frequent term-based text clustering , 2002, KDD.

[7]  Mohamed S. Kamel,et al.  Phrase-based document similarity based on an index graph model , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[8]  Bingru Yang,et al.  Index-Maxminer: a New Maximal Frequent Itemset Mining Algorithm , 2008, Int. J. Artif. Intell. Tools.

[9]  Pilian He,et al.  A Study on Text Clustering Algorithms Based on Frequent Term Sets , 2005, ADMA.

[10]  Benjamin C. M. Fung,et al.  Hierarchical Document Clustering using Frequent Itemsets , 2003, SDM.

[11]  Ramayya Krishnan,et al.  Incremental hierarchical clustering of text documents , 2006, CIKM '06.

[12]  Hajo Hippner,et al.  Text Mining , 2006, Informatik-Spektrum.

[13]  Xiaolong Wang,et al.  ConSOM: A conceptional self-organizing map model for text clustering , 2008, Neurocomputing.