A Web Document Retrieval Algorithm Based on Particle Swarm Optimization

With advances in the computer technologies and the rapid development of Internet, information on the Internet is increasing exponentially. To efficiently retrieve relevant documents from the explosive growth of the Internet and other sources of information access, a novel Web document retrieval algorithm based on particle swarm optimization (PSO) and linear discriminant analysis (LDA) algorithm is proposed to deal this problem. Experimental results clearly demonstrate its effectiveness and efficiency.

[1]  Jorng-Tzong Horng,et al.  Applying genetic algorithms to query optimization in document retrieval , 2000, Inf. Process. Manag..

[2]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[3]  Amit Singhal,et al.  Pivoted document length normalization , 1996, SIGIR 1996.

[4]  Tao Jiang,et al.  Robust and accurate cancer classification with gene expression profiling , 2005, 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05).

[5]  Golub Gene H. Et.Al Matrix Computations, 3rd Edition , 2007 .

[6]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[7]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[8]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[9]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[10]  David D. Lewis,et al.  Reuters-21578 Text Categorization Test Collection, Distribution 1.0 , 1997 .

[11]  Chris Buckley,et al.  Pivoted Document Length Normalization , 1996, SIGIR Forum.

[12]  James Kennedy,et al.  The particle swarm: social adaptation of knowledge , 1997, Proceedings of 1997 IEEE International Conference on Evolutionary Computation (ICEC '97).