An Effective Web Mining Algorithm using Link Analysis

The search engines, such as Google, Yahoo and Bing, provide a powerful information retrieval on the Web. A number of Web Mining algorithms, such as PageRank, Weighted PageRank and HITS, are commonly used to categorize and rank the search results. The motive behind this paper is to present and analyze the currently important algorithms for ranking of web pages such as PageRank and Weighted PageRank and HITS. Second, this paper proses a ranking algorithm based on Weighted PageRank and the existing profile of the user to yield more accurate search results. Simulation Program is developed for the prosed algorithm. The experimental results shows that the proposed algorithm provides acceptable results compared to the Weighted PageRank algorithms.

[1]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[2]  Ashutosh Kumar Singh,et al.  Web Structure Mining: Exploring Hyperlinks and Algorithms for Information Retrieval , 2010 .

[3]  Komal Kumar Bhatia,et al.  Page Ranking Algorithms: A Survey , 2009, 2009 IEEE International Advance Computing Conference.

[4]  Wenpu Xing,et al.  Weighted PageRank algorithm , 2004, Proceedings. Second Annual Conference on Communication Networks and Services Research, 2004..

[5]  Feng Li Extracting Structure of Web Site Based on Hyperlink Analysis , 2008, 2008 4th International Conference on Wireless Communications, Networking and Mobile Computing.

[6]  James Allan,et al.  A Comparative Study of Utilizing Topic Models for Information Retrieval , 2009, ECIR.

[7]  Jon M. Kleinberg,et al.  Mining the Web's Link Structure , 1999, Computer.

[8]  Hendrik Blockeel,et al.  Web mining research: a survey , 2000, SKDD.

[9]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[10]  Sankar K. Pal,et al.  Comparing Scores Intended for Ranking , 2009, IEEE Transactions on Knowledge and Data Engineering.

[11]  David Cohn,et al.  Learning to Probabilistically Identify Authoritative Documents , 2000, ICML.

[12]  Jaideep Srivastava,et al.  Web mining: information and pattern discovery on the World Wide Web , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[13]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[14]  Durgesh Kumar Mishra,et al.  Knowledge Discovery and Retrieval on World Wide Web Using Web Structure Mining , 2010, 2010 Fourth Asia International Conference on Mathematical/Analytical Modelling and Computer Simulation.

[15]  Yanchun Zhang,et al.  Effectively Finding Relevant Web Pages from Linkage Information , 2003, IEEE Trans. Knowl. Data Eng..