An Semantic Rank for Web Crawler Based on Formal Concept Analysis

Web Crawler is an important research in Search Engine. In this paper, a method for measuring the similarity of FCA concepts is proposed by using information content approach based on user Web log. In process of crawling Web pages for Web Crawler, in order to make choice of Web pages, the semantic rank of Web pages can be determined by using the similarity, other than relying on ontology with human domain expertise. The semantic rank can be made choice of Web pages for Web crawler.

[1]  Rohana K. Rajapakse,et al.  Text retrieval with more realistic concept matching and reinforcement learning , 2006, Inf. Process. Manag..

[2]  Patricia Bouyer,et al.  Improved undecidability results on weighted timed automata , 2006, Inf. Process. Lett..

[3]  Evangelos E. Milios,et al.  Using HMM to learn user browsing patterns for focused Web crawling , 2006, Data & Knowledge Engineering.

[4]  Arnon Rungsawang,et al.  Learnable topic-specific web crawler , 2002, J. Netw. Comput. Appl..

[5]  Song Liang Design of Crawler's Algorithm and Implementation of Crawler's Program , 2004 .

[6]  R. Wille Concept lattices and conceptual knowledge systems , 1992 .

[7]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[8]  Zheng Pei,et al.  Intelligent Spider's Algorithm of Search Engine Based on Keyword , 2005 .

[9]  M. M. Sufyan Beg A subjective measure of web search quality , 2005, Inf. Sci..

[10]  Anna Formica,et al.  Concept similarity in Formal Concept Analysis: An information content approach , 2008, Knowl. Based Syst..

[11]  Hector Garcia-Molina,et al.  Efficient Crawling Through URL Ordering , 1998, Comput. Networks.

[12]  Rudolf Wille,et al.  Restructuring Lattice Theory: An Approach Based on Hierarchies of Concepts , 2009, ICFCA.

[13]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[14]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[15]  Michel C. A. Klein,et al.  The semantic web: yet another hip? , 2002, Data Knowl. Eng..

[16]  Harith Alani,et al.  Content-based ontology ranking , 2006 .

[17]  Rudolf Wille,et al.  Lattices in Data Analysis: How to Draw Them with a Computer , 1989 .

[18]  Anna Formica,et al.  Ontology-based concept similarity in Formal Concept Analysis , 2006, Inf. Sci..

[19]  Michael Bain,et al.  Inductive Construction of Ontologies from Formal Concept Analysis , 2003, Australian Conference on Artificial Intelligence.