Estimating Page Importance based on Page Accessing Frequency

With the vast growth of the Internet, a large number of web pages are available online. Search engines use a component called a web crawler to collect these pages from the web for storage and indexing. Many web pages are autonomous and are updated independently of their users, so users do not know how often the sources change. An incremental crawler revisits the web at specific intervals to update its collection. Users benefit from knowing a page's importance based on how frequently it is accessed. This paper estimates page importance from page accessing frequency and proposes an architecture for doing so.
