Extraction of link context using tag tree and LALR parsing

Extraction of link context is used to know the theme of the target web page. This Link Context is used in many tasks like categorization of the web page, focused crawling. In this paper we have proposed a method to extract the link context with the help of tag tree approach and parsing method. Tag tree approach will help to find the concept of the anchor text and this concept will be used by LALR parser followed by the algorithm for extraction of link context.

[1]  Filippo Menczer,et al.  Topical Crawling for Business Intelligence , 2003, ECDL.

[2]  Padmini Srinivasan,et al.  Link Contexts in Classifier-Guided Topical Crawlers , 2006, IEEE Trans. Knowl. Data Eng..

[3]  Gautam Pant Deriving link-context from HTML tag tree , 2003, DMKD '03.

[4]  Martin van den Berg,et al.  Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery , 1999, Comput. Networks.

[5]  Giuseppe Attardi,et al.  Automatic Web Page Categorization by Link and Context Analysis , 1999 .

[6]  Wanli Zuo,et al.  Deriving Link Context through Dependency Analysis , 2009, 2009 International Conference on Education Technology and Computer.

[7]  Yoelle Maarek,et al.  The Shark-Search Algorithm. An Application: Tailored Web Site Mapping , 1998, Comput. Networks.

[8]  Filippo Menczer,et al.  Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web , 2000, Machine Learning.

[9]  Wanli Zuo,et al.  Extracting Precise Link Context Using NLP Parsing Technique , 2004, IEEE/WIC/ACM International Conference on Web Intelligence (WI'04).

[10]  Jon M. Kleinberg,et al.  Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text , 1998, Comput. Networks.

[11]  Soumen Chakrabarti,et al.  Accelerated focused crawling through online relevance feedback , 2002, WWW.

[12]  Manjeet Singh,et al.  A Rule-Based Approach for Extraction of Link-Context from Anchor-Text Structure , 2012, ISI.

[13]  Toyoaki Nishida,et al.  IICA: An Ontology-based Internet Navigation System , 2002 .

[14]  Oliver A. McBryan,et al.  GENVL and WWWW: Tools for taming the Web , 1994, WWW Spring 1994.