Information Extraction Using Web Usage Mining, Web Scraping and Semantic Annotation

Extracting useful information from the web is a central concern in realizing the Semantic Web. Several techniques contribute to this goal, among which web usage mining, web scraping and semantic annotation play an important role. Web mining helps find relevant results on the web and extracts meaningful information from the patterns discovered in data held on servers. Web usage mining is a type of web mining that mines information about the access paths and behaviour of users visiting web sites. Web scraping is the process of extracting useful information from HTML pages; it may be implemented using Prolog Server Pages (PSP), a scripting language based on Prolog. Semantic annotation adds semantics and a formal structure to unstructured textual documents, an important aspect of semantic information extraction, and may be performed with a tool known as KIM (Knowledge Information Management). In this paper, we revisit and discuss these three information extraction techniques, with the aim of more efficient information extraction on the web, and illustrate them with examples.
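To make the web scraping step concrete, the following is a minimal sketch of extracting structured data (link targets and their anchor text) from an HTML page. It is written in Python with the standard-library `html.parser` module rather than in Prolog Server Pages, so it should be read as an illustration of the general idea and not as the method used in the paper. The names `LinkExtractor`, `extract_links` and `SAMPLE_HTML`, as well as the sample page itself, are assumptions introduced for this example.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collects (href, anchor text) pairs from <a> elements (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []      # extracted (href, text) pairs
        self._href = None    # href of the <a> element currently open, if any
        self._text = []      # text fragments seen inside that element

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            # attrs is a list of (name, value) pairs; href may be absent
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside a link with an href
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Return every (href, anchor text) pair found in an HTML string."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links


# Made-up sample page, used only for illustration
SAMPLE_HTML = """
<html><body>
<h1>Reading list</h1>
<ul>
<li><a href="/papers/usage-mining.html">Web usage mining</a></li>
<li><a href="/papers/annotation.html">Semantic <b>annotation</b></a></li>
</ul>
</body></html>
"""

if __name__ == "__main__":
    for href, text in extract_links(SAMPLE_HTML):
        print(href, "->", text)
```

The extracted pairs are plain data, so they could in principle be passed on to a later annotation step; how that hand-off would work is outside what this sketch covers.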
