A Transport Service Ontology-based Focused Crawler

Ontology is a technology for conceptualizing specific domain knowledge, which can provide machine-readable definitions to the severed domain. Therefore, ontology can be utilized to enhance the performance of focused crawlers, by precisely defining the crawling boundary. In this paper, we will exhibit a conceptual framework of an ontology-based focused crawler serving in the domain of transport services. Here, a transport service ontology is designed for filtering non-relevant metadata, by means of logically linking the metadata with ontological concepts. In addition, we will provide the evaluation process in order to assess the power of ontology in the focused crawler. Conclusion and further works based on our current evaluation results will be made in the final section.

[1]  Hai Zhuge Semantic grid: scientific issues, infrastructure, and methodology , 2005, CACM.

[2]  John Dunnion,et al.  The use of data mining in the design and implementation of an incident report retrieval system , 2003, IEEE Systems and Information Engineering Design Symposium, 2003.

[3]  Hai Zhuge,et al.  The Web Resource Space Model , 2008 .

[4]  Arputharaj Kannan,et al.  LSCrawler: A Framework for an Enhanced Focused Web Crawler Based on Link Semantics , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[5]  Günther Kundt,et al.  Skin sparing mastectomy with conservation of the nipple-areola-complex and autologous reconstruction is an oncological safe procedure – an extended follow-up study , 2008 .

[6]  G. Aghila,et al.  Ontology-based Web crawler , 2004, International Conference on Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004..

[7]  Hai Zhuge,et al.  Resource Space Grid: model, method and platform , 2004, Concurr. Pract. Exp..

[8]  B. Hammond Ontology , 2004, Lawrence Booth’s Book of Visions.

[9]  Sheng-Yuan Yang An ontological website models-supported search agent for web services , 2008, Expert Syst. Appl..

[10]  Hector Garcia-Molina,et al.  Parallel crawlers , 2002, WWW.

[11]  Jie Liu,et al.  Extended resource space model , 2005, Future Gener. Comput. Syst..

[12]  Hai Zhuge,et al.  The knowledge grid , 2004 .

[13]  Donald Perlis,et al.  Information Retrieval on the World Wide Web and Active Logic: A Survey and Problem Definition , 2002 .

[14]  Iraklis Varlamis,et al.  THESUS: Organizing Web document collections based on link semantics , 2003, The VLDB Journal.

[15]  Gerd Stumme,et al.  Semantic resource management for the web: an e-learning application , 2004, WWW Alt. '04.

[16]  Marja-riitta Koivunen,et al.  W3C Semantic Web Activity , 2001 .

[17]  Hai Zhuge,et al.  Resource space model, OWL and database: Mapping and integration , 2008, TOIT.