A framework of service oriented semantic search engine

A service oriented semantic search engine framework is proposed to extract automatically and accurately information on service provider on the Internet. Domain knowledge on theme and abstract service provider are modeled by ontology to guide information extraction. Theme identification for each web page is executed after the page is fetched by search engine. A page or a part of content of a page is assigned a theme. Concrete service provider entity and attributes are extracted from content of web pages based on themes of the pages. Several algorithms are developed. Computational experiments demonstrate that service oriented semantic search engine shows high recall rate and precision. Highest recall rate of concrete service provider entity exceeds 97% on tested web sites. Precision of concrete service provider entity reaches up to 100% on two tested web sites. It also shows good recall rate and precision of concrete service provider attributes.

[1]  Yuhong Yan,et al.  Between Service Science and Service-Oriented Software Systems , 2008, 2008 IEEE Congress on Services Part II (services-2 2008).

[2]  B.H. Chandrashekar,et al.  Semantic domain specific search engine , 2010, 2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE).

[3]  Yuekui Yang,et al.  A Topic-Specific Web Crawler with Web Page Hierarchy Based on HTML Dom-Tree , 2009, 2009 Asia-Pacific Conference on Information Processing.

[4]  Li Liu,et al.  Web information extraction based on news domain ontology theory , 2010, 2010 IEEE 2nd Symposium on Web Society.

[5]  Gao Honghao,et al.  A design and implementation of search engine for mobile devices based Chinese semantics and reasoning , 2010, 2010 International Conference On Computer Design and Applications.

[6]  Zubair A. Shaikh,et al.  SWISE: Semantic Web based intelligent search engine , 2010, 2010 International Conference on Information and Emerging Technologies.

[7]  Sheau-Ling Hsieh,et al.  Semantic similarity measure in biomedical domain leverage Web Search Engine , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[8]  Danushka Bollegala,et al.  A Web Search Engine-Based Approach to Measure Semantic Similarity between Words , 2011, IEEE Transactions on Knowledge and Data Engineering.

[9]  Xiaoyao Xie,et al.  The design and realization of open-source search engine based on Nutch , 2010, 2010 International Conference on Anti-Counterfeiting, Security and Identification.

[10]  Lizhen Li,et al.  Ontology of General Concept for Semantic Searching , 2010, 2010 Second International Conference on Computer Modeling and Simulation.