Bridging the Syntactic and the Semantic Web Search

This paper proposes an information system, which aims to bridge the semantic gap in web search. The system uses multiple domain ontological structures expanding the user's query with semantically related concepts, enhancing in parallel the quality of retrieval to a large extend. Query analyzers broaden the user's information needs from classical term-based to conceptually representations, using knowledge from relevant ontologies and theirs' properties. Besides the use of semantics, the system employs machine learning techniques from the field of swarm intelligence through the Ant Colony algorithm, where ants are considered as web agents capable of collecting and processing relevant information. Furthermore, the effectiveness of the approach is verified experimentally, by observing that the retrieval precision for the enhanced queries is in higher levels, in comparison with the results derived from the classical term-based retrieval procedure.

[1]  Peter W. Foltz,et al.  An introduction to latent semantic analysis , 1998 .

[2]  Ioannis Anagnostopoulos,et al.  A generalised regression algorithm for Web page categorisation , 2004, Neural Computing & Applications.

[3]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[4]  Yun Peng,et al.  Swoogle: A semantic web search and metadata engine , 2004, CIKM 2004.

[5]  Luca Maria Gambardella,et al.  An Ant Colony Optimization Approach to the Probabilistic Traveling Salesman Problem , 2002, PPSN.

[6]  Luca Maria Gambardella,et al.  Ant Algorithms for Discrete Optimization , 1999, Artificial Life.

[7]  David Hawking,et al.  Merging Results From Isolated Search Engines , 1999, Australasian Database Conference.

[8]  Marco Dorigo,et al.  Ant system: optimization by a colony of cooperating agents , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[9]  Ioannis Anagnostopoulos,et al.  Classifying Web pages employing a probabilistic neural network , 2004, IEE Proc. Softw..

[10]  Stephen Chen,et al.  Commonality and Genetic Algorithms , 1996 .

[11]  Marco Dorigo,et al.  Swarm intelligence: from natural to artificial systems , 1999 .

[12]  Marco Dorigo,et al.  The ant colony optimization meta-heuristic , 1999 .

[13]  Dik Lun Lee,et al.  Server Ranking for Distributed Text Retrieval Systems on the Internet , 1997, DASFAA.

[14]  Timothy W. Finin,et al.  Swoogle: a search and metadata engine for the semantic web , 2004, CIKM '04.

[15]  Gian Luca Foresti,et al.  A distributed probabilistic system for adaptive regulation of image processing parameters , 1996, IEEE Trans. Syst. Man Cybern. Part B.