A Phrase-based Ontology Enabled Semantic Processing System for Web Search

Semantic processing system (SPS) is a system that performs phrase search of web content. SPS takes a user query in natural language, converts it to a keyword query, expands the keyword query with synonyms, hypernyms, hyponyms, and meronyms, and presents the keyword query to a search engine. SPS then sifts through the search engine result pages extracting grammatical and semantic information from each page for computing the page’s relevance to the natural language query. SPS' relevance computation uses semantic matching of phrases rather than term-and-document frequency weighting—a method that is most commonly used by existing web search engines. SPS consults an ontology that is both “crowd-sourced,” i.e., built collaboratively and incrementally by the large number of users and “auto-learned,” i.e., contextually inferred from sentences containing desired words. SPS would be suitable for the areas of biomedical literature mining, legal document review and discovery, and news/RSS feed monitoring because these are laden with prose text. We implemented a prototype SPS, experimented with it and demonstrate that SPS outperforms a representative keyword based search engine. The strength of SPS stems from its exploitation of phrase semantics, which is not used in the conventional search engines.

[1]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[2]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[3]  Dan Brickley,et al.  Rdf vocabulary description language 1.0 : Rdf schema , 2004 .

[4]  Gary G. Hendrix,et al.  LEARNING BY BEING TOLD: ACQUIRING KNOWLEDGE FOR INFORMATION MANAGEMENT , 1983 .

[5]  Gary G. Hendrix,et al.  An Approach to Acquiring and Applying Knowledge , 1980, AAAI.

[6]  Michael Schroeder,et al.  GoWeb: a semantic search engine for the life science web , 2009, BMC Bioinformatics.

[7]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[8]  Yuzhong Qu,et al.  Falcons: searching and browsing entities on the semantic web , 2008, WWW.

[9]  John F. Sowa,et al.  Conceptual Structures: Information Processing in Mind and Machine , 1983 .

[10]  Marie-Claude Landau,et al.  Conceptual Graphs for Semantics and Knowledge Processing , 1986, IBM J. Res. Dev..

[11]  Ian Horrocks,et al.  Ontologies and the semantic web , 2008, CACM.

[12]  Deborah L. McGuinness,et al.  OWL Web ontology language overview , 2004 .

[13]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[14]  John F. Sowa,et al.  Knowledge representation: logical, philosophical, and computational foundations , 2000 .

[15]  Michael R. Genesereth,et al.  Software agents , 1994, CACM.

[16]  Timothy W. Finin,et al.  Swoogle: a search and metadata engine for the semantic web , 2004, CIKM '04.

[17]  Jeremy J. Carroll,et al.  Resource description framework (rdf) concepts and abstract syntax , 2003 .

[18]  Ivan Herman,et al.  RDFa 1.1 Primer , 2012 .

[19]  Douglas B. Lenat,et al.  CYC: a large-scale investment in knowledge infrastructure , 1995, CACM.

[20]  C E Lipscomb,et al.  Medical Subject Headings (MeSH). , 2000, Bulletin of the Medical Library Association.

[21]  H. Lan,et al.  SWRL : A semantic Web rule language combining OWL and ruleML , 2004 .

[22]  Enrico Motta,et al.  Watson: a gateway for next generation semantic web applications , 2007 .

[23]  Philip Bubeck,et al.  Ho Chi Minh City , 2010 .

[24]  Jürgen Umbrich,et al.  Searching and browsing Linked Data with SWSE: The Semantic Web Search Engine , 2011, J. Web Semant..