Incorporating Domain Knowledge and Information Retrieval Techniques to Develop an Architectural/Engineering/Construction Online Product Search Engine

This paper introduces a domain-specific search engine developed to take advantage of the growing volume of online product information for surveying the virtual product market. Product domain knowledge was combined with query expansion operations and the extended Boolean retrieval model to address issues arising in the search engine's development. The search engine was designed to (1) represent and utilize knowledge in the product domain; (2) identify online product information; and (3) evaluate the collected online product information. A prototype search engine was developed and statistically validated with five data sets, each derived from a different type of product according to the MasterFormat (Alexandria, Va.) categorization. The validation results indicated that, compared with the tested general search engine or aggregated information service, the prototype identified more distinct product manufacturers for procurement-related decision support.
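The combination of query expansion with extended Boolean retrieval can be illustrated with a minimal sketch. It assumes the standard p-norm formulation of the extended Boolean model; the abstract does not state which variant or parameters the prototype uses. The thesaurus entries, term weights, and function names below are hypothetical and chosen only for illustration.

```python
from typing import Dict, List

# Hypothetical mini-thesaurus standing in for product domain knowledge.
THESAURUS: Dict[str, List[str]] = {
    "steel": ["steel", "metal"],
    "door": ["door", "doorway"],
}


def or_score(weights: List[float], p: float = 2.0) -> float:
    """p-norm OR: high if any term weight is high."""
    if not weights:
        return 0.0
    return (sum(w ** p for w in weights) / len(weights)) ** (1.0 / p)


def and_score(weights: List[float], p: float = 2.0) -> float:
    """p-norm AND: high only if all term weights are high."""
    if not weights:
        return 0.0
    return 1.0 - (sum((1.0 - w) ** p for w in weights) / len(weights)) ** (1.0 / p)


def expand(term: str) -> List[str]:
    """Query expansion: replace a term with its thesaurus entries, if any."""
    return THESAURUS.get(term, [term])


def score(query_terms: List[str], doc_weights: Dict[str, float], p: float = 2.0) -> float:
    """Score a document against an AND query whose terms are each
    expanded into an OR over related terms."""
    concept_scores = [
        or_score([doc_weights.get(t, 0.0) for t in expand(q)], p)
        for q in query_terms
    ]
    return and_score(concept_scores, p)


# A page that says "metal door" still matches the query "steel door"
# because "steel" is expanded to include "metal".
page = {"metal": 1.0, "door": 1.0}
print(round(score(["steel", "door"], page), 4))
```

Unlike strict Boolean matching, the p-norm scores are graded, so a page that matches only some expanded terms is ranked lower instead of being excluded outright.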
