Omnibase: Uniform Access to Heterogeneous Data for Question Answering

Although the World Wide Web contains a tremendous amount of information, the lack of uniform structure makes finding the right knowledge difficult. A solution is to turn the Web into a "virtual database" and to access it through natural language. We built Omnibase, a system that integrates heterogeneous data sources using an object-property-value model. With the help of Omnibase, our START natural language system can now access numerous heterogeneous data sources on the Web in a uniform manner, and answers millions of user questions with high precision.

[1]  Craig A. Knoblock,et al.  A hierarchical approach to wrapper induction , 1999, AGENTS '99.

[2]  Boris Katz,et al.  Annotating the World Wide Web using Natural Language , 1997, RIAO.

[3]  Lucy Vanderwende,et al.  Automatically Deriving Structured Knowledge Bases From On-Line Dictionaries , 1993 .

[4]  MiningChun-Nan Hsu Finite-state Transducers for Semi-structured Text Mining , 1999 .

[5]  Jimmy J. Lin The Web as a Resource for Question Answering: Perspectives and Challenges , 2002, LREC.

[6]  Jimmy J. Lin,et al.  Annotating the Semantic Web Using Natural Language , 2002, NLPXML@COLING.

[7]  Divesh Srivastava,et al.  The Information Manifold , 1995 .

[8]  Brad Adelberg,et al.  NoDoSE—a tool for semi-automatically extracting structured and semistructured data from text documents , 1998, SIGMOD '98.

[9]  Roy Goldman,et al.  Lore: a database management system for semistructured data , 1997, SGMD.

[10]  Alberto O. Mendelzon,et al.  Database techniques for the World-Wide Web: a survey , 1998, SGMD.

[11]  Craig A. Knoblock,et al.  The Ariadne Approach to Web-Based Information Integration , 2001, Int. J. Cooperative Inf. Syst..

[12]  Nicholas Kushmerick,et al.  Wrapper Induction for Information Extraction , 1997, IJCAI.

[13]  Paolo Merialdo,et al.  Semistructured and structured data in the Web: going back and forth , 1997, SGMD.

[14]  Boris Katz,et al.  Using English for Indexing and Retrieving , 1991 .

[15]  Peter Thanisch,et al.  Natural language interfaces to databases – an introduction , 1995, Natural Language Engineering.

[16]  Hector Garcia-Molina,et al.  Extracting Semistructured Information from the Web. , 1997 .