Free-Text Search over Complex Web Forms

This paper investigates free-text queries as an alternative means of searching 'behind' web forms. We introduce a novel language for specifying free-text interfaces and report the results of a user study in which we evaluated our prototype in a travel planner scenario. Our results show that users prefer the free-text interface over the original web form and that they complete their search tasks about 9% faster on average.
