Synthesizing structured text from logical database subsets

In the classical database world, information access has been based on a paradigm of structured, schema-aware queries and tabular answers. In the current environment, however, where information pervades most activities of society and serves people, applications, and devices in dramatically increasing numbers, this paradigm has proved very limited. On the query side, much work has been done on moving towards keyword queries over structured data. In our previous work, we addressed the answer side as well and proposed a paradigm that generates entire databases in response to keyword queries. In this paper, we continue in the same direction and propose synthesizing textual answers in response to queries of any kind over structured data. In particular, we study the transformation of a dynamically generated logical database subset into a narrative through a customizable, extensible, and template-based process. In doing so, we exploit the structured nature of database schemas and describe three generic translation modules for different formations in the schema, called unary, split, and join modules. We have implemented the proposed translation procedure in our own database front end and have performed several experiments evaluating the textual answers generated as several features and parameters of the system are varied. We have also conducted a set of experiments measuring the effectiveness of such answers for users. The overall results are very encouraging and indicate the promise of our approach for several applications.
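
To make the idea of template-based translation concrete, the following is a minimal sketch and not the translation procedure of the paper. It assumes a hypothetical two-relation subset (a director tuple and the movie tuples that reference it), hypothetical per-relation templates, and invented sample data. It shows only the general technique of filling templates with attribute values and combining joined tuples into one sentence; it does not attempt to reproduce the unary, split, and join modules, whose definitions are not given here.

```python
from string import Template

# Hypothetical templates, one per relation; placeholders name attributes.
TEMPLATES = {
    "director": Template("$name was born in $birth_year."),
    "movie": Template("$title ($year)"),
}


def render_tuple(relation, row):
    """Fill the relation's template with one tuple's attribute values."""
    return TEMPLATES[relation].substitute(row)


def render_list(items):
    """Join phrases into an English list: 'a', 'a and b', 'a, b, and c'."""
    if len(items) <= 2:
        return " and ".join(items)
    return ", ".join(items[:-1]) + ", and " + items[-1]


def render_subset(director, movies):
    """Narrate a director tuple followed by the movies joined to it."""
    sentences = [render_tuple("director", director)]
    # Keep only the movie tuples whose foreign key matches this director.
    joined = [m for m in movies if m["director_id"] == director["id"]]
    if joined:
        titles = render_list([render_tuple("movie", m) for m in joined])
        sentences.append(f"{director['name']} directed {titles}.")
    return " ".join(sentences)


# Invented sample data, for illustration only.
director = {"id": 1, "name": "Jane Doe", "birth_year": 1970}
movies = [
    {"title": "First Film", "year": 2001, "director_id": 1},
    {"title": "Second Film", "year": 2005, "director_id": 1},
    {"title": "Unrelated Film", "year": 2010, "director_id": 2},
]

print(render_subset(director, movies))
```

Keeping the templates in a dictionary separate from the rendering functions is one simple way to keep such a process customizable: changing the wording of the narrative means editing a template string rather than the code.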
