A Two-stage Evaluation of User Query Performance for the Relational Model and SQL

How do a data model and a query language affect user query performance? This is a longstanding research question in database querying. Prior studies have reported only their combined effect. A data model comprises structures and operations and can be decoupled from a query language. It is therefore theoretically possible to identify the effects of a data model alone (without the query language) and the additional effects that arise when a query language is included. An experiment was conducted to measure these effects. Subjects' query performance with the relational model and SQL was measured at two query stages: query translation and query writing. The experiment confirms findings in the literature about SQL query difficulties (all of which are based on the query writing stage). Exploratory analysis of query difficulties yields some surprises. For example, operations generally perceived to be difficult (such as joins, group count and repeated relations) are not difficult at the query translation stage; that is, the difficulties stem not from the relational model but from SQL. The study illustrates an approach for separating the effects of a data model and a query language, which can be used in future studies of other models and languages.
