Concepts for Modeling and Querying List-Structured Data

Abstract Traditionally, data models and query languages have provided mechanisms for dealing with sets of objects. Many database applications, however, are list oriented (i.e., deal with collections or aggregates of objects in which ordering is important). This article presents the list-structure data model which has ordering as a fundamental feature. The model is based on atomic, aggregate, and list constructors and thus provides support for tree-structured and sequential representations of data. These constructors can be intermixed and allow the modeling of variable and recursive schemes. Such schemes occur naturally in list-oriented data like tagged text, dictionaries, genealogies, etc. The notion of patterns found in traditional text and list processing applications is generalized to deal with the tree-structured representation and then incorporated in operations for searching, marking, updating, and restructuring list-structure instances. These operations form the core of a query language wherein users can succinctly and naturally formulate complex problems typically encountered in list-oriented database applications.

[1]  Serge Abiteboul,et al.  Non first normal form relations to represent hierarchically organized data , 1984, PODS.

[2]  Serge Abiteboul,et al.  Database theory column , 1989, SIGA.

[3]  Forbes J. Burkowski,et al.  An Algebra for Hierarchically Organized Text-Dominate Databases , 1992, Inf. Process. Manag..

[4]  Heather Fawcett,et al.  The "New Oxford English Dictionary" Project. , 1993 .

[5]  Gaston H. Gonnet,et al.  Mind Your Grammar: a New Approach to Modelling Text , 1987, VLDB.

[6]  Ralf Hartmut Güting,et al.  An algebra for structured office documents , 1989, TOIS.

[7]  Seymour Ginsburg,et al.  Pattern matching by Rs-operations: towards a unified approach to querying sequenced data , 1992, PODS '92.

[8]  P Pistor,et al.  A database language for sets, lists and tables , 1986, Inf. Syst..

[9]  E. F. Codd,et al.  A relational model of data for large shared data banks , 1970, CACM.

[10]  Ian A. Macleod,et al.  Storage and retrieval of structured documents , 1990, Inf. Process. Manag..

[11]  E. F. Codd,et al.  A Relational Model for Large Shared Data Banks , 1970 .

[12]  Frank Wm. Tompa,et al.  Shortening the OED: experience with a grammar-defined database , 1992, TOIS.

[13]  Marc Gyssens,et al.  A grammar-based approach towards unifying hierarchical data models , 1989, SIGMOD '89.

[14]  Patrick C. Fischer,et al.  Nested Relational Structures , 1986, Adv. Comput. Res..

[15]  R. G. G. Cattell,et al.  Object Data Management: Object-Oriented and Extended Relational Database Systems (Revised Edition) , 1991 .

[16]  Jan Van den Bussche,et al.  Concepts for Graph-Oriented Object Manipulation , 1992, EDBT.

[17]  Ian A. Macleod A Query Language for Retrieving Information from Hierarchic Text Structures , 1991, Comput. J..

[18]  Michael A. Shepherd,et al.  A transient hypergraph-based model for data access , 1990, TOIS.

[19]  Roger King,et al.  Semantic database modeling: survey, applications, and research issues , 1987, CSUR.

[20]  Joel E. Richardson,et al.  Supporting Lists in a Data Model (A Timely Approach) , 1992, VLDB.

[21]  Carolyn R. Watters,et al.  A Two-Level Structure for Textual Databases to Support Hypertext Access , 1992, J. Am. Soc. Inf. Sci..

[22]  Hans-Jörg Schek,et al.  Remarks on the algebra of non first normal form relations , 1982, PODS.

[23]  Latha S. Colby A recursive algebra for nested relations , 1990, Inf. Syst..

[24]  Frank Wm. Tompa A data model for flexible hypertext database systems , 1989, TOIS.

[25]  Marc Gyssens,et al.  A grammar-based approach towards unifying hierarchical data models , 1989, SIGMOD '89.

[26]  Fausto Rabitti A Model for Multimedia Documents , 1985 .

[27]  Paris C. Kanellakis,et al.  Database Theory Column: Query Languages for Complex Object Databases. , 1990 .