Handling Semi-Structured Data through an Extended Object-Oriented Data Model

In traditional database applications the structure of data is predefined, and data are entered into the database by specifying the schema element (relation or class, depending on the paradigm) they belong to. Emerging database applications, especially those related to the Web, are characterized by data whose structure is irregular, heterogeneous, partial, and quickly evolving. In this paper we adapt an object-oriented data model to this kind of data by providing it with a more flexible type system and by weakening the conditions for membership of an object in a class. We also propose an approach for classifying objects that are created without specifying their class into the most appropriate class of the schema.
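The two ideas in the abstract, weakened class membership and classification of objects created without a class, can be illustrated with a minimal sketch. The abstract does not state the paper's actual membership conditions or classification measure, so everything below is an assumption made for illustration: the names `ClassDef`, `conformity`, and `classify` are hypothetical, attributes that admit several types stand in for the more flexible type system, and the overlap score with a threshold is a placeholder rule, not the authors' method.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class ClassDef:
    """Hypothetical schema class: each attribute admits one or more types."""
    name: str
    attributes: dict[str, tuple[type, ...]]


def conformity(obj: dict[str, Any], cls: ClassDef) -> float:
    """Assumed score in [0, 1] for how well obj fits cls.

    Counts attributes present in both obj and cls with an admissible type,
    divided by all attribute names seen in either. Missing or extra
    attributes lower the score but do not rule membership out.
    """
    names = set(obj) | set(cls.attributes)
    if not names:
        return 1.0
    matched = sum(
        1
        for name in names
        if name in obj
        and name in cls.attributes
        and isinstance(obj[name], cls.attributes[name])
    )
    return matched / len(names)


def classify(
    obj: dict[str, Any], schema: list[ClassDef], threshold: float = 0.5
) -> Optional[str]:
    """Return the best-fitting class name, or None if no class fits well enough."""
    best = max(schema, key=lambda c: conformity(obj, c), default=None)
    if best is None or conformity(obj, best) < threshold:
        return None
    return best.name


# Illustrative schema; "age" admits both int and str values.
schema = [
    ClassDef("Person", {"name": (str,), "age": (int, str)}),
    ClassDef("Paper", {"title": (str,), "year": (int,), "authors": (list,)}),
]

# An object created without a class and with a partial structure.
partial_paper = {"title": "Semistructured data", "year": 1997}
print(classify(partial_paper, schema))
```

In this sketch an object that lacks `authors` is still placed in `Paper`, while an object sharing no attributes with any class is left unclassified. The threshold value of 0.5 is arbitrary.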
