Optimizing queries on files

We present a framework that lets users access and manipulate data uniformly, whether it resides in a database, in the file system, or in both. A key issue is system performance. We show that text indexing, combined with newly developed optimization techniques, can provide an efficient high-level interface to information stored in files. Using these techniques, some queries can be evaluated significantly faster than in standard database implementations. We also study the tradeoff between efficiency and the amount of indexing.
