Implementing SMART for minicomputers via relational processing With abstract data types

Designed during the 1960's as a research tool for the field of information retrieval, the SMART system has been operating on an IBM 370 since 1974. SMART is now being enhanced, redesigned, and programmed under the UNIX operating system [28] on a DEC VAX 11/780. The techniques used should allow real-time operation on smaller minicomputers in the PDP 11 family. The implementation provides for a combination of database and information retrieval operations which make it applicable to office automation, personal information system management, and research studies. The SMART vector space model, which treats information requests and stored information records as vectors in an n-space (of terms), is integrated into the relational database model using the concepts of abstract data types (ADTs). Domains of relations are allowed to be any ADT; an extended relational algebra is described with operators that manipulate many complex data structures. After illustrating the application of these concepts to typical SMART tasks, a prototype implementation is outlined. Also included is a discussion of techniques to be employed in a more efficient version.

[1]  KentWilliam Limitations of record-based information models , 1979 .

[2]  Lawrence A. Rowe,et al.  Data abstraction from a programming language viewpoint , 1981, Workshop on Data Abstraction, Databases and Conceptual Modelling.

[3]  Ken Thompson,et al.  The UNIX time-sharing system , 1974, CACM.

[4]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[5]  Stephen Todd,et al.  The Peterlee Relational Test Vehicle - A System Overview , 1976, IBM Syst. J..

[6]  Edwin H. Satterthwaite,et al.  Early experience with Mesa , 1977, CACM.

[7]  Patricia G. Selinger,et al.  Support for repetitive transactions and ad hoc queries in System R , 1981, TODS.

[8]  John V. Guttag,et al.  Abstract data types and the development of data structures , 1977, CACM.

[9]  Alain Pirotte,et al.  Associating types with domains of relational data bases , 1980, Workshop on Data Abstraction, Databases and Conceptual Modelling.

[10]  Anthony I. Wasserman The Extension of Data Abstraction to Database Management , 1980, Workshop on Data Abstraction, Databases and Conceptual Modelling.

[11]  Donald D. Chamberlin,et al.  Relational database management systems , 1978 .

[12]  Henry F. Ledgard,et al.  Two views of data abstraction , 1977, CACM.

[13]  Narain H. Gehani,et al.  Some ideas on data types in high-level languages , 1976, CACM.

[14]  Jonathan E. Shopiro Theseus—a programming language for relational databeses , 1979, ACM Trans. Database Syst..

[15]  Paolo Paolini Abstract data types and Data Bases , 1981, SIGMOD 1981.

[16]  William Kent,et al.  Limitations of record-based information models , 1979, TODS.

[17]  Ian A. Macleod,et al.  SEQUEL as a Language for Document Retrieval , 2007, J. Am. Soc. Inf. Sci..

[18]  Gerard Salton,et al.  A new comparison between conventional indexing (MEDLARS) and automatic text processing (SMART) , 1972, J. Am. Soc. Inf. Sci..

[19]  Robert T. Dattola FIRST: Flexible Information Retrieval System for Text , 1979, J. Am. Soc. Inf. Sci..

[20]  Craig Schaffert,et al.  Abstraction mechanisms in CLU , 1977, Commun. ACM.

[21]  C. Robert Carlson,et al.  A Comparison of the Relational and CODASYL Approaches to Data-Base Management , 1976, CSUR.

[22]  Diane C. P. Smith,et al.  Database abstractions: aggregation , 1977, CACM.

[23]  Lawrence A. Rowe Issues in the Design of Database Programming Languages , 1980, Workshop on Data Abstraction, Databases and Conceptual Modelling.

[24]  Brian W. Kernighan,et al.  The C Programming Language , 1978 .

[25]  Patrick A. V. Hall,et al.  Approximate String Matching , 1994, Encyclopedia of Algorithms.

[26]  Anthony I. Wasserman The extension of data abstraction to database management , 1981 .

[27]  Gerard Salton,et al.  Dynamic information and library processing , 1975 .

[28]  Gerard Salton,et al.  Automatic Information Organization And Retrieval , 1968 .

[29]  Gerard Salton,et al.  Parallel computations in information retrieval , 1981, CONPAR.

[30]  Evan L. Ivie The programmer's workbench—a machine for software development , 1977, CACM.

[31]  Einar Nodtvedt Information Retrieval in the Business Environment , 1980 .

[32]  Robert G. Crawford The relational model in information retrieval , 1981, J. Am. Soc. Inf. Sci..

[33]  Donald D. Chamberlin,et al.  Relational Data-Base Management Systems , 1976, CSUR.

[34]  John V. Guttag,et al.  Abstract data types and the development of data structures , 1976, Software Pioneers.

[35]  J. Joseph,et al.  POSSIBLE TIME-SHARING ORGANIZATION FOR A SMART RETRIEVAL SYSTEM , .

[36]  Gerard Salton Suggestions for a Uniform Representation of Query and Record Content in Data Base and Document Retrieval , 1979 .

[37]  Heinrich C. Mayr,et al.  Data abstractions for database systems , 1979, TODS.

[38]  Harold Borko,et al.  Indexing concepts and methods , 1978 .

[39]  GriesDavid,et al.  Some ideas on data types in high-level languages , 1977 .

[40]  Joachim W. Schmidt,et al.  Some high-level language constructs for data of type relation , 1977, SIGMOD '77.

[41]  James E. Donahue,et al.  An Informal Description of Russell , 1980 .

[42]  E. F. Codd,et al.  Extending the database relational model to capture more meaning , 1979, ACM Trans. Database Syst..