The integration of information retrieval techniques within a software reuse environment

This paper describes the development of an information retrieval (IR) model for the indexing, storage and retrieval of documents created in extensible mark-up language (XML). The application area is the software reuse environment, which involves a broader class of documents than can be processed by conventional IR systems. This includes design and analysis documents in unified modelling language (UML) notation, as well as textual format, source code and textual and source code component interface definitions. XML was selected because it is emerging as the key standard for the representation of structured documents on the World Wide Web (WWW) and incorporates methods for the representation of metadata. A model is described that is easily customisable, since it is based upon an extensible object-oriented framework. This allows the development of an IR architecture that can easily be adapted to cope with the proliferation of XML document type definitions (DTDs) that is likely to be a characteristic of the WWW in the near future.

[1]  Martin L. Griss,et al.  Software Reuse: From Library to Factory , 1993, IBM Syst. J..

[2]  Justin Zobel,et al.  Passage retrieval revisited , 1997, SIGIR '97.

[3]  Charles W. Krueger,et al.  Software reuse , 1992, CSUR.

[4]  Ross Wilkinson,et al.  Effective retrieval of structured documents , 1994, SIGIR '94.

[5]  Kam-Fai Wong,et al.  KPS: a Web Information Mining Algorithm , 1999, Comput. Networks.

[6]  Thomas H. Davenport,et al.  Information Ecology: Mastering the Information and Knowledge Environment , 1997 .

[7]  Norbert Fuhr Towards Data Abstraction in Networked Information Retrieval Systems , 1999, Inf. Process. Manag..

[8]  Ivar Jacobson,et al.  Software Reuse: Architecture, Process And Organization For Business Success , 1998, Proceedings. Technology of Object-Oriented Languages. TOOLS 26 (Cat. No.98EX176).

[9]  Thomas H. Davenport,et al.  Process Innovation: Reengineering Work Through Information Technology , 1992 .

[10]  Reidar Conradi,et al.  The REBOOT approach to software reuse , 1995, J. Syst. Softw..

[11]  Klemens Böhm,et al.  Building a Configurable Database Application for Structured Documents , 1995 .

[12]  Chen Xiaoqun,et al.  Supporting project-centered reuse in object-oriented software development , 1997, Proceedings. Technology of Object-Oriented Languages. TOOLS 24 (Cat. No.97TB100240).

[13]  Ophir Frieder,et al.  Integrating Structured Data and Text: A Relational Approach , 1997, J. Am. Soc. Inf. Sci..

[14]  James Solderitsch,et al.  DAGAR: a process for domain architecture definition and asset implementation , 1996, TRI-Ada '96.

[15]  David C. Gross,et al.  Managing Domain-Specific, Product-Line Development , 1996, IEEE Softw..

[16]  Ophir Frieder,et al.  A parallel relational database management system approach to relevance feedback in information retrieval , 1999 .

[17]  Margaret J. Davis,et al.  Reuse Strategy Model: Planning Aid for Reuse-Based Projects , 1993 .

[18]  Gabriele Sonnenberger,et al.  Exploiting the Functionality of Object-Oriented Database Management Systems for Information Retrieval , 1996, IEEE Data Eng. Bull..

[19]  Hans-Peter Frei Information Retrieval-from Academic Research to Practical Applications , 1996 .

[20]  Géza Haidegger,et al.  Application of software reuse and object-oriented methodologies for the modelling and control of manufacturing systems , 1999 .

[21]  Ivar Jacobson,et al.  Object-Oriented Software Engineering , 1991, TOOLS.

[22]  Gerald W. Both,et al.  Object-oriented analysis and design with applications , 1994 .

[23]  William E. Lorensen,et al.  Object-Oriented Modeling and Design , 1991, TOOLS.

[24]  Hans-Peter Frei,et al.  Design of reusable IR framework , 1995, SIGIR '95.

[25]  Irene Díaz,et al.  ALGORITMO DE FILTRADO MULTI-TÉRMINO PARA LA OBTENCIÓN DE RELACIONES JERÁRQUICAS EN LA CONSTRUCCIÓN AUTOMÁTICA DE UN TESAURO , 1999 .

[26]  Even-André Karlsson,et al.  Software reuse: a holistic approach , 1995 .

[27]  Irene Díaz,et al.  Semi-automatic construction of Thesaurus applying Domain Analysis techniques , 1998 .

[28]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[29]  Hans-Erik Eriksson,et al.  UML toolkit , 1997 .

[30]  John M. Wilson,et al.  Business Processes: Modelling and Analysis for Re-engineering and Improvement , 1995 .