An Information Retrieval Approach For Automatically Constructing Software Libraries

A technology for automatically assembling large software libraries which promote software reuse by helping the user locate the components closest to her/his needs is described. Software libraries are automatically assembled from a set of unorganized components by using information retrieval techniques. The construction of the library is done in two steps. First, attributes are automatically extracted from natural language documentation by using an indexing scheme based on the notions of lexical affinities and quantity of information. Then a hierarchy for browsing is automatically generated using a clustering technique which draws only on the information provided by the attributes. Due to the free-text indexing scheme, tools following this approach can accept free-style natural language queries. >

[1]  Ian Sommerville,et al.  An information retrieval system for software components , 1988, SIGF.

[2]  Gail E. Kaiser,et al.  Using conceptual clustering for classifying reusable Ada code , 1987, SIGAda '87.

[3]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[4]  Bruce A. Burton,et al.  The Reusable Software Library , 1987, IEEE Software.

[5]  William B. Frakes,et al.  Representing reusable software , 1990 .

[6]  Susan P. Arnold,et al.  The Reuse System: Cataloging and Retrieval of Reusable Software , 1988, IEEE Computer Society International Conference.

[7]  Robert B. Ash,et al.  Information Theory , 2020, The SAGE International Encyclopedia of Mass Media and Society.

[8]  Gerard Salton,et al.  On the application of syntactic methodologies in automatic text analysis , 1990, Inf. Process. Manag..

[9]  William B. Frakes,et al.  Panel Session: Information Retrieval and Software Reuse , 1999 .

[10]  Frank Srnadja Lexical Co-occurrence: The Missing Link , 1989 .

[11]  Rolf Adams,et al.  NLH/E: a natural language help system , 1989, ICSE '89.

[12]  Karen Spärck Jones,et al.  Automatic Search Term variant Generation , 1984, J. Documentation.

[13]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[14]  G. N. Lance,et al.  A General Theory of Classificatory Sorting Strategies: 1. Hierarchical Systems , 1967, Comput. J..

[15]  Gerard Salton,et al.  On the application of syntactic methodologies in automatic text analysis , 1989, SIGIR '89.

[16]  M. E. Maron,et al.  An evaluation of retrieval effectiveness for a full-text document-retrieval system , 1985, CACM.

[17]  Yoelle Maarek,et al.  Full text indexing based on lexical relations an application: software libraries , 1989, SIGIR '89.

[18]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[19]  Paul H. Klingbiel Machine-aided indexing of technical literature , 1973, Inf. Storage Retr..

[20]  Simon M. Kaplan,et al.  Incremental Maintenance of Semantic Links in Dynamically Changing Hypertext Systems , 1990, Interact. Comput..

[21]  Masahito Hirakawa,et al.  ARES: A relational database with the capability of performing flexible interpretation of queries , 1986, IEEE Transactions on Software Engineering.

[22]  Ryszard S. Michalski,et al.  Automated Construction of Classifications: Conceptual Clustering Versus Numerical Taxonomy , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[24]  B. Allen,et al.  A Knowledge-based Environment For The Development Of Software Parts Composition Systems , 1989, 11th International Conference on Software Engineering.

[25]  Peter Willett,et al.  Hierarchic Agglomerative Clustering Methods for Automatic Document Classification , 1984, J. Documentation.

[26]  Yoelle Maarek,et al.  An incremental conceptual clustering algorithm that reduces input-ordering bias , 1990 .

[27]  Fazli Can,et al.  A clustering scheme , 1983, SIGIR '83.

[28]  Peter Freeman,et al.  Classifying Software for Reusability , 1987, IEEE Software.

[29]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[30]  William B. Frakes,et al.  Software reuse through information retrieval , 1986, SIGF.

[31]  R. W. Schwanke,et al.  Discovering, visualizing, and controlling software structure , 1989, IWSSD '89.

[32]  C. J. van Rijsbergen,et al.  The use of hierarchic clustering in information retrieval , 1971, Inf. Storage Retr..

[33]  Premkumar T. Devanbu,et al.  A Knowledge-Based Software Information System , 1989, IJCAI.