Towards a self-structuring software library

Software storage structuring and retrieval remain a major challenge to the widespread adoption of software reuse. An approach that can facilitate the automatic structuring of software components libraries is presented here. Based on the automatic indexing and the self-organising map (SOM) technologies, key features associated with software components can be identified and organised in a very simple way that makes their distance relations geographically explicit on the two-dimensional output layer of the SOM. The proposed approach was applied to a collection of UNIX commands to evaluate its retrival effectiveness. Preliminary results were encouraging and showed improvement on both recall and precision, with substantial reduction in the amount of human effort required in the process.

[1]  Gerard Salton,et al.  Automatic Information Organization And Retrieval , 1968 .

[2]  Gail E. Kaiser,et al.  An Information Retrieval Approach For Automatically Constructing Software Libraries , 1991, IEEE Trans. Software Eng..

[3]  William B. Frakes,et al.  Representing reusable software , 1990 .

[4]  Hinrich Schütze,et al.  A Cooccurrence-Based Thesaurus and Two Applications to Information Retrieval , 1994, Inf. Process. Manag..

[5]  Peter Freeman,et al.  Classifying Software for Reusability , 1987, IEEE Software.

[6]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[7]  Gary Marchionini,et al.  A self-organizing semantic map for information retrieval , 1991, SIGIR '91.

[8]  Robert J. Kauffman,et al.  Supporting Search for Reusable Software Objects , 1996, IEEE Trans. Software Eng..

[9]  Jay F. Nunamaker,et al.  A graphical, self-organizing approach to classifying electronic meeting output , 1997 .

[10]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[11]  Roland Mittermeir,et al.  A survey of software reuse libraries , 1998, Ann. Softw. Eng..

[12]  Constance Palmer,et al.  A CAMP update , 1989 .

[13]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[14]  B. Allen,et al.  A Knowledge-based Environment For The Development Of Software Parts Composition Systems , 1989, 11th International Conference on Software Engineering.

[15]  Joel L. Fagan,et al.  The effectiveness of a nonsyntactic approach to automatic phrase indexing for document retrieval , 1989, JASIS.

[16]  Gerti Kappel,et al.  Learning the semantic similarity of reusable software components , 1994, Proceedings of 1994 3rd International Conference on Software Reuse.

[17]  Timo Honkela,et al.  Newsgroup Exploration with WEBSOM Method and Browsing Interface , 1996 .