Shortening retrieval sequences in browsing-based component retrieval using information entropy

Reuse repositories are an essential element in component-based software development (CBSD). Querying-based retrieval and browsing-based retrieval are two main retrieval mechanisms provided in real world reuse repositories, especially web-based repositories. Although browsing-based retrieval is superior to querying-based retrieval in some aspects, the tedious retrieval process is its main drawback, because the browsing-based component retrieval usually involves long retrieval sequences. In this paper, we propose a novel approach to shorten the retrieval sequences in browsing-based component retrieval using information entropy. The basic idea of our approach is to build a navigation model by ranking the features into a tree structure using the components' indexing information. According to our experimental results on real data, our approach can effectively shorten the average length of retrieval sequences.

[1]  Roland Mittermeir,et al.  Storing and retrieving software components: a refinement based system , 1994, ICSE '94.

[2]  Premkumar T. Devanbu,et al.  LaSSIE—a knowledge-based software information system , 1991, ICSE '90.

[3]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[4]  Robert J. Kauffman,et al.  Supporting Search for Reusable Software Objects , 1996, IEEE Trans. Software Eng..

[5]  Dan Ionescu,et al.  A Learning Agent that Assists the Browsing of Software Libraries , 2000, IEEE Trans. Software Eng..

[6]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[7]  Robert C. Seacord,et al.  AGORA: A Search Engine for Software Components , 1998, IEEE Internet Comput..

[8]  Rubén Prieto-Díaz Implementing faceted classification for software reuse , 1991, CACM.

[9]  Ruben Prieto-Diaz A software classification scheme (reusability, libraries, development) , 1985 .

[10]  Rubén Prieto Díaz,et al.  A software classification scheme , 1985 .

[11]  Susan T. Dumais,et al.  The vocabulary problem in human-system communication , 1987, CACM.

[12]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[13]  Luqi,et al.  A survey of software reuse repositories , 2000, Proceedings Seventh IEEE International Conference and Workshop on the Engineering of Computer-Based Systems (ECBS 2000).

[14]  Yunwen Ye Programming with an Intelligent Agent , 2003, IEEE Intell. Syst..

[15]  Melvil Dewey Decimal classification and relative index , 1942 .

[16]  Gerhard Fischer,et al.  Context-aware browsing of large component repositories , 2001, Proceedings 16th Annual International Conference on Automated Software Engineering (ASE 2001).

[17]  Gabriel Pinski,et al.  Citation influence for journal aggregates of scientific publications: Theory, with application to the literature of physics , 1976, Inf. Process. Manag..

[18]  Letha H. Etzkorn,et al.  Towards a semantic-based approach for software reusable component classification and retrieval , 2004, ACM-SE 42.

[19]  Zheying Zhang Enhancing Component Reuse Using Search Techniques , 2000 .

[20]  William B. Frakes,et al.  An Empirical Study of Representation Methods for Reusable Software Components , 1994, IEEE Trans. Software Eng..

[21]  Morten Hertzum,et al.  Browsing and querying in online documentation: a study of user interfaces and the interaction process , 1996, TCHI.

[22]  James A. Hendler,et al.  Computing similarity in a reuse library system: an AI-based approach , 1992, TSEM.

[23]  Hamid Mcheick,et al.  An experiment in software component retrieval , 2003, Inf. Softw. Technol..

[24]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[25]  Ali Mili,et al.  Toward an Engineering Discipline of Software Reuse , 1999, IEEE Softw..

[26]  J. Maxwell,et al.  Theory of Heat , 1892 .

[27]  Shinji Kusumoto,et al.  Component rank: relative significance rank for software component search , 2003, 25th International Conference on Software Engineering, 2003. Proceedings..

[28]  Alfréd Rényi,et al.  Probability Theory , 1970 .

[29]  Christopher Olston,et al.  ScentTrails: Integrating browsing and searching on the Web , 2003, TCHI.

[30]  Jeannette M. Wing,et al.  Specification matching of software components , 1997 .

[31]  William B. Frakes,et al.  Proteus: a software reuse library system , 1990, SIGF.

[32]  Yunwen Ye,et al.  An Active and Intelligent Agent for Component Location , 2000 .

[33]  William B. Frakes,et al.  Representing reusable software , 1990 .

[34]  Jeff Conklin,et al.  Hypertext: An Introduction and Survey , 1987, Computer.

[35]  Ragnhild Van Der Straeten,et al.  Supporting evolution in component-based development using component libraries , 2003, Seventh European Conference onSoftware Maintenance and Reengineering, 2003. Proceedings..

[36]  R. Tolman,et al.  The Principles of Statistical Mechanics. By R. C. Tolman. Pp. xix, 661. 40s. 1938. International series of monographs on physics. (Oxford) , 1939, The Mathematical Gazette.

[37]  Huilin Ye,et al.  Towards a self-structuring software library , 2001, IEE Proc. Softw..

[38]  Jeannette M. Wing,et al.  Signature matching: a tool for using software libraries , 1995, TSEM.

[39]  Gerhard Fischer,et al.  Supporting reuse by delivering task-relevant and personalized information , 2002, ICSE '02.

[40]  Xudong He,et al.  A formal approach for component retrieval and integration analysis , 2000 .

[41]  Bruce Weide,et al.  Component-based software using RESOLVE , 1994, SOEN.

[42]  Scott Henninger,et al.  Using iterative refinement to find reusable software , 1994, IEEE Software.

[43]  Mei Hong,et al.  An overview of JB (Jade Bird) component library system JBCL , 1997, Proceedings. Technology of Object-Oriented Languages. TOOLS 24 (Cat. No.97TB100240).

[44]  Bernd Fischer Specification-Based Browsing of Software Component Libraries , 2004, Automated Software Engineering.

[45]  Ge Li,et al.  Attribute Ranking: An Entropy-Based Approach to Accelerating Browsing-Based Component Retrieval , 2004, ICSR.

[46]  Ali Mili,et al.  Reusing Software: Issues and Research Directions , 1995, IEEE Trans. Software Eng..

[47]  Vijayan Sugumaran,et al.  A semantic-based approach to component retrieval , 2003, DATB.