Determining the Relative Importance of Features for Influencing Software Product Similarity Matching

As a software product line evolves, a significant management challenge is comparing existing products with each other or with planned products. The approach to product comparison varies with its purpose. One solution represents a configured product as a weighted binary string in which 1 denotes a feature’s presence, 0 denotes its absence, and the weight denotes the relative importance that the feature is perceived to have for the product. Relative importance values influence similarity matching so that the features considered important are the ones that primarily determine what is judged to be similar. Product comparison is then supported by a binary string similarity metric, used as a product similarity metric. For a product line that contains thousands of features, allocating relative importance values is practical only when done automatically. This paper proposes a novel algorithm for automatically determining the relative importance of each feature. A product line can be represented as a feature tree in which each feature is a node and each relationship between features is an edge. A feature’s relative importance is calculated as a function of local and global structural measures of the tree. The local measures are the number of input and output nodes to which a feature is connected and the variability property of each of these nodes. The global measure is the distance of the feature from the root node. A worked example based on a mobile phone illustrates the feasibility of the algorithm.
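
The abstract does not give the weighting formula or name the similarity metric, so the following Python sketch is only an illustration of the general idea under stated assumptions, not the paper's algorithm. It assumes a two-valued variability property (mandatory or optional), a hypothetical local score that counts each connected node as 1.0 if mandatory and 0.5 if optional, a hypothetical global damping of 1 / (1 + depth), and a weighted Jaccard coefficient as the similarity metric. The names `Feature`, `relative_importance`, and `weighted_jaccard`, and the small mobile phone tree, are invented for this example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Feature:
    name: str
    mandatory: bool = True  # assumed two-valued variability property
    children: List["Feature"] = field(default_factory=list)


def relative_importance(root: Feature) -> Dict[str, float]:
    """Hypothetical weighting: local score divided by (1 + depth)."""
    weights: Dict[str, float] = {}

    def visit(node: Feature, depth: int, parent: Optional[Feature]) -> None:
        # Local measure: nodes connected to this feature (parent and children),
        # each contributing according to its variability property.
        connected = list(node.children) + ([parent] if parent else [])
        local = sum(1.0 if n.mandatory else 0.5 for n in connected)
        # Global measure: distance from the root damps the local score.
        weights[node.name] = local / (1 + depth)
        for child in node.children:
            visit(child, depth + 1, node)

    visit(root, 0, None)
    return weights


def weighted_jaccard(a: List[int], b: List[int], w: List[float]) -> float:
    """Weighted Jaccard similarity of two binary strings of equal length."""
    both = sum(wi for ai, bi, wi in zip(a, b, w) if ai and bi)
    either = sum(wi for ai, bi, wi in zip(a, b, w) if ai or bi)
    return both / either if either else 1.0


# Invented mobile phone feature tree.
flash = Feature("Flash", mandatory=False)
phone = Feature("Phone", children=[
    Feature("Calls"),
    Feature("Camera", mandatory=False, children=[flash]),
    Feature("GPS", mandatory=False),
])

weights = relative_importance(phone)
order = ["Phone", "Calls", "Camera", "GPS", "Flash"]  # bit positions
w = [weights[name] for name in order]

product_a = [1, 1, 1, 0, 1]  # camera with flash, no GPS
product_b = [1, 1, 1, 1, 0]  # camera and GPS, no flash

similarity = weighted_jaccard(product_a, product_b, w)
unweighted = weighted_jaccard(product_a, product_b, [1.0] * len(order))
```

With these assumed weights, the two products differ only in low-weight features far from the root, so the weighted similarity comes out higher than the unweighted Jaccard value of 0.6. Any other monotone combination of the local and global measures could be substituted in `relative_importance` without changing the comparison step.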
