Behavior-Derived Variability Analysis: Mining Views for Comparison and Evaluation

The large variety of computerized solutions (software and information systems) calls for a systematic approach to their comparison and evaluation. Different methods have been proposed over the years for analyzing the similarity and variability of systems. These methods get artifacts, such as requirements, design models, or code, of different systems (commonly in the same domain), identify and calculate their similarities, and represent the variability in models, such as feature diagrams. Most methods rely on implementation considerations of the input systems and generate outcomes based on predefined, fixed strategies of comparison (referred to as variability views). In this paper, we introduce an approach for mining relevant views for comparison and evaluation, based on the input artifacts. Particularly, we equip SOVA – a Semantic and Ontological Variability Analysis method – with data mining techniques in order to identify relevant views that highlight variability or similarity of the input artifacts (natural language requirement documents). The comparison is done using entropy and Rand index measures. The method and its outcomes are evaluated on a case of three photo sharing applications.

[1]  Klaus Pohl,et al.  Software Product Line Engineering , 2005 .

[2]  Carlo Strapparava,et al.  Corpus-based and Knowledge-based Measures of Text Semantic Similarity , 2006, AAAI.

[3]  Zarinah Mohd Kasirun,et al.  Extracting features from online software reviews to aid requirements reuse , 2016, Appl. Soft Comput..

[4]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[5]  Alexander Egyed,et al.  Reengineering legacy applications into software product lines: a systematic mapping , 2017, Empirical Software Engineering.

[6]  Zhendong Niu,et al.  A Systems Approach to Product Line Requirements Reuse , 2014, IEEE Systems Journal.

[7]  George Karypis,et al.  A Comparison of Document Clustering Techniques , 2000 .

[8]  Mathieu Acher,et al.  Automated extraction of product comparison matrices from informal product descriptions , 2017, J. Syst. Softw..

[9]  D. Steinley Properties of the Hubert-Arabie adjusted Rand index. , 2004, Psychological methods.

[10]  Kyo Chul Kang,et al.  Feature-Oriented Domain Analysis (FODA) Feasibility Study , 1990 .

[11]  Yair Wand,et al.  Variability Analysis of Requirements: Considering Behavioral Differences and Reflecting Stakeholders’ Perspectives , 2016, IEEE Transactions on Software Engineering.

[12]  Mathieu Acher,et al.  Feature model extraction from large collections of informal product descriptions , 2013, ESEC/FSE 2013.

[13]  Simonetta Montemagni,et al.  A Contrastive Approach to Multi-word Extraction from Domain-specific Corpora , 2010, LREC.

[14]  Zarinah Mohd Kasirun,et al.  Feature extraction approaches from natural language requirements for reuse in software product lines: A systematic literature review , 2015, J. Syst. Softw..

[15]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[16]  Jacques Klein,et al.  Bottom-up adoption of software product lines: a generic and extensible approach , 2015, SPLC.

[17]  Camille Roth,et al.  Natural Scales in Geographical Patterns , 2017, Scientific Reports.

[18]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[19]  Yair Wand,et al.  Analyzing Variability of Software Product Lines Using Semantic and Ontological Considerations , 2014, CAiSE.

[20]  Iris Reinhartz-Berger,et al.  Generating feature models from requirements: structural vs. functional perspectives , 2014, SPLC '14.

[21]  Iris Reinhartz-Berger,et al.  SOVA - A Tool for Semantic and Ontological Variability Analysis , 2014, CAiSE.

[22]  Krzysztof Czarnecki,et al.  A survey of variability modeling in industrial practice , 2013, VaMoS.

[23]  Felice Dell'Orletta,et al.  Mining commonalities and variabilities from natural language documents , 2013, SPLC '13.