Bayesian Approaches to Matching Architectural Diagrams

IT system architectures and many other kinds of structured artifacts are often described by formal models or informal diagrams. In practice, there are often a number of versions of a model or diagram, such as a series of revisions, divergent variants, or multiple views of a system. Understanding how versions correspond or differ is crucial, and thus, automated assistance for matching models and diagrams is essential. We have designed a framework for finding these correspondences automatically based on Bayesian methods. We represent models and diagrams as graphs whose nodes have attributes such as name, type, connections to other nodes, and containment relations, and we have developed probabilistic models for rating the quality of candidate correspondences based on various features of the nodes in the graphs. Given the probabilistic models, we can find high-quality correspondences using search algorithms. Preliminary experiments focusing on architectural models suggest that the technique is promising.

[1]  Philippe Krutchen,et al.  Architectural blueprints--the '4+1' view model of software architecture , 1995 .

[2]  Pedro M. Domingos,et al.  Reconciling schemas of disparate data sources: a machine-learning approach , 2001, SIGMOD '01.

[3]  Rina Dechter,et al.  Generalized best-first search strategies and the optimality of A* , 1985, JACM.

[4]  Hyoseob Kim,et al.  Supporting the reconciliation of models of object behaviour , 2003, Software and Systems Modeling.

[5]  Jean Bézivin,et al.  Model Integration with Model Weaving: a Case Study in System Architecture , 2007, 2007 International Conference on Systems Engineering and Modeling.

[6]  D. Redmond-Pyle,et al.  A Standard for Architecture Description , 1999, IBM Syst. J..

[7]  Steven H. Tang,et al.  Architectural thinking and modeling with the Architects' Workbench , 2006, IBM Syst. J..

[8]  Alon Y. Halevy,et al.  Why Your Data Won’t Mix , 2005, ACM Queue.

[9]  Udo Kelter,et al.  Difference tools for analysis and design documents , 2003, International Conference on Software Maintenance, 2003. ICSM 2003. Proceedings..

[10]  Jeff Garland,et al.  Large-Scale Software Architecture , 2002 .

[11]  Erhard Rahm,et al.  Generic Schema Matching with Cupid , 2001, VLDB.

[12]  Eleni Stroulia,et al.  UMLDiff: an algorithm for object-oriented design differencing , 2005, ASE.

[13]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[14]  Erhard Rahm,et al.  Similarity flooding: a versatile graph matching algorithm and its application to schema matching , 2002, Proceedings 18th International Conference on Data Engineering.

[15]  Robert E. Tarjan,et al.  Data structures and network algorithms , 1983, CBMS-NSF regional conference series in applied mathematics.

[16]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[17]  P. G. Moore,et al.  The estimation of the mean of a censored normal distribution by ordered variables , 1956 .

[18]  Daniel M. Yellin,et al.  A Bayesian approach to diagram matching with application to architectural models , 2006, ICSE '06.

[19]  Kewei Tu,et al.  CMC: Combining Multiple Schema-Matching Strategies Based on Credibility Prediction , 2005, DASFAA.

[20]  Geoff Hulten,et al.  Learning at Low False Positive Rates , 2006, CEAS.

[21]  Christoph Treude,et al.  Difference computation of large models , 2007, ESEC-FSE '07.

[22]  Giuliano Antoniol,et al.  Maintaining traceability links during object‐oriented software evolution , 2001, Softw. Pract. Exp..

[23]  Eoin Woods,et al.  Software Systems Architecture: Working with Stakeholders Using Viewpoints and Perspectives , 2005 .

[24]  Peter Bernus,et al.  Handbook on Architectures of Information Systems , 1999 .

[25]  Jeffrey G. Gray,et al.  DSMDiff: a differentiation tool for domain-specific models , 2007 .

[26]  René Witte,et al.  Traceability in Software Engineering – Past, Present and Future , 2007 .

[27]  Philippe Kruchten,et al.  The 4+1 View Model of Architecture , 1995, IEEE Softw..

[28]  Antonio Cicchetti,et al.  A Metamodel Independent Approach to Difference Representation , 2007, J. Object Technol..

[29]  Silvana Castano,et al.  Semantic integration of semistructured and structured data sources , 1999, SGMD.

[30]  Philip A. Bernstein,et al.  Applying Model Management to Classical Meta Data Problems , 2003, CIDR.

[31]  M. Tribus,et al.  Probability theory: the logic of science , 2003 .

[32]  Matthias Jarke,et al.  Toward Reference Models of Requirements Traceability , 2001, IEEE Trans. Software Eng..

[33]  Kenneth M. Anderson,et al.  A Framework for Mapping Traceability Relationships , 2003 .