Formal Concept Analysis With Background Knowledge: Attribute Priorities

This paper deals with background knowledge in knowledge extraction from binary data. A background knowledge represents an additional piece of information a user may have along with the input data. Such information can be considered as specifying the type of knowledge a user is looking for in the data. In particular, we emphasize the need for taking into account background knowledge in formal concept analysis. We present an approach to modeling background knowledge that represents user's priorities regarding attributes and their relative importance. Such priorities serve as a constraint-only those formal concepts that are compatible with user's priorities are considered relevant, extracted from data, and presented to the user. Our approach has two main practical features. First, the number of formal concepts presented to the user may get significantly reduced. As a result, the user is supplied with relevant formal concepts only and is not overloaded by a large number of possibly nonrelevant formal concepts. Second, different priorities lead to different pieces of knowledge extracted from data. This way, the input data may be repeatedly used in knowledge extraction for different purposes corresponding to different priorities. We concentrate on foundational aspects such as mathematical feasibility, reasoning with background knowledge, removing redundancy from background knowledge, and computational tractability, and present several illustrative examples. In addition, we discuss directions for future research.

[1]  Claudio Carpineto,et al.  Concept data analysis - theory and applications , 2004 .

[2]  Vilem Vychodil,et al.  Reducing the size of fuzzy concept lattices by fuzzy closure operators , 2006 .

[3]  J. W. Lloyd,et al.  Foundations of logic programming; (2nd extended ed.) , 1987 .

[4]  Jean-François Boulicaut,et al.  Constraint-based Data Mining , 2005, Data Mining and Knowledge Discovery Handbook.

[5]  J. Lloyd Foundations of Logic Programming , 1984, Symbolic Computation.

[6]  Ronen Feldman,et al.  The Data Mining and Knowledge Discovery Handbook , 2005 .

[7]  Jean-Claude Falmagne,et al.  Knowledge spaces , 1998 .

[8]  P. Sopp Cluster analysis. , 1996, Veterinary immunology and immunopathology.

[9]  Vilém Vychodil,et al.  Adding background knowledge to formal concept analysis via attribute dependency formulas , 2008, SAC '08.

[10]  Vilém Vychodil,et al.  Semantic Entailment of Attribute-dependency Formulas and Their Non-redundant Bases , 2006, JCIS.

[11]  Gerd Stumme,et al.  Attribute Exploration with Background Implications and Exceptions , 1996 .

[12]  Bernhard Ganter,et al.  Attribute Exploration with Background Knowledge , 1999, Theor. Comput. Sci..

[13]  David Maier,et al.  The Theory of Relational Databases , 1983 .

[14]  Claire Cardie,et al.  Proceedings of the Eighteenth International Conference on Machine Learning, 2001, p. 577–584. Constrained K-means Clustering with Background Knowledge , 2022 .

[15]  Vilém Vychodil,et al.  Reducing the Size of Fuzzy Concept Lattices by Hedges , 2005, The 14th IEEE International Conference on Fuzzy Systems, 2005. FUZZ '05..

[16]  Bernhard Ganter,et al.  Formal Concept Analysis: Mathematical Foundations , 1998 .

[17]  Vincent Duquenne,et al.  Familles minimales d'implications informatives résultant d'un tableau de données binaires , 1986 .

[18]  Radim Belohlávek,et al.  Formal Concept Analysis Constrained by Attribute-Dependency Formulas , 2005, ICFCA.

[19]  S. S. Ravi,et al.  Agglomerative Hierarchical Clustering with Constraints: Theoretical and Empirical Results , 2005, PKDD.

[20]  Vilém Vychodil,et al.  Formal Concept Analysis with Constraints by Closure Operators , 2006, ICCS.