Factorizing Three-Way Ordinal Data Using Triadic Formal Concepts

The paper presents a new approach to factor analysis of three-way ordinal data, i.e., data described by a three-dimensional matrix I with values in an ordered scale. The matrix describes a relationship between objects, attributes, and conditions. The problem is to find factors for I, that is, to decompose I into three matrices, an object-factor matrix A, an attribute-factor matrix B, and a condition-factor matrix C, with the number of factors as small as possible. The approach differs from existing decomposition-based methods for three-way data in two respects: the composition operator, and the constraint that A, B, and C take values in an ordered scale. We prove that optimal decompositions are achieved by using triadic concepts of I, a notion developed within formal concept analysis, and provide results on natural transformations between the space of attributes and conditions and the space of factors. We present an illustrative example demonstrating the usefulness of finding factors, and a greedy algorithm for computing decompositions.
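
The abstract does not spell out the composition operator, so the following is only a minimal sketch under a stated assumption: that I is recovered from A, B, and C by a max-min composition, where entry (i, j, t) is the maximum over factors l of min(A[i][l], B[j][l], C[t][l]). The function name `compose`, the parameter `tnorm`, the three-element scale {0, 0.5, 1}, and the example matrices are hypothetical and chosen for illustration; the paper's actual operator may be more general.

```python
def compose(A, B, C, tnorm=min):
    """Triadic composition of factor matrices (assumed max-tnorm form).

    A: objects x factors, B: attributes x factors, C: conditions x factors.
    Returns I with I[i][j][t] = max over l of tnorm(tnorm(A[i][l], B[j][l]), C[t][l]).
    """
    k = len(A[0])  # number of factors (columns shared by A, B, and C)
    return [
        [
            [
                max(tnorm(tnorm(a_row[l], b_row[l]), c_row[l]) for l in range(k))
                for c_row in C
            ]
            for b_row in B
        ]
        for a_row in A
    ]


# Hypothetical data on the ordered scale {0, 0.5, 1} with two factors.
A = [[1, 0], [0.5, 1]]   # degree to which each object has each factor
B = [[1, 0.5], [0, 1]]   # degree to which each attribute belongs to each factor
C = [[1, 0], [0.5, 1]]   # degree to which each condition belongs to each factor

I_hat = compose(A, B, C)  # 2 x 2 x 2 three-way matrix
for i, object_slice in enumerate(I_hat):
    print(f"object {i}: {object_slice}")
```

Passing a different binary function as `tnorm` (for example, the product) changes the composition without touching the rest of the sketch, which is why the operator is kept as a parameter.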
