Data Mining, Reasoning and Incremental Information Retrieval through Non Enlargeable Rectangular Relation Coverage

Association rules extraction from a binary relation as well as reasoning and information retrieval are generally based on the initial representation of the binary relation as an adjacency matrix. This presents some inconvenience in terms of space memory and knowledge organization. A coverage of a binary relation by a minimal number of non enlargeable rectangles generally reduces memory space consumption without any loss of information. It also has the advantage of organizing objects and attributes contained in the binary relation into a conceptual representation. In this paper, we propose new algorithms to extract association rules (i.e. data mining), conclusions from initial attributes (i.e. reasoning), as well as retrieving the total objects satisfying some initial attributes, by using only the minimal coverage. Finally we propose an incremental approximate algorithm to update a binary relation organized as a set of non enlargeable rectangles. Two main operations are mostly used during the organization process: First, separation of existing rectangles when we delete some pairs. Second, join of rectangles when common properties are discovered, after addition or removal of elements from a binary context. The objective is the minimization of the number of rectangles and the maximization of their structure. The article also raises the problems of equational modeling of the minimization criteria, as well as incrementally providing equations to maintain them.

[1]  Bernhard Ganter,et al.  Formal Concept Analysis, 6th International Conference, ICFCA 2008, Montreal, Canada, February 25-28, 2008, Proceedings , 2008, International Conference on Formal Concept Analysis.

[2]  Gunther Schmidt,et al.  Relations and Graphs: Discrete Mathematics for Computer Scientists , 1993 .

[3]  Claudio Carpineto,et al.  Exploiting the Potential of Concept Lattices for Information Retrieval with CREDO , 2004, J. Univers. Comput. Sci..

[4]  Samir Elloumi,et al.  Galois connection, formal concepts and Galois lattice in real relations: application in a real classifier , 2002, J. Syst. Softw..

[5]  Gerd Stumme,et al.  Formal Concept Analysis , 2009, Handbook on Ontologies.

[6]  Gerd Stumme,et al.  Formal Concept Analysis: foundations and applications , 2005 .

[7]  Ali Jaoua,et al.  Using difunctional relations in information organization , 2000, Inf. Sci..

[8]  Vilém Vychodil,et al.  Discovery of optimal factors in binary data via a novel method of matrix decomposition , 2010, J. Comput. Syst. Sci..

[9]  J. Riguet,et al.  Relations binaires, fermetures, correspondances de Galois , 1948 .

[10]  Georg Struth,et al.  Relational and Kleene-Algebraic Methods in Computer Science , 2003, Lecture Notes in Computer Science.

[11]  Michael Winter Decomposing Relations into Orderings , 2003, RelMiCS.

[12]  M. Sarfraz Computer-Aided Intelligent Recognition Techniques and Applications , 2005 .

[13]  Robert Godin,et al.  Design of a browsing interface for information retrieval , 1989, SIGIR '89.

[14]  Ali Jaoua,et al.  Décomposition Rectangulaire Optimale D’une Relation Binaire: Application Aux Bases De Données Documentaires , 1994 .