Contributions to the Theory of Rough Sets

We study properties of rough sets, that is, approximations to sets of records in a database or, more formally, to subsets of the universe of an information system. A rough set is a pair 〈L, U〉 such that L, U are definable in the information system and L ⊆ U. In the paper, we introduce a language, called the language of inclusion-exclusion, to describe incomplete specifications of (unknown) sets. We use rough sets in order to define a semantics for theories in the inclusion-exclusion language. We argue that our concept of a rough set is closely related to that introduced by Pawlak. We show that rough sets can be ordered by the knowledge ordering (denoted $$\preceq$$ kn)- We prove that Pawlak's rough sets are characterized as $$\preceq$$ kn-greatest approximations. We show that for any consistent (that is, satisfiable) theory T in the language of inclusion-exclusion there exists a $$\preceq$$ kn-greatest rough set approximating all sets X that satisfy T. For some classes of theories in the language of inclusion-exclusion, we provide algorithmic ways to find this best approximation. We also state a number of miscellaneous results and discuss some open problems. (This is an extended version of the first part of the presentation made by the authors at the RSCTC98, Rough Sets and Current Trends in Computing, an international meeting held in Warsaw, Poland, in June 1998. Address for correspondence: Department of Computer Science, University of Kentucky, Lexington, KY 40506-0046.)

[1]  J. Kacprzyk,et al.  Incomplete Information: Rough Set Analysis , 1997 .

[2]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[3]  Yiyu Yao,et al.  Rough Sets and Current Trends in Computing , 2001, Lecture Notes in Computer Science.

[4]  Victor W. Marek,et al.  Myths about rough set theory , 1998, CACM.

[5]  S. C. Kleene,et al.  Introduction to Metamathematics , 1952 .

[6]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[7]  Zdzisław Pawlak,et al.  Rough sets and information systems , 1988 .

[8]  Matthew L. Ginsberg,et al.  Multivalued logics: a uniform approach to reasoning in artificial intelligence , 1988, Comput. Intell..

[9]  Sinh Hoa Nguyen,et al.  On Finding Optimal Discretizations for Two Attributes , 1998, Rough Sets and Current Trends in Computing.

[10]  A. Tarski A LATTICE-THEORETICAL FIXPOINT THEOREM AND ITS APPLICATIONS , 1955 .

[11]  R. Słowiński Intelligent Decision Support: Handbook of Applications and Advances of the Rough Sets Theory , 1992 .

[12]  Jeffrey D. Uuman Principles of database and knowledge- base systems , 1989 .

[13]  Tom Michael Mitchell Version spaces: an approach to concept learning. , 1979 .

[14]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[15]  Wojciech Buszkowski,et al.  Approximation Spaces and Definability for Incomplete Information Systems , 1998, Rough Sets and Current Trends in Computing.

[16]  Andrzej Skowron,et al.  The Discernibility Matrices and Functions in Information Systems , 1992, Intelligent Decision Support.

[17]  Victor W. Marek,et al.  Information Storage and Retrieval Systems: Mathematical Foundations , 1976, Theor. Comput. Sci..

[18]  Andrzej Skowron,et al.  Rough-Fuzzy Hybridization: A New Trend in Decision Making , 1999 .

[19]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[20]  C. J. V. Rijsbergen,et al.  Rough Sets, Fuzzy Sets and Knowledge Discovery , 1994, Workshops in Computing.