Two Bayesian approaches to rough sets

Bayesian inference and probabilistic rough sets (PRSs) provide two methods for data analysis. Both of them use probabilities to express uncertainties and knowledge in data and to make inference about data. Many proposals have been made to combine Bayesian inference and rough sets. The main objective of this paper is to present a unified framework that enables us (a) to review and classify Bayesian approaches to rough sets, (b) to give proper perspectives of existing studies, and (c) to examine basic ingredients and fundamental issues of Bayesian approaches to rough sets. By reviewing existing studies, we identify two classes of Bayesian approaches to PRSsand three fundamental issues. One class is interpreted as Bayesian classification rough sets, which is built from decision-theoretic rough set (DTRS) models proposed by Yao, Wong and Lingras. The other class is interpreted as Bayesian confirmation rough sets, which is built from parameterized rough set models proposed by Greco, Matarazzo and Slowinski. Although the two classes share many similarities in terms of making use of Bayes’ theorem and a pair of thresholds to produce three regions, their semantic interpretations and, hence, intended applications are different. The three fundamental issues are the computation and interpretation of thresholds, the estimation of required conditional probabilities, and the application of derived three regions. DTRS models provide an interpretation and a method for computing a pair of thresholds according to Bayesian decision theory. Naive Bayesian rough set models give a practical technique for estimating probability based on Bayes’ theorem and inference. Finally, a theory of three-way decisions offers a tool for building ternary classifiers. The main contribution of the paper lies in weaving together existing results into a coherent study of Bayesian approaches to rough sets, rather than introducing new specific results.

[1]  Yiyu Yao,et al.  Decision-Theoretic Rough Set Models , 2007, RSKT.

[2]  Salvatore Greco,et al.  Mining Pareto-optimal rules with respect to support and confirmation or support and anti-support , 2007, Eng. Appl. Artif. Intell..

[3]  Rob Goudey,et al.  Do statistical inferences allowing three alternative decisions give better feedback for environmentally precautionary decision-making? , 2007, Journal of environmental management.

[4]  Z. Pawlak,et al.  Rough membership functions , 1994 .

[5]  Yiyu Yao,et al.  Probabilistic rough set approximations , 2008, Int. J. Approx. Reason..

[6]  Jerzy W. Grzymala-Busse,et al.  Knowledge acquisition under uncertainty — a rough set approach , 1988, J. Intell. Robotic Syst..

[7]  A. Wald Sequential Tests of Statistical Hypotheses , 1945 .

[8]  Shusaku Tsumoto,et al.  Accuracy and Coverage in Rough Set Rule Induction , 2002, Rough Sets and Current Trends in Computing.

[9]  Yiyu Yao,et al.  A Note on Definability and Approximations , 2007, Trans. Rough Sets.

[10]  Zdzislaw Pawlak,et al.  Decision Rules, Bayes' Rule and Ruogh Sets , 1999, RSFDGrC.

[11]  Zhongsheng Hua,et al.  Identifying future defaulters: A hierarchical Bayesian method , 2015, Eur. J. Oper. Res..

[12]  Yiyu Yao,et al.  Probabilistic approaches to rough sets , 2003, Expert Syst. J. Knowl. Eng..

[13]  Salvatore Greco,et al.  Parameterized rough set model using rough membership and Bayesian confirmation measures , 2008, Int. J. Approx. Reason..

[14]  Philip W. Woodward,et al.  An application of Bayesian methods in SPC , 1993 .

[15]  Yiyu Yao,et al.  Cost-sensitive three-way email spam filtering , 2013, Journal of Intelligent Information Systems.

[16]  W Ziarko,et al.  INFER: an adaptative decision support system based on the probabilistic approximate classification , 1987 .

[17]  T. E. M. Atteya Rough multiple objective programming , 2016, Eur. J. Oper. Res..

[18]  Salvatore Greco,et al.  Rough Membership and Bayesian Confirmation Measures for Parameterized Rough Sets , 2005, RSFDGrC.

[19]  Zdzisław Pawlak,et al.  Can Bayesian confirmation measures be useful for rough set decision rules? , 2004, Eng. Appl. Artif. Intell..

[20]  S. K. Michael Wong,et al.  Rough Sets: Probabilistic versus Deterministic Approach , 1988, Int. J. Man Mach. Stud..

[21]  Yiyu Yao,et al.  Comparison of Two Models of Probabilistic Rough Sets , 2013, RSKT.

[22]  Jerzy W. Grzymala-Busse,et al.  An Empirical Comparison of Rule Sets Induced by LERS and Probabilistic Rough Classification , 2010, RSCTC.

[23]  Yiyu Yao,et al.  Three-way decisions with probabilistic rough sets , 2010, Inf. Sci..

[24]  Dominik Slezak,et al.  The investigation of the Bayesian rough set model , 2005, Int. J. Approx. Reason..

[25]  Yiyu Yao,et al.  Three-Way Decision: An Interpretation of Rules in Rough Set Theory , 2009, RSKT.

[26]  Piotr Synak,et al.  Brighthouse: an analytic data warehouse for ad-hoc queries , 2008, Proc. VLDB Endow..

[27]  Yiyu Yao,et al.  A Decision Theoretic Framework for Approximating Concepts , 1992, Int. J. Man Mach. Stud..

[28]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[29]  Pedro M. Domingos,et al.  Beyond Independence: Conditions for the Optimality of the Simple Bayesian Classifier , 1996, ICML.

[30]  Gary Robinson,et al.  A statistical approach to the spam problem , 2003 .

[31]  Yee Leung,et al.  Knowledge acquisition in incomplete information systems: A rough set approach , 2006, Eur. J. Oper. Res..

[32]  Wojciech Ziarko Set Approximation Quality Measures in the Variable Precision Rough Set Model , 2002, HIS.

[33]  Wojciech Kotlowski,et al.  On Nonparametric Ordinal Classification with Monotonicity Constraints , 2013 .

[34]  Daniel Steel,et al.  Bayesian Confirmation Theory and The Likelihood Principle , 2007, Synthese.

[35]  Salvatore Greco,et al.  Bayesian Confirmation Measures within Rough Set Approach , 2004, Rough Sets and Current Trends in Computing.

[36]  C. Allen,et al.  Stanford Encyclopedia of Philosophy , 2011 .

[37]  J. Kassirer,et al.  The threshold approach to clinical decision making. , 1980, The New England journal of medicine.

[38]  Wojciech Kotlowski,et al.  Stochastic dominance-based rough set model for ordinal classification , 2008, Inf. Sci..

[39]  Branden Fitelson,et al.  STUDIES IN BAYESIAN CONFIRMATION THEORY , 2001 .

[40]  Alfons Oude Lansink,et al.  A multiple criteria decision making approach to manure management systems in the Netherlands , 2014, Eur. J. Oper. Res..

[41]  Yiyu Yao,et al.  An Analysis of Quantitative Measures Associated with Rules , 1999, PAKDD.

[42]  Wojciech Ziarko,et al.  Variable Precision Rough Set Model , 1993, J. Comput. Syst. Sci..

[43]  Wenqing Zhao,et al.  An Email Classification Scheme Based on Decision-Theoretic Rough Set Theory and Analysis of Email Security , 2005, TENCON 2005 - 2005 IEEE Region 10 Conference.

[44]  Pat Langley,et al.  An Analysis of Bayesian Classifiers , 1992, AAAI.

[45]  Harry Zhang,et al.  Exploring Conditions For The Optimality Of Naïve Bayes , 2005, Int. J. Pattern Recognit. Artif. Intell..

[46]  Zdzislaw Pawlak,et al.  Rough sets, decision algorithms and Bayes' theorem , 2002, Eur. J. Oper. Res..

[47]  Yiyu Yao,et al.  The superiority of three-way decisions in probabilistic rough set models , 2011, Inf. Sci..

[48]  Yiyu Yao,et al.  Naive Bayesian Rough Sets , 2010, RSKT.

[49]  Jingtao Yao,et al.  Game-Theoretic Risk Analysis in Decision-Theoretic Rough Sets , 2008, RSKT.