Causality measures and analysis: A rough set framework

Abstract Data and rules power expert systems and intelligent systems. Rules, as a form of knowledge representation, can be acquired by experts or learned from data. The accuracy and precision of knowledge largely determines the success of the systems, which awakens the concern for causality. The ability to elicit cause–effect rules directly from data is key and difficult to any expert systems and intelligent systems. Rough set theory has succeeded in automatically transforming data into knowledge, where data are often presented as an attribute-value table. However, the existing tools in this theory are currently incapable of interpreting counterfactuals and interventions involved in causal analysis. This paper offers an attempt to characterize the cause–effect relationships between attributes in attribute-value tables with intent to overcome existing limitations. First, we establish the main conditions that attributes need to satisfy in order to estimate the causal effects between them, by employing the back-door criterion and the adjustment formula for a directed acyclic graph. In particular, based on the notion of lower approximation, we extend the back-door criterion to an original data table without any graphical structures. We then identify the effects of the interventions and the counterfactual interpretation of causation between attributes in such tables. Through illustrative studies completed for some attribute-value tables, we show the procedure for identifying the causation between attributes and examine whether the dependency of the attributes can describe causality between them.

[1]  Min Zhao,et al.  Multilevel data summarization from information systems: a "rule + exception" approach , 2003, AI Commun..

[2]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[3]  Wojciech Ziarko,et al.  Dependencies in Structures of Decision Tables , 2007, RSEISP.

[4]  Zied Elouedi,et al.  Dealing with external actions in belief causal networks , 2013, Int. J. Approx. Reason..

[5]  Yiyu Yao,et al.  The superiority of three-way decisions in probabilistic rough set models , 2011, Inf. Sci..

[6]  Daisuke Yamaguchi,et al.  Attribute dependency functions considering data efficiency , 2009, Int. J. Approx. Reason..

[7]  Ivo Düntsch,et al.  Roughian: Rough information analysis , 2001 .

[8]  Michael G. Madden,et al.  Bayesian networks for mathematical models: Techniques for automatic construction and efficient inference , 2013, Int. J. Approx. Reason..

[9]  Andrzej Skowron,et al.  Some contributions by zdzisław pawlak , 2006 .

[10]  Nouman Azam,et al.  Web-Based Medical Decision Support Systems for Three-Way Medical Decision Making With Game-Theoretic Rough Sets , 2015, IEEE Transactions on Fuzzy Systems.

[11]  Usman Qamar,et al.  A heuristic based dependency calculation technique for rough set theory , 2018, Pattern Recognit..

[12]  Wei-Zhi Wu,et al.  On rule acquisition in incomplete multi-scale decision tables , 2017, Inf. Sci..

[13]  Duoqian Miao,et al.  Quantitative/qualitative region-change uncertainty/certainty in attribute reduction: Comparative region-change analyses based on granular computing , 2016, Inf. Sci..

[14]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[15]  Jin Tian,et al.  Probabilities of causation: Bounds and identification , 2000, Annals of Mathematics and Artificial Intelligence.

[16]  Davide Ciucci,et al.  Rough Set Theory and Digraphs , 2017, Fundam. Informaticae.

[17]  Somjit Arch-int,et al.  A rough set approach for approximating differential dependencies , 2018, Expert Syst. Appl..

[18]  Davide Ciucci,et al.  Generalizations of Rough Set Tools Inspired by Graph Theory , 2016, Fundam. Informaticae.

[19]  Judea Pearl,et al.  The seven tools of causal inference, with reflections on machine learning , 2019, Commun. ACM.

[20]  Elias Bareinboim,et al.  Causal inference and the data-fusion problem , 2016, Proceedings of the National Academy of Sciences.

[21]  Ivo Düntsch,et al.  Statistical evaluation of rough set dependency analysis , 1997, Int. J. Hum. Comput. Stud..

[22]  Jadzia Cendrowska,et al.  PRISM: An Algorithm for Inducing Modular Rules , 1987, Int. J. Man Mach. Stud..

[23]  Thierry Denoeux,et al.  Making Use of Partial Knowledge About Hidden States in HMMs: An Approach Based on Belief Functions , 2014, IEEE Transactions on Fuzzy Systems.

[24]  M. Maathuis,et al.  A Generalized Back-door Criterion 1 , 2015 .

[25]  Salem Benferhat,et al.  Possibilistic Causal Networks for Handling Interventions: A New Propagation Algorithm , 2007, AAAI.

[26]  Jie Zhou,et al.  β-Interval attribute reduction in variable precision rough set model , 2011, Soft Comput..

[27]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[28]  D. Rubin [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies , 1990 .

[29]  Pawan Lingras,et al.  Partially ordered rough ensemble clustering for multigranular representations , 2015, Intell. Data Anal..

[30]  P. Games Correlation and Causation: A Logical Snafu , 1990 .

[31]  Giampiero Chiaselotti,et al.  Granular computing on information tables: Families of subsets and operators , 2018, Inf. Sci..

[32]  Andrzej Skowron,et al.  Rudiments of rough sets , 2007, Inf. Sci..

[33]  Xiaodong Yue,et al.  Parallel attribute reduction algorithms using MapReduce , 2014, Inf. Sci..

[34]  Zhifei Zhang,et al.  Probabilistic Estimation for Generalized Rough Modus Ponens and Rough Modus Tollens , 2016, IJCRS.

[35]  Wojciech,et al.  Probabilistic Approach to Rough Set Theory , 2006 .

[36]  Li Long-shu Variable precision rough set model based on set pair situation , 2010 .

[37]  D. A. Kenny,et al.  Correlation and Causation , 1937, Wilmott.

[38]  Witold Pedrycz,et al.  Granular Computing: Analysis and Design of Intelligent Systems , 2013 .

[39]  Jin Tian,et al.  Recovering Causal Effects from Selection Bias , 2015, AAAI.

[40]  J. Pearl Causal Thinking in the Twilight Zone , 2021 .

[41]  J. Pearl 3. The Foundations of Causal Inference , 2010 .

[42]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[43]  Salem Benferhat,et al.  Interventions and belief change in possibilistic graphical models , 2010, Artif. Intell..

[44]  Zdzislaw Pawlak,et al.  Some Issues on Rough Sets , 2004, Trans. Rough Sets.

[45]  Joshua B. Tenenbaum,et al.  Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.