Causal graph extraction from news: a comparative study of time-series causality learning techniques

Causal graph extraction from news has the potential to aid in the understanding of complex scenarios. In particular, it can help explain and predict events, as well as conjecture about possible cause-effect connections. However, limited work has addressed the problem of large-scale extraction of causal graphs from news articles. This article presents a novel framework for extracting causal graphs from digital text media. The framework relies on topic-relevant variables representing terms and ongoing events that are selected from a domain under analysis by applying specially developed information retrieval and natural language processing methods. Events are represented as event-phrase embeddings, which make it possible to group similar events into semantically cohesive clusters. A time series of the selected variables is given as input to a causal structure learning techniques to learn a causal graph associated with the topic that is being examined. The complete framework is applied to the New York Times dataset, which covers news for a period of 246 months (roughly 20 years), and is illustrated through a case study. An initial evaluation based on synthetic data is carried out to gain insight into the most effective time-series causality learning techniques. This evaluation comprises a systematic analysis of nine state-of-the-art causal structure learning techniques and two novel ensemble methods derived from the most effective techniques. Subsequently, the complete framework based on the most promising causal structure learning technique is evaluated with domain experts in a real-world scenario through the use of the presented case study. The proposed analysis offers valuable insights into the problems of identifying topic-relevant variables from large volumes of news and learning causal graphs from time series.

[1]  Ana Gabriela Maguitman,et al.  Detecting ongoing events using contextual word and sentence embeddings , 2022, Expert Syst. Appl..

[2]  K. Candan,et al.  Evaluation Methods and Measures for Causal Learning Algorithms , 2022, IEEE Transactions on Artificial Intelligence.

[3]  Josiah Poon,et al.  A survey on extraction of causal relations from natural language text , 2021, Knowledge and Information Systems.

[4]  Shubhashis Sengupta,et al.  Causal-BERT : Language models for causality detection between events expressed in text , 2020, SAI.

[5]  Ana Gabriela Maguitman,et al.  Assessing the behavior and performance of a supervised term-weighting technique for topic-based retrieval , 2020, Inf. Process. Manag..

[6]  Stefan Heindorf,et al.  CauseNet: Towards a Causality Graph Extracted from the Web , 2020, CIKM.

[7]  Ana Gabriela Maguitman,et al.  Event Detection Dataset , 2020 .

[8]  Jonas Peters,et al.  Foundations and new horizons for causal inference , 2020, Oberwolfach Reports.

[9]  Andrea Esuli,et al.  Learning to Weight for Text Classification , 2019, IEEE Transactions on Knowledge and Data Engineering.

[10]  Jiuyong Li,et al.  Introduction to the Special Section on Advances in Causal Discovery and Inference , 2019, ACM Trans. Intell. Syst. Technol..

[11]  Cynthia Rudin,et al.  Why Are We Using Black Box Models in AI When We Don’t Need To? A Lesson From An Explainable AI Competition , 2019, 1.2.

[12]  Ananth Balashankar,et al.  Identifying Predictive Causal Factors from News Streams , 2019, EMNLP.

[13]  Bernhard Schölkopf,et al.  Inferring causation from time series in Earth system sciences , 2019, Nature Communications.

[14]  Heng Ji,et al.  Joint Entity and Event Extraction with Generative Adversarial Imitation Learning , 2019, Data Intelligence.

[15]  Ana Gabriela Maguitman,et al.  A Flexible Supervised Term-Weighting Technique and its Application to Variable Extraction and Information Retrieval , 2019, Inteligencia Artif..

[16]  Dino Sejdinovic,et al.  Detecting and quantifying causal associations in large nonlinear time series datasets , 2017, Science Advances.

[17]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[18]  Ralph Grishman,et al.  Graph Convolutional Networks With Argument-Aware Pooling for Event Detection , 2018, AAAI.

[19]  Bernhard Schölkopf,et al.  Elements of Causal Inference: Foundations and Learning Algorithms , 2017 .

[20]  Christina Heinze-Deml,et al.  Causal Structure Learning , 2017, 1706.09141.

[21]  William B. Nicholson,et al.  BigVAR: Tools for Modeling Sparse High-Dimensional Multivariate Time Series , 2017, 1702.07094.

[22]  J. Pearl,et al.  Causal Inference in Statistics: A Primer , 2016 .

[23]  Elias Bareinboim,et al.  Causal Inference from Big Data: Theoretical Foundations and the Data-fusion Problem , 2015 .

[24]  Yücel Saygin,et al.  Sentimental causal rule discovery from Twitter , 2014, Expert Syst. Appl..

[25]  Hal R. Varian,et al.  Big Data: New Tricks for Econometrics , 2014 .

[26]  Kira Radinsky,et al.  Learning causality for news events prediction , 2012, WWW.

[27]  Aapo Hyvärinen,et al.  DirectLiNGAM: A Direct Method for Learning a Linear Non-Gaussian Structural Equation Model , 2011, J. Mach. Learn. Res..

[28]  Lawrence J. Mazlack,et al.  Lexico-syntactic causal pattern text mining , 2010 .

[29]  D. Sculley,et al.  Web-scale k-means clustering , 2010, WWW '10.

[30]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[31]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[32]  Christophe Ambroise,et al.  SIMoNe: Statistical Inference for MOdular NEtworks , 2009, Bioinform..

[33]  James J. Heckman,et al.  Econometric Evaluation of Social Programs, Part I: Causal Models, Structural Models and Econometric Policy Evaluation , 2007 .

[34]  Aapo Hyvärinen,et al.  A Linear Non-Gaussian Acyclic Model for Causal Discovery , 2006, J. Mach. Learn. Res..

[35]  Massimo Poesio,et al.  Acquiring Bayesian Networks from Text , 2004, LREC.

[36]  Rajeev Motwani,et al.  Scalable Techniques for Mining Causal Structures , 1998, Data Mining and Knowledge Discovery.

[37]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[38]  Dan I. Moldovan,et al.  Text Mining for Causal Relations , 2002, FLAIRS.

[39]  Schreiber,et al.  Measuring information transfer , 2000, Physical review letters.

[40]  R Scheines,et al.  The TETRAD Project: Constraint Based Aids to Causal Model Specification. , 1998, Multivariate behavioral research.

[41]  P. Spirtes,et al.  An Algorithm for Fast Recovery of Sparse Causal Graphs , 1991 .

[42]  C. Sims MACROECONOMICS AND REALITY , 1977 .

[43]  C. Granger Investigating causal relations by econometric models and cross-spectral methods , 1969 .

[44]  R. L. Thorndike Who belongs in the family? , 1953 .