Learning Rules from Incomplete Examples via Implicit Mention Models

We study the problem of learning general rules from concrete facts extracted from natural data sources such as newspaper stories and medical histories. Natural data sources present two challenges to automated learning: radical incompleteness and systematic bias. In this paper, we propose an approach that combines simultaneous learning of multiple predictive rules with differential scoring of evidence that adapts to a presumed model of data generation. Learning multiple predicates simultaneously mitigates the problem of radical incompleteness, while differential scoring helps reduce the effects of systematic bias. We evaluate our approach empirically on both textual and non-textual sources. We further present a theoretical analysis that elucidates our approach and explains the empirical results.
