Human-Guided Learning for Probabilistic Logic Models

Advice-giving has been long explored in the artificial intelligence community to build robust learning algorithms when the data is noisy, incorrect or even insufficient. While logic based systems were effectively used in building expert systems, the role of the human has been restricted to being a “mere labeler” in recent times. We hypothesize and demonstrate that probabilistic logic can provide an effective and natural way for the expert to specify domain advice. Specifically, we consider different types of advice-giving in relational domains where noise could arise due to systematic errors or class-imbalance inherent in the domains. The advice is provided as logical statements or privileged features that are thenexplicitly considered by an iterative learning algorithm at every update. Our empirical evidence shows that human advice can effectively accelerate learning in noisy, structured domains where so far humans have been merely used as labelers or as designers of the (initial or final) structure of the model.

[1]  Manfred Jaeger,et al.  Parameter learning for relational Bayesian networks , 2007, ICML '07.

[2]  Sriraam Natarajan,et al.  Knowledge-Based Probabilistic Logic Learning , 2015, AAAI.

[3]  Glenn Fung,et al.  Knowledge-Based Support Vector Machine Classifiers , 2002, NIPS.

[4]  Stefano Ferilli,et al.  Structure Learning of Markov Logic Networks through Iterated Local Search , 2008, ECAI.

[5]  Jude W. Shavlik,et al.  Online Knowledge-Based Support Vector Machines , 2010, ECML/PKDD.

[6]  Cheng G. Weng,et al.  A New Evaluation Measure for Imbalanced Datasets , 2008, AusDM.

[7]  Pedro M. Domingos,et al.  Learning Markov Logic Networks Using Structural Motifs , 2010, ICML.

[8]  Nitesh V. Chawla,et al.  Data Mining for Imbalanced Datasets: An Overview , 2005, The Data Mining and Knowledge Discovery Handbook.

[9]  Sriraam Natarajan,et al.  Learning Relational Dependency Networks for Relation Extraction , 2016, ILP.

[10]  A. Cardona,et al.  An Integrated Micro- and Macroarchitectural Analysis of the Drosophila Brain by Computer-Assisted Serial Section Electron Microscopy , 2010, PLoS biology.

[11]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[12]  Thomas G. Dietterich,et al.  Learning from Sparse Data by Exploiting Monotonicity Constraints , 2005, UAI.

[13]  Andrew McCallum,et al.  Introduction to Statistical Relational Learning , 2007 .

[14]  Jun'ichi Tsujii,et al.  A Markov Logic Approach to Bio-Molecular Event Extraction , 2009, BioNLP@HLT-NAACL.

[15]  Christoph H. Lampert,et al.  Learning to Rank Using Privileged Information , 2013, 2013 IEEE International Conference on Computer Vision.

[16]  Craig Boutilier,et al.  A POMDP formulation of preference elicitation problems , 2002, AAAI/IAAI.

[17]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[18]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[19]  Toby Walsh,et al.  Uncertainty in Preference Elicitation and Aggregation , 2007, AAAI.

[20]  Luc De Raedt,et al.  Bayesian Logic Programming: Theory and Tool , 2007 .

[21]  Saso Dzeroski,et al.  Inductive Logic Programming: Techniques and Applications , 1993 .

[22]  Jennifer Neville,et al.  Relational Dependency Networks , 2007, J. Mach. Learn. Res..

[23]  Taisuke Sato,et al.  Efficient EM Learning with Tabulation for Parameterized Logic Programs , 2000, Computational Logic.

[24]  Kristian Kersting,et al.  Imitation Learning in Relational Domains: A Functional-Gradient Boosting Approach , 2011, IJCAI.

[25]  Shuo Yang,et al.  Identifying Rare Diseases from Behavioural Data: A Machine Learning Approach , 2016, 2016 IEEE First International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE).

[26]  Kristian Kersting,et al.  Learning Markov Logic Networks via Functional Gradient Boosting , 2011, 2011 IEEE 11th International Conference on Data Mining.

[27]  Christopher Ré,et al.  Effectively Creating Weakly Labeled Training Examples via Approximate Domain Knowledge , 2014, ILP.

[28]  Jude W. Shavlik,et al.  Knowledge-Based Artificial Neural Networks , 1994, Artif. Intell..

[29]  Yuji Matsumoto,et al.  Jointly Identifying Temporal Relations with Markov Logic , 2009, ACL.

[30]  Pedro M. Domingos,et al.  Markov Logic: An Interface Layer for Artificial Intelligence , 2009, Markov Logic: An Interface Layer for Artificial Intelligence.

[31]  Benjamin Kuipers,et al.  Qualitative reasoning: Modeling and simulation with incomplete knowledge , 1994, Autom..

[32]  Sriraam Natarajan,et al.  Extracting Adverse Drug Events from Text Using Human Advice , 2015, AIME.

[33]  Kristian Kersting,et al.  Gradient-based boosting for statistical relational learning: The relational dependency network case , 2011, Machine Learning.

[34]  Alan Fern,et al.  Imitation Learning with Demonstrations and Shaping Rewards , 2014, AAAI.

[35]  Pedro M. Domingos,et al.  Joint Inference in Information Extraction , 2007, AAAI.

[36]  Hendrik Blockeel,et al.  Top-Down Induction of First Order Logical Decision Trees , 1998, AI Commun..

[37]  Sriraam Natarajan,et al.  Guiding Autonomous Agents to Better Behaviors through Human Advice , 2013, 2013 IEEE 13th International Conference on Data Mining.

[38]  Lise Getoor,et al.  Learning Probabilistic Relational Models , 1999, IJCAI.

[39]  Noah A. Smith,et al.  Softmax-Margin CRFs: Training Log-Linear Models with Cost Functions , 2010, NAACL.

[40]  Sriraam Natarajan,et al.  Boosting for Postpartum Depression Prediction , 2017, 2017 IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE).

[41]  Shuo Yang,et al.  Knowledge Intensive Learning: Combining Qualitative Constraints with Causal Independence for Parameter Learning in Probabilistic Models , 2013, ECML/PKDD.

[42]  Kristian Kersting,et al.  Accelerating Imitation Learning in Relational Domains via Transfer by Initialization , 2013, ILP.

[43]  Kristian Kersting,et al.  Non-parametric policy gradients: a unified treatment of propositional and relational domains , 2008, ICML '08.

[44]  Kristian Kersting,et al.  Boosting Relational Sequence Alignments , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[45]  Luc De Raedt,et al.  Probabilistic logic learning , 2003, SKDD.

[46]  Hoifung Poon,et al.  Joint Inference for Knowledge Extraction from Biomedical Literature , 2010, NAACL.

[47]  Pedro M. Domingos,et al.  Learning Markov logic network structure via hypergraph lifting , 2009, ICML '09.

[48]  Rauf Izmailov,et al.  Learning using privileged information: similarity control and knowledge transfer , 2015, J. Mach. Learn. Res..

[49]  Kristian Kersting,et al.  Learning from Imbalanced Data in Relational Domains: A Soft Margin Approach , 2014, 2014 IEEE International Conference on Data Mining.

[50]  Alex M. Andrew,et al.  Boosting: Foundations and Algorithms , 2012 .

[51]  Avi Pfeffer,et al.  Learning Probabilities for Noisy First-Order Rules , 1997, IJCAI.

[52]  Ben Taskar,et al.  Learning Probabilistic Models of Relational Structure , 2001, ICML.

[53]  Vladimir Vapnik,et al.  A new learning paradigm: Learning using privileged information , 2009, Neural Networks.