Verb Physics: Relative Physical Knowledge of Actions and Objects

Learning commonsense knowledge from natural language text is nontrivial due to reporting bias: people rarely state the obvious, e.g., "My house is bigger than me." However, while rarely stated explicitly, this trivial everyday knowledge does influence the way people talk about the world, which provides indirect clues to reason about the world. For example, a statement like, "Tyler entered his house" implies that his house is bigger than Tyler. In this paper, we present an approach to infer relative physical knowledge of actions and objects along five dimensions (e.g., size, weight, and strength) from unstructured natural language text. We frame knowledge acquisition as joint inference over two closely related problems: learning (1) relative physical knowledge of object pairs and (2) physical implications of actions when applied to those object pairs. Empirical results demonstrate that it is possible to extract knowledge of actions and objects from language and that joint inference over different types of knowledge improves performance.

[1]  Yotaro Watanabe,et al.  Is a 204 cm Man Tall or Small ? Acquisition of Numerical Common Sense from the Web , 2013, ACL.

[2]  David R. Dowty Thematic proto-roles and argument selection , 1991 .

[3]  Li Fei-Fei,et al.  Reasoning about Object Affordances in a Knowledge Base Representation , 2014, ECCV.

[4]  Ari Rappoport,et al.  Extraction and Approximation of Numerical Attributes from the Web , 2010, ACL.

[5]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[6]  Lenhart K. Schubert,et al.  Learning from the Web: Extracting General World Knowledge from Noisy Text , 2010, Collaboratively-Built Knowledge Sources and AI.

[7]  Ali Farhadi,et al.  VisKE: Visual knowledge extraction and question answering by visual verification of relation phrases , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Roy Schwartz,et al.  How Well Do Distributional Models Capture Different Types of Semantic Knowledge? , 2015, ACL.

[9]  Sheng Zhang,et al.  Ordinal Common-sense Inference , 2016, TACL.

[10]  Siobhan Chapman Logic and Conversation , 2005 .

[11]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[12]  Benjamin Van Durme,et al.  Reporting bias and knowledge acquisition , 2013, AKBC '13.

[13]  Gerhard Weikum,et al.  Acquiring Comparative Commonsense Knowledge from the Web , 2014, AAAI.

[14]  Ross B. Girshick,et al.  Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Joyce Yue Chai,et al.  Incremental Acquisition of Verb Hypothesis Space towards Physical World Interaction , 2016, ACL.

[16]  Greg Carlson,et al.  A unified analysis of the English bare plural , 1977 .

[17]  Yejin Choi,et al.  Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[18]  Xiang Li,et al.  Commonsense Knowledge Base Completion , 2016, ACL.

[19]  Francis Ferraro,et al.  Semantic Proto-Roles , 2015, TACL.

[20]  Beth Levin,et al.  English Verb Classes and Alternations: A Preliminary Investigation , 1993 .

[21]  Erez Lieberman Aiden,et al.  Quantitative Analysis of Culture Using Millions of Digitized Books , 2010, Science.

[22]  Shaohua Yang,et al.  Physical Causality of Action Verbs in Grounded Language Understanding , 2016, ACL.

[23]  Carina Silberer,et al.  Models of Semantic Representation with Visual Attributes , 2013, ACL.

[24]  Lenhart K. Schubert,et al.  Using Textual Patterns to Learn Expected Event Frequencies , 2012, AKBC-WEKEX@NAACL-HLT.

[25]  Yoav Goldberg,et al.  A Dataset of Syntactic-Ngrams over Time from a Very Large Corpus of English Books , 2013, *SEMEVAL.

[26]  Jun'ichi Tsujii,et al.  Estimating Numerical Attributes by Bringing Together Fragmentary Clues , 2015, HLT-NAACL.

[27]  Christopher D. Manning,et al.  Philosophers are Mortal: Inferring the Truth of Unseen Facts , 2013, CoNLL.

[28]  Gerhard Weikum,et al.  Commonsense in Parts: Mining Part-Whole Relations from the Web and Image Tags , 2016, AAAI.

[29]  Martha Palmer,et al.  Class-Based Construction of a Verb Lexicon , 2000, AAAI/IAAI.

[30]  E. Kako Thematic role properties of subjects and objects , 2006, Cognition.

[31]  Gemma Boleda,et al.  Distributional Semantics in Technicolor , 2012, ACL.

[32]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[33]  Thomas G. Dietterich,et al.  Inverting Grice's Maxims to Learn Rules from Natural Language Extractions , 2011, NIPS.

[34]  Ali Farhadi,et al.  Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects , 2016, AAAI.

[35]  C. Fillmore FRAME SEMANTICS AND THE NATURE OF LANGUAGE * , 1976 .

[36]  Christopher D. Manning,et al.  NaturalLI: Natural Logic Inference for Common Sense Reasoning , 2014, EMNLP.

[37]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[38]  Van Durme,et al.  Extracting implicit knowledge from text , 2009 .