Semantic interpretation of noun compounds using verbal and other paraphrases

We study the problem of semantic interpretation of noun compounds such as bee honey, malaria mosquito, apple cake, and stem cell. In particular, we explore the potential of using predicates that make explicit the hidden relation that holds between the nouns that form the noun compound. For example, mosquito that carries malaria is a paraphrase of the compound malaria mosquito in which the verb explicitly states the semantic relation between the two nouns. We study the utility of using such paraphrasing verbs, with associated weights, to build a representation of the semantics of a noun compound, for example, malaria mosquito can be represented as follows: carry (23), spread (16), cause (12), transmit (9), and so on. We also explore the potential of using multiple paraphrasing verbs as features for predicting abstract semantic relations such as CAUSE, and we demonstrate that using explicit paraphrases can help improve statistical machine translation.

[1]  Preslav Nakov,et al.  UCB System Description for the WMT 2007 Shared Task , 2007, WMT@ACL.

[2]  Barbara Rosario,et al.  The Descent of Hierarchy, and Selection in Relational Semantics , 2002, ACL.

[3]  Dan I. Moldovan,et al.  A Semantic Approach to Recognizing Textual Entailment , 2005, HLT.

[4]  Judith N. Levi,et al.  The syntax and semantics of complex nominals , 1978 .

[5]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[6]  Preslav Nakov The Web as an Implicit Training Set: Application to Noun Compounds Syntax and Semantics , 2015, ACL 2015.

[7]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[8]  Hiyan Alshawi,et al.  Training and Scaling Preference Functions for Disambiguation , 1994, Comput. Linguistics.

[9]  Preslav Nakov,et al.  SemEval-2007 Task 04: Classification of Semantic Relations between Nominals , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[10]  Miriam R. L. Petruck FRAME SEMANTICS , 1996 .

[11]  Diarmuid Ó Séaghdha,et al.  Using Lexical and Relational Similarity to Classify Semantic Relations , 2009, EACL.

[12]  Diarmuid Ó Séaghdha Semantic Classification with WordNet Kernels , 2009, HLT-NAACL.

[13]  J. Program Chair-Tsujii Proceedings of the 16th conference on Computational linguistics - Volume 2 , 1996 .

[14]  Timothy Baldwin,et al.  Automatic Interpretation of Noun Compounds Using WordNet Similarity , 2005, IJCNLP.

[15]  Maria Lapata,et al.  The Disambiguation of Nominalizations , 2002, CL.

[16]  Jimmy J. Lin,et al.  A Paraphrase-Based Approach to Machine Translation Evaluation , 2005 .

[17]  Philipp Koehn,et al.  Improved Statistical Machine Translation Using Paraphrases , 2006, NAACL.

[18]  Eduard H. Hovy,et al.  A Taxonomy, Dataset, and Classifier for Automatic Noun Compound Interpretation , 2010, ACL.

[19]  Peter D. Turney Similarity of Semantic Relations , 2006, CL.

[20]  Beatrice Warren,et al.  Semantic patterns of noun-noun compounds , 1978 .

[21]  J. Fodor,et al.  The structure of a semantic theory , 1963 .

[22]  Alon Lavie,et al.  The Meteor metric for automatic evaluation of machine translation , 2009, Machine Translation.

[23]  Preslav Nakov,et al.  Large-Scale Noun Compound Interpretation Using Bootstrapping and the Web as a Corpus , 2011, EMNLP.

[24]  Preslav Nakov,et al.  SemEval-2 Task 9: The Interpretation of Noun Compounds Using Paraphrasing Verbs and Prepositions , 2009, HLT-NAACL 2009.

[25]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[26]  Ralph Grishman,et al.  Generalizing Automatically Generated Selectional Patterns , 1994, COLING.

[27]  Hermann Ney,et al.  A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[28]  Preslav Nakov,et al.  Classification of semantic relations between nominals , 2009, Lang. Resour. Evaluation.

[29]  Preslav Nakov,et al.  Improving English-Spanish Statistical Machine Translation: Experiments in Domain Adaptation, Sentence Paraphrasing, Tokenization, and Recasing , 2008, WMT@ACL.

[30]  ChengXiang Zhai,et al.  Fast Statistical Parsing of Noun Phrases for Document Indexing , 1997, ANLP.

[31]  Preslav Nakov,et al.  Improved Statistical Machine Translation Using Monolingual Paraphrases , 2008, ECAI.

[32]  Timothy Baldwin,et al.  Noun-Noun Compound Machine Translation A Feasibility Study on Shallow Processing , 2003, Proceedings of the ACL 2003 workshop on Multiword expressions analysis, acquisition and treatment -.

[33]  Preslav Nakov,et al.  Using Verbs to Characterize Noun-Noun Relations , 2006, AIMSA.

[34]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[35]  Preslav Nakov Noun Compound Interpretation Using Paraphrasing Verbs: Feasibility Study , 2008, AIMSA.

[36]  Ann Copestake,et al.  Co-occurrence Contexts for Noun Compound Interpretation , 2007 .

[37]  Satoshi Sekine,et al.  Automatic paraphrase acquisition from news articles , 2002 .

[38]  Regina Barzilay,et al.  Paraphrasing for Automatic Evaluation , 2006, NAACL.

[39]  Timothy Baldwin,et al.  Interpreting Semantic Relations in Noun Compounds via Verb Semantics , 2006, ACL.

[40]  Ying Zhang,et al.  Measuring confidence intervals for the machine translation evaluation metrics , 2004, TMI.

[41]  Diarmuid Ó Séaghdha Designing and Evaluating a Semantic Annotation Scheme for Compound Nouns , 2007 .

[42]  Dan I. Moldovan,et al.  On the semantics of noun compounds , 2005, Comput. Speech Lang..

[43]  David G. Hays Proceedings of the 8th conference on Computational linguistics , 1980 .

[44]  Liang Zhou,et al.  Re-evaluating Machine Translation Results with Paraphrase Support , 2006, EMNLP.

[45]  Gregory Grefenstette,et al.  The World Wide Web as a Resource for Example-Based Machine Translation Tasks , 1999, TC.

[46]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[47]  Pamela A. Downing On the Creation and Use of English Compound Nouns. , 1977 .

[48]  Christopher D. Manning,et al.  Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[49]  Tony Veale,et al.  A Concept-Centered Approach to Noun-Compound Interpretation , 2008, COLING.

[50]  Preslav Nakov,et al.  Solving Relational Similarity Problems Using the Web as a Corpus , 2008, ACL.

[51]  Emmon W. Bach,et al.  Universals in Linguistic Theory , 1970 .

[52]  Marti A. Hearst,et al.  Citances: Citation Sentences for Semantic Analysis of Bioscience Text , 2004 .

[53]  PietraVincent J. Della,et al.  The mathematics of statistical machine translation , 1993 .

[54]  Preslav Nakov,et al.  SemEval-2013 Task 4: Free Paraphrases of Noun Compounds , 2013, SemEval@NAACL-HLT.

[55]  A. Wierzbicka,et al.  Semantics and cognition. , 2006, Wiley interdisciplinary reviews. Cognitive science.

[56]  Lucy Vanderwende,et al.  Algorithm for Automatic Interpretation of Noun Sequences , 1994, COLING.

[57]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[58]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[59]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[60]  Dan Moldovan,et al.  Models for the Semantic Classification of Noun Phrases , 2004, HLT-NAACL 2004.

[61]  Josef Ruppenhofer,et al.  FrameNet's Frames vs. Levin's Verb Classes , 2002 .

[62]  Preslav Nakov,et al.  SemEval-2010 Task 8: Multi-Way Classification of Semantic Relations Between Pairs of Nominals , 2009, SEW@NAACL-HLT.

[63]  Preslav Nakov,et al.  SemEval-2010 Task 9: The Interpretation of Noun Compounds Using Paraphrasing Verbs and Prepositions , 2010, SemEval@ACL.

[64]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[65]  Stan Szpakowicz,et al.  Semi-Automatic Recognition of Noun Modifier Relationships , 1998, ACL.

[66]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[67]  Patrick Pantel,et al.  Discovery of inference rules for question-answering , 2001, Natural Language Engineering.

[68]  Timothy W. Finin,et al.  The semantic interpretation of compound nominals , 1980 .

[69]  Chris Callison-Burch,et al.  Paraphrasing with Bilingual Parallel Corpora , 2005, ACL.

[70]  A HearstMarti,et al.  Semantic interpretation of noun compounds using verbal and other paraphrases , 2013 .

[71]  Oren Etzioni,et al.  Relational Web Search , 2006 .

[72]  Michael L. Littman,et al.  Corpus-based Learning of Analogies and Semantic Relations , 2005, Machine Learning.

[73]  Franz Josef Och,et al.  Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[74]  Gerda Ruge,et al.  Experiments on Linguistically-Based Term Associations , 1992, Inf. Process. Manag..

[75]  Matthew G. Snover,et al.  A Study of Translation Edit Rate with Targeted Human Annotation , 2006, AMTA.

[76]  E. H. Hutten SEMANTICS , 1953, The British Journal for the Philosophy of Science.

[77]  Timothy Baldwin,et al.  Translation by Machine of Complex Nominals: Getting it Right , 2004 .

[78]  Mark Lauer,et al.  Designing Statistical Language Learners: Experiments on Noun Compounds , 1996, ArXiv.

[79]  Peter Norvig,et al.  The Unreasonable Effectiveness of Data , 2009, IEEE Intelligent Systems.

[80]  Ralph Weischedel,et al.  A STUDY OF TRANSLATION ERROR RATE WITH TARGETED HUMAN ANNOTATION , 2005 .

[81]  Preslav Nakov,et al.  UCB: System Description for SemEval Task #4 , 2007, SemEval@ACL.

[82]  Preslav Nakov,et al.  On the interpretation of noun compounds: Syntax, semantics, and entailment , 2013, Natural Language Engineering.

[83]  Ray Jackendoff Semantics and Cognition , 1983 .

[84]  Preslav Nakov,et al.  Combining Relational and Attributional Similarity for Semantic Relation Classification , 2011, RANLP.

[85]  Suzanne Stevenson,et al.  Automatically Identifying the Source Words of Lexical Blends in English , 2010, CL.

[86]  Fintan Costello,et al.  Modeling the Interpretation and Interpretation Ease of Noun-Noun Compounds Using a Relation Space Approach to Compound Meaning , 2006 .

[87]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.