Semantic Interpretation of Compound Nominalization Using TreeBank and the World Wide Web

The interpretation of nominal compounds is one of the most difficult problems in natural language processing. This paper proposes a new model for the automatic classification of four coarse-grained semantic relations involved in Chinese compound nominalizations. In such a model, for a compound nominalization, its paraphrased syntactic role occurrences (PSRO) in a treebank are exploited to form feature vectors for supervised classifiers. To solve the problem of data sparseness, the World Wide Web is used to discover relational clusters and such clusters are employed to produce smoothed PSRO feature vectors for the compound nominalizations. The experimental results show that such a method is very effective.

[1]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[2]  James Arthur Kohl,et al.  HARNESS: Heterogeneous Adaptable Reconfigurable NEtworked SystemS , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[3]  Maria Lapata,et al.  The Automatic Interpretation of Nominalizations , 2000, AAAI/IAAI.

[4]  Jeremy Nicholson,et al.  Statistical interpretation of compound nouns , 2005 .

[5]  Ronald Rosenfeld,et al.  Improving trigram language modeling with the World Wide Web , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[6]  Fernando Gomez,et al.  Semantic Interpretation of Nominalizations , 1996, AAAI/IAAI, Vol. 2.

[7]  Gregory Grefenstette,et al.  Estimation of English and non-English Language Use on the WWW , 2000, RIAO.

[8]  Qiang Dong,et al.  Hownet And The Computation Of Meaning , 2006 .

[9]  Guillaume Mercier,et al.  MPICH/MADIII : a cluster of clusters enabled MPI implementation , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[10]  Peter D. Turney Measuring Semantic Similarity by Latent Relational Analysis , 2005, IJCAI.

[11]  Barbara Rosario,et al.  Classifying the Semantic Relations in Noun Compounds via a Domain-Specific Lexical Hierarchy , 2001, EMNLP.

[12]  Sharon A. Caraballo Automatic construction of a hypernym-labeled noun hierarchy from text , 1999, ACL.

[13]  Dan Moldovan,et al.  Models for the Semantic Classification of Noun Phrases , 2004, HLT-NAACL 2004.

[14]  Rosie Jones,et al.  Automatically Building a Corpus for a Minority Language from the Web , 2000, ACL 2000.

[15]  Timothy W. Finin,et al.  The semantic interpretation of compound nominals , 1980 .

[16]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[17]  Ari Rappoport,et al.  Efficient Unsupervised Discovery of Word Categories Using Symmetric Patterns and High Frequency Words , 2006, ACL.

[18]  P. Resnik Selection and information: a class-based approach to lexical relationships , 1993 .

[19]  Rosemary Leonard,et al.  The Interpretation of English Noun Sequences on the Computer , 1984 .

[20]  Frank Keller,et al.  Using the Web to Obtain Frequencies for Unseen Bigrams , 2003, CL.

[21]  Mirella Lapata,et al.  A comparison of parsing technologies for the biomedical domain , 2005, Natural Language Engineering.

[22]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[23]  Dhabaleswar K. Panda,et al.  High performance RDMA-based MPI implementation over InfiniBand , 2003, ICS.

[24]  David R. Dowty Thematic proto-roles and argument selection , 1991 .

[25]  Timothy Baldwin,et al.  Interpreting Semantic Relations in Noun Compounds via Verb Semantics , 2006, ACL.

[26]  Avneesh Pant,et al.  Communicating efficiently on cluster based grids with MPICH-VMI , 2004, 2004 IEEE International Conference on Cluster Computing (IEEE Cat. No.04EX935).

[27]  Timothy Baldwin,et al.  Automatic Interpretation of Noun Compounds Using WordNet Similarity , 2005, IJCNLP.

[28]  Donald Loritz,et al.  The analysis of noun sequences using semantic information extracted from on-line dictionaries , 1996 .

[29]  Daniel Jurafsky,et al.  Parsing Arguments of Nominalizations in English and Chinese , 2004, HLT-NAACL.

[30]  Nianwen Xue,et al.  Semantic role labeling of nominalized predicates in Chinese , 2006, NAACL.

[31]  Deborah A. Dahl,et al.  Nominalizations in PUNDIT , 1987, ACL.

[32]  David Blair Mcdonald,et al.  Understanding noun compounds , 1982 .