Towards the Automatic Identification of Adjectival Scales: Clustering Adjectives According to Meaning

In this paper we present a method to group adjectives according to their meaning, as a first step towards the automatic identification of adjectival scales. We discuss the properties of adjectival scales and of groups of semantically related adjectives and how they imply sources of linguistic knowledge in text corpora. We describe how our system exploits this linguistic knowledge to compute a measure of similarity between two adjectives, using statistical techniques and without having access to any semantic information about the adjectives. We also show how a clustering algorithm can use these similarities to produce the groups of adjectives, and we present results produced by our system for a sample set of adjectives. We conclude by presenting evaluation methods for the task at hand, and analyzing the significance of the results obtained.

[1]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[2]  Yuen Ren Chao,et al.  Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology , 1950 .

[3]  J. M. Kittross The measurement of meaning , 1959 .

[4]  Donald E. Knuth,et al.  The Art of Computer Programming, Volume I: Fundamental Algorithms, 2nd Edition , 1997 .

[5]  John A. Swets,et al.  Effectiveness of information retrieval methods , 1969 .

[6]  Laurence R. Horn,et al.  On the semantic properties of logical operators in english' reproduced by the indiana university lin , 1972 .

[7]  D. Bolinger Neutrality, norm, and bias , 1977 .

[8]  Judith N. Levi,et al.  The syntax and semantics of complex nominals , 1978 .

[9]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[10]  Reuven Y. Rubinstein,et al.  Simulation and the Monte Carlo method , 1981, Wiley series in probability and mathematical statistics.

[11]  Dirk P. Kroese,et al.  Simulation and the Monte Carlo Method (Wiley Series in Probability and Statistics) , 1981 .

[12]  Fionn Murtagh,et al.  Cluster Dissection and Analysis: Theory, Fortran Programs, Examples. , 1986 .

[13]  Karen Sparck Jones Synonymy and semantic classification , 1986 .

[14]  Donald Hindle,et al.  Acquiring Disambiguation Rules from Text , 1989, ACL.

[15]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[16]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[17]  Michael Elhadad,et al.  Generating Adjectives to Express the Speaker's Argumentative Intent , 1991, AAAI.

[18]  Slava M. Katz,et al.  Co-Occurrences of Antonymous Adjectives and Their Contexts , 1991, Comput. Linguistics.

[19]  David D. Lewis,et al.  Text filtering in MUC-3 and MUC-4 , 1992, MUC.

[20]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[21]  Kevin Knight,et al.  Building a Large Ontology for Machine Translation , 1993, HLT.

[22]  Naftali Tishby,et al.  Distributional Clustering of English Words , 1993, ACL.

[23]  Philip Resnik,et al.  Semantic Classes and Syntactic Ambiguity , 1993, HLT.

[24]  Douglas E. Appelt,et al.  FASTUS: A System for Extracting Information from Text , 1993, HLT.

[25]  Tomoyoshi Matsukawa,et al.  Hypothesizing Word Association From Untagged Text , 1993, HLT.

[26]  Michael Elhadad,et al.  Using argumentation to control lexical choice: a functional unification implementation , 1993 .