Automatically Identifying Changes in the Semantic Orientation of Words

The meanings of words are not fixed but in fact undergo change, with new word senses arising and established senses taking on new aspects of meaning or falling out of usage. Two types of semantic change are amelioration and pejoration; in these processes a word sense changes to become more positive or negative, respectively. In this first computational study of amelioration and pejoration we adapt a web-based method for determining semantic orientation to the task of identifying ameliorations and pejorations in corpora from differing time periods. We evaluate our proposed method on a small dataset of known historical ameliorations and pejorations, and find it to perform better than a random baseline. Since this test dataset is small, we conduct a further evaluation on artificial examples of amelioration and pejoration, and again find evidence that our proposed method is able to identify changes in semantic orientation. Finally, we conduct a preliminary evaluation in which we apply our methods to the task of finding words which have recently undergone amelioration or pejoration.

[1]  Victor Sadler,et al.  Review of Lexical acquisition: exploiting on-line resources to build a lexicon by Uri Zernik. Lawrence Erlbaum Associates 1991. , 1993 .

[2]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[3]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[4]  Rainer Siemund,et al.  The Lampeter Corpus of Early Modern English Tracts , 1997 .

[5]  Hendrik De Smet,et al.  A corpus of late modern English texts , 2005 .

[6]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[7]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[8]  Uri Zernik,et al.  Lexical acquisition: Exploiting on-line resources to build a lexicon. , 1991 .

[9]  David K. Barnhart Prizes and Pitfalls of Computerized Searching for New Words for Dictionaries , 2012 .

[10]  Philip J. Stone,et al.  Extracting Information. (Book Reviews: The General Inquirer. A Computer Approach to Content Analysis) , 1967 .

[11]  H. Kucera,et al.  Computational analysis of present-day American English , 1967 .

[12]  Lyle Campbell,et al.  Historical Linguistics: An Introduction , 1991 .

[13]  Claudia Claridge,et al.  The Lampeter Corpus of Early Modern English Tracts , 2000 .

[14]  Hinrich Schütze,et al.  Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[15]  Saif Mohammad,et al.  Generating High-Coverage Semantic Orientation Lexicons From Overtly Marked Words and a Thesaurus , 2009, EMNLP.

[16]  J. Simpson,et al.  Neologism: The Long View , 2012 .

[17]  Eyal Sagi,et al.  Semantic Density Analysis: Comparing Word Meaning across Time and Phonetic Space , 2009 .

[18]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[19]  Claire Cardie,et al.  Adapting a Polarity Lexicon using Integer Linear Programming for Domain-Specific Sentiment Classification , 2009, EMNLP.

[20]  Stefan Th. Gries,et al.  Assessing frequency changes in multistage diachronic corpora: Applications for historical corpus linguistics and the study of language acquisition , 2009, Lit. Linguistic Comput..

[21]  Adrian Room,et al.  Dictionary of changes in meaning , 1986 .

[22]  Yorick Wilks,et al.  Making Preferences More Active , 1978, Artif. Intell..

[23]  Laurel J. Brinton,et al.  The English Language: A Linguistic History , 2006 .