Automatic acquisition for sensibility knowledge using co-occurrence relation

When companies obtain customer opinions and requests from free-styled writings, sensibility expressions are important because they include personal subjectivity such as claims 'Too much dust'. Sensibility knowledge registered in the expressions is difficult to build manually. In order to reduce personal burden for building sensibility knowledge, this paper presents a method to acquire sensibility expressions automatically by using co-occurrence relation. According to experimental results of co-occurrence knowledge registered 1,300,000 terms, the rate of correct answers included in acquired sensibility expressions is 73.5%.

[1]  Ido Dagan,et al.  Similarity-Based Methods for Word Sense Disambiguation , 1997, ACL.

[2]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[3]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[4]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[5]  Masao Fuketa,et al.  A method of extracting and evaluating good and bad reputations for natural language expressions , 2005, Int. J. Inf. Technol. Decis. Mak..

[6]  Donald Hindle,et al.  Noun Classification From Predicate-Argument Structures , 1990, ACL.

[7]  Dekang Lin,et al.  Using Syntactic Dependency as Local Context to Resolve Word Sense Ambiguity , 1997, ACL.

[8]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[9]  David E. Millard,et al.  Automatic extraction of knowledge from web documents , 2003 .

[10]  Alexiei Dingli,et al.  Mining Web Sites Using Unsupervised Adaptive Information Extraction , 2003, EACL.

[11]  Ido Dagan,et al.  Contextual Word Similarity and Estimation from Sparse Data , 1993, ACL.

[12]  Masao Fuketa,et al.  Word classification and hierarchy using co-occurrence word information , 2004, Inf. Process. Manag..

[13]  Li Ding,et al.  How the Semantic Web is Being Used: An Analysis of FOAF Documents , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[14]  Ricardo Baeza-Yates,et al.  Information Retrieval: Data Structures and Algorithms , 1992 .

[15]  Masao Fuketa,et al.  Documents similarity measurement using field association terms , 2003, Inf. Process. Manag..

[16]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[17]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[18]  Naftali Tishby,et al.  Distributional Clustering of English Words , 1993, ACL.

[19]  Gerda Ruge,et al.  Experiments on Linguistically-Based Term Associations , 1992, Inf. Process. Manag..

[20]  Ralph Grishman,et al.  Machine Learning of Extraction Patterns from Unannotated Corpora: Position Statement , 2000 .

[21]  Volker Steinbiss,et al.  Cooccurrence smoothing for stochastic language modeling , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[22]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[23]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[24]  Dedre Gentner,et al.  Why Nouns Are Learned before Verbs: Linguistic Relativity Versus Natural Partitioning. Technical Report No. 257. , 1982 .

[25]  Ido Dagan,et al.  Similarity-Based Estimation of Word Cooccurrence Probabilities , 1994, ACL.

[26]  Alexiei Dingli,et al.  Mining web sites using adaptive information extraction , 2003 .