A multi-level annotation model for fine-grained opinion detection in German blog comments

Subject of this paper is a fine-grained multilevel annotation model to enhance opinion detection in German blog comments. Up to now, only little research deals with the finegrained analysis of evaluative expressions in German blog comments. Therefore, we suggest a multi-level annotation model where different linguistic means as well as linguistic peculiarities of users’ formulation and evaluation styles in blog comments are considered. The model is intended as a basic scheme for the annotation of evaluative expressions in blog data. This annotation provides suitable features for implementing methods to automatically detect user opinions in blog comments.

[1]  Ashley J. Llorens,et al.  Coarse-and Fine-Grained Sentiment Analysis of Social Media Text , 2011 .

[2]  Helmut Schmid,et al.  Improvements in Part-of-Speech Tagging with an Application to German , 1999 .

[3]  Stefanie Dipper,et al.  XML-based Stand-off Representation and Exploitation of Multi-Level Linguistic Annotation , 2005, Berliner XML Tage.

[4]  Peter Schlobinski,et al.  Sprachliche und textuelle Aspekte in Weblogs. Ein internationales Projekt , 2005 .

[5]  Angelika Storrer,et al.  A TEI Schema for the Representation of Computer-mediated Communication , 2012 .

[6]  Christian Hänig,et al.  Towards Well-Grounded Phrase-Level Polarity Analysis , 2011, CICLing.

[7]  Gerhard Weikum,et al.  Stylistic Analysis Of Text For Information Access , 2005 .

[8]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[9]  P. Ekman Facial expression and emotion. , 1993, The American psychologist.

[10]  Stefan Evert,et al.  Is Part-of-Speech Tagging a Solved Task? An Evaluation of POS Taggers for the German Web as Corpus , 2009 .

[11]  G. A. Mishne,et al.  Expiriments with mood classification in blog posts , 2005, SIGIR 2005.

[12]  Dirk Thorleuchter,et al.  Extracting Consumers Needs for New Products - A Web Mining Approach , 2010, 2010 Third International Conference on Knowledge Discovery and Data Mining.

[13]  Angelika Storrer,et al.  Corpora of computer-mediated communication , 2008 .

[14]  Mark Kaplan John Langshaw Austin , 2010 .

[15]  Anke Lüdeling,et al.  Multi-level error annotation in learner corpora , 2005 .

[16]  Brendan T. O'Connor,et al.  Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.

[17]  Birger Andersson,et al.  Natural Language Processing and Information Systems , 2003, Lecture Notes in Computer Science.

[18]  Claire Cardie,et al.  Annotating Topics of Opinions , 2008, LREC.

[19]  Mohand Boughanem,et al.  Challenges for Sentence Level Opinion Detection in Blogs , 2009, 2009 Eighth IEEE/ACIS International Conference on Computer and Information Science.

[20]  Carlo Strapparava,et al.  Learning to identify emotions in text , 2008, SAC '08.

[21]  Kristin Luckhardt Stilanalysen zur Chat-Kommunikation: eine korpusgestützte Untersuchung am Beispiel eines medialen Chats , 2009 .

[22]  Simon Clematide,et al.  MLSA - A Multi-layered Reference Corpus for German Sentiment Analysis , 2012, LREC.

[23]  Eva-Maria Jakobs,et al.  Talking about mobile communication systems: verbal comments in the web as a source for acceptance research in large-scale technologies , 2010, 2010 IEEE International Professional Comunication Conference.

[24]  Theresa Wilson Fine-grained subjectivity and sentiment analysis: recognizing the intensity, polarity, and attitudes of private states , 2008 .

[25]  Andrés Montoyo,et al.  Semantic Approaches to Fine and Coarse-Grained Feature-Based Opinion Mining , 2009, NLDB.

[26]  Michael Beißwenger,et al.  Empirische Untersuchungen zur Produktion von Chat-Beiträgen , 2010 .

[27]  F. E. R. Pollard-Urquhart San sebastian, spain , 1902 .

[28]  MELANIE NEUNERDT,et al.  Detecting Irregularities in Blog Comment Language Affecting POS Tagging Accuracy , 2012 .

[29]  Vilmos Ágel,et al.  Chattern unter die Finger geschaut: Formulieren und Revidieren bei der schriftlichen Verbalisierung in synchroner internetbasierter Kommunikation , 2010 .

[30]  Gerhard Heyer,et al.  SentiWS - A Publicly Available German-language Resource for Sentiment Analysis , 2010, LREC.