Extraction of High-Level Semantically Rich Features from Natural Language Text

To represent a text, Natural Language Processing applications are to determine and extract from the text features that are essential for the particular task. Although high-level features seem to be promising for many tasks, they were rarely addressed, since the extraction of those features is a big challenge. This thesis aims at extracting high-level semantically rich features from natural language text. The algorithms we will propose will enable development of novel applications in different areas.

[1]  Jörg Kindermann,et al.  Authorship Attribution with Support Vector Machines , 2003, Applied Intelligence.

[2]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[3]  Fuchun Peng,et al.  N-GRAM-BASED AUTHOR PROFILES FOR AUTHORSHIP ATTRIBUTION , 2003 .

[4]  Carlo Strapparava,et al.  SemEval-2007 Task 14: Affective Text , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[5]  Dan Fass,et al.  met*: A Method for Discriminating Metonymy and Metaphor by Computer , 1991, CL.

[6]  Anna Korhonen,et al.  Metaphor Identification Using Verb and Noun Clustering , 2010, COLING.

[7]  P. Niedenthal,et al.  Psychology of emotion: Interpersonal, experiential, and cognitive approaches. , 2006 .

[8]  Shlomo Argamon,et al.  Automatically profiling the author of an anonymous text , 2009, CACM.

[9]  Sudeshna Sarkar,et al.  Stylometric Analysis of Bloggers' Age and Gender , 2009, ICWSM.

[10]  Daria Bogdanova,et al.  A Framework for Figurative Language Detection Based on Sense Differentiation , 2010, ACL.

[11]  Paolo Rosso,et al.  Personal Sense and Idiolect: Combining Authorship Attribution and Opinion Analysis , 2010, LREC.

[12]  John Bryant,et al.  Catching Metaphors , 2006 .

[13]  D. Holmes,et al.  The Federalist Revisited: New Directions in Authorship Attribution , 1995 .

[14]  E. Stamatatos Ensemble-based Author Identification Using Character N-grams , 2006 .

[15]  R. Dunn,et al.  Survey of research on learning styles , 1989 .

[16]  George M. Mohay,et al.  Mining e-mail content for author identification forensics , 2001, SGMD.

[17]  Anoop Sarkar,et al.  A Clustering Approach for Nearly Unsupervised Recognition of Nonliteral Language , 2006, EACL.

[18]  Shlomo Argamon,et al.  Effects of Age and Gender on Blogging , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[19]  S. Straja,et al.  Student learning preferences and teaching implications. , 2004, Journal of dental education.

[20]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[21]  Shlomo Argamon,et al.  Automatically Categorizing Written Texts by Author Gender , 2002, Lit. Linguistic Comput..

[22]  R. Snyder The Relationship between Learning Styles/ Multiple Intelligences and Academic Achievement of High School Students , 1999 .

[23]  O. John The "Big Five" factor taxonomy: Dimensions of personality in the natural language and in questionnaires. , 1990 .

[24]  Hugo Liu,et al.  Of Men, Women, and Computers: Data-driven Gender Modeling for Improved User Interfaces , 2022 .

[25]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[26]  George A. Vouros,et al.  Sentiment Analysis of Figurative Language using a Word Sense Disambiguation Approach , 2009, RANLP.

[27]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.