Classification of Dreams Using Machine Learning

We describe a project undertaken by an interdisciplinary team of researchers in sleep and in and machine learning. The goal is sentiment extraction from a corpus containing short textual descriptions of dreams. Dreams are categorized in a four-level scale of affections. The approach is based on a novel representation, taking into account the leading themes of the dream and the sequential unfolding of associated affective feelings during the dream. The dream representation is based on three combined parts, two of which are automatically produced from the description of the dream. The first part consists of co-occurrence vectors, which ---unlike the standard Bag-of-words model ---capture non-local relationships between meanings of word in a corpus. The second part introduces the dynamic representation that captures the change in affections throughout the progress of the dream. The third part is the self-reported assessment of the dream by the dreamer according to eight given attributes. The three representations are subject to aggressive feature selection. Using an ensemble of classifiers and the combined 3-partite representation, we have achieved 64% accuracy, which is in the range of human experts' consensus in that domain.

[1]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[2]  Stan Matwin,et al.  A new algorithm for reducing the workload of experts in performing systematic reviews , 2010, J. Am. Medical Informatics Assoc..

[3]  Prem Melville,et al.  Sentiment analysis of blogs by combining lexical knowledge with text classification , 2009, KDD.

[4]  Claire Cardie,et al.  Learning with Compositional Semantics as Structural Inference for Subsentential Sentiment Analysis , 2008, EMNLP.

[5]  Ian Witten,et al.  Data Mining , 2000 .

[6]  Ernest Hartmann,et al.  Dreams And Nightmares: The New Theory on the Origin and Meaning of Dreams , 1998 .

[7]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[8]  Philip J. Stone,et al.  The general inquirer: A computer system for content analysis and retrieval based on the sentence as a unit of information , 2007 .

[9]  T. Nielsen,et al.  What are the memory sources of dreaming? , 2005, Nature.

[10]  J. De Koninck,et al.  Stress and Coping in the Waking and Dreaming States During an Examination Period , 2002 .

[11]  Ted Pedersen,et al.  Knowledge Lean Word-Sense Disambiguation , 1997, AAAI/IAAI.

[12]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[13]  Stan Matwin,et al.  Offensive Language Detection Using Multi-level Classification , 2010, Canadian Conference on AI.

[14]  Hinrich Schütze,et al.  Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[15]  G. William Domhoff,et al.  The scientific study of dreams : neural networks, cognitive development, and content analysis , 2003 .

[16]  Ted Pedersen,et al.  Unsupervised Discrimination of Person Names in Web Contexts , 2009, CICLing.

[17]  Peter D. Turney,et al.  Automatic Dream Sentiment Analysis , 2006 .

[18]  Pierre Mercier,et al.  Emotions in the Diary and REM Dreams of Young and Late Adulthood Women and Their Relation to Life Satisfaction. , 2005 .