Towards an NLP-Based Topic Characterization of Social Relations

The unstructured text content of online communication artifacts is a salient source of information about social relationships. We investigate the utility of keywords extracted from the message body as a representation of the relationship's characteristics, which are reflected by the conversation topics to a certain extent. Keyword extraction is performed using standard natural language processing methods. Communication data and human assessments of the extracted keywords are obtained from Facebook users via a custom application. The overall positive quality assessment provides evidence that the keywords indeed convey relevant information about the relationship. This kind of representation may be of value for various computational tasks from the domain of social computing.

[1]  Philipp Koehn,et al.  Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[2]  Brendan T. O'Connor,et al.  Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.

[3]  Grace Hui Yang,et al.  Ontology generation for large email collections , 2008, DG.O.

[4]  Paul P. Maglio,et al.  Expertise identification using email communications , 2003, CIKM '03.

[5]  Jan Hauffa,et al.  Beyond FOAF: Challenges in Characterizing Social Relations , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[6]  Gordon W. Paynter,et al.  Human evaluation of Kea, an automatic keyphrasing system , 2001, JCDL '01.

[7]  Ken Barker,et al.  Using Noun Phrase Heads to Extract Document Keyphrases , 2000, Canadian Conference on AI.

[8]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[9]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[10]  Xin Li,et al.  Tag-based social interest discovery , 2008, WWW.

[11]  Vincent Ng,et al.  Conundrums in Unsupervised Keyphrase Extraction: Making Sense of the State-of-the-Art , 2010, COLING.

[12]  L. Troll,et al.  Handbook of Aging and the Family , 1995 .

[13]  Peter D. Turney Learning to Extract Keyphrases from Text , 2002, ArXiv.

[14]  Erin M Bryant,et al.  Relational Maintenance Strategies on Facebook , 2009 .

[15]  Abraham Kandel,et al.  DegExt - A Language-Independent Graph-Based Keyphrase Extractor , 2011, AWIC.

[16]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[17]  Timo Honkela,et al.  A Language-Independent Approach to Keyphrase Extraction and Evaluation , 2008, COLING.

[18]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email , 2007, J. Artif. Intell. Res..

[19]  Peter Mika,et al.  Ontologies are us: A unified model of social networks and semantics , 2005, J. Web Semant..

[20]  T. Antonucci,et al.  Convoys of social relations: Family and friendships within a life span context. , 1995 .

[21]  Peter D. Turney Coherent Keyphrase Extraction via Web Mining , 2003, IJCAI.

[22]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[23]  Mitsuru Ishizuka,et al.  Extracting Keyphrases to Represent Relations in Social Networks from Web , 2007, IJCAI.