Mining Consumer Dialog in Online Forums

Purpose – The paper's aim is to mine and analyze opinion formation on the basis of consumer dialogs in online forums.Design/methodology/approach – The study identifies opinions, communication relationships, and dialog acts of forum users using different text mining methods. Utilizing this data, social networks can be derived and analyzed to detect influential users and opinion tendencies. The approach is applied to sample online forums discussing the iPhone.Findings – Combining text mining and social network analysis enables the study of opinion formation and yields encouraging results. Out of the four methods employed for text mining, support vector machines performed best.Research limitations/implications – The data set applied here is fairly small. More threads on different products will be considered in future work to improve validation.Practical implications – The approach represents a valuable instrument for online market research. It enables companies to recognize opportunities and risks and to ini...

[1]  Paul Mutton,et al.  Inferring and visualizing social networks on Internet relay chat , 2004, Proceedings. Eighth International Conference on Information Visualisation, 2004. IV 2004..

[2]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[3]  Lada A. Adamic,et al.  Tracking information epidemics in blogspace , 2005, The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05).

[4]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[5]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[6]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[7]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[8]  Laks V. S. Lakshmanan,et al.  Discovering leaders from community actions , 2008, CIKM '08.

[9]  Philip S. Yu,et al.  Identifying the influential bloggers in a community , 2008, WSDM '08.

[10]  Hsi-Peng Lu,et al.  Understanding intention to continuously share information on weblogs , 2007, Internet Res..

[11]  T. Valente,et al.  Network models of the diffusion of innovations , 1995, Comput. Math. Organ. Theory.

[12]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[13]  Kuo-Ming Chu A study of members' helping behaviors in online community , 2009, Internet Res..

[14]  Mitsuru Ishizuka,et al.  Emerging topic tracking system in WWW , 2006, Knowl. Based Syst..

[15]  H. Schlosberg Three dimensions of emotion. , 1954, Psychological review.

[16]  Edward Ivanovic,et al.  Dialogue Act Tagging for Instant Messaging Chat Sessions , 2005, ACL.

[17]  William M. Pottenger,et al.  Posting Act Tagging Using Transformation-Based Learning , 2005, Foundations of Data Mining and knowledge Discovery.

[18]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[19]  Tyng-Ruey Chuang,et al.  Browsing newsgroups with a social network analyzer , 2002, Proceedings Sixth International Conference on Information Visualisation.

[20]  P. Lazarsfeld,et al.  6. Katz, E. Personal Influence: The Part Played by People in the Flow of Mass Communications , 1956 .

[21]  Cai-Nicolas Ziegler,et al.  Tracking Topic Evolution in News Environments , 2008, 2008 10th IEEE Conference on E-Commerce Technology and the Fifth IEEE Conference on Enterprise Computing, E-Commerce and E-Services.

[22]  Wesley Shu,et al.  The Perceived Benefits of 6-Degree-Separation Social Networks , 2011, Internet Res..

[23]  Sigi Goode,et al.  Individualist and collectivist factors affecting online repurchase intentions , 2010, Internet Res..

[24]  Per Linell,et al.  Acts in Discourse: From Monological Speech Acts to Dialogical Inter‐Acts , 1993 .

[25]  Matthew Hurst,et al.  Analyzing online discussion for marketing intelligence , 2005, WWW '05.

[26]  David Knoke,et al.  Social Network Analysis: Methods and Applications. , 1996 .

[27]  Gösta Ekman,et al.  Dimensions of emotion , 1955 .

[28]  J. Bortz,et al.  Verteilungsfreie Methoden in der Biostatistik , 1982 .

[29]  Freimut Bodendorf,et al.  Detecting opinion leaders and trends in online social networks , 2009, CIKM-SWSM.

[30]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[31]  Craig H. Martell,et al.  Lexical and Discourse Analysis of Online Chat Dialog , 2007, International Conference on Semantic Computing (ICSC 2007).

[32]  Mark A. Przybocki,et al.  The Automatic Content Extraction (ACE) Program – Tasks, Data, and Evaluation , 2004, LREC.

[33]  Yiming Yang,et al.  A re-examination of text categorization methods , 1999, SIGIR '99.

[34]  Vicenç Gómez,et al.  Statistical analysis of the social network and discussion threads in slashdot , 2008, WWW.

[35]  M. Randic Characterization of molecular branching , 1975 .

[36]  Bing Liu,et al.  Mining Opinions in Comparative Sentences , 2008, COLING.

[37]  Ingoo Han,et al.  The Different Effects of Online Consumer Reviews on Consumers' Purchase Intentions Depending on Trust in Online Shopping Mall: An Advertising Perspective , 2011, Internet Res..

[38]  Tong Zhang,et al.  Text Mining: Predictive Methods for Analyzing Unstructured Information , 2004 .

[39]  John Scott Social Network Analysis , 1988 .

[40]  Robert M. Schindler,et al.  Internet forums as influential sources of consumer information , 2001 .

[41]  Tim Oates,et al.  Modeling the Spread of Influence on the Blogosphere , 2006 .

[42]  P. Ekman Facial expression and emotion. , 1993, The American psychologist.

[43]  Janyce Wiebe,et al.  Just How Mad Are You? Finding Strong and Weak Opinion Clauses , 2004, AAAI.

[44]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[45]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[46]  Csr Young,et al.  How to Do Things With Words , 2009 .

[47]  C. Apte,et al.  Data mining with decision trees and decision rules , 1997, Future Gener. Comput. Syst..

[48]  P. Lang The emotion probe. Studies of motivation and attention. , 1995, The American psychologist.

[49]  Jeonghee Yi,et al.  Sentiment analysis: capturing favorability using natural language processing , 2003, K-CAP '03.

[50]  Hsin-Hsi Chen,et al.  Opinion Extraction, Summarization and Tracking in News and Blog Corpora , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[51]  Rosalind W. Picard Affective Computing , 1997 .

[52]  Wesley Shu,et al.  Continual use of microblogs , 2014, Behav. Inf. Technol..

[53]  Harold Garfinkel,et al.  On Formal Structures of Practical Actions , 2005 .

[54]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994 .

[55]  Yubo Chen,et al.  Online Consumer Review: Word-of-Mouth as a New Element of Marketing Communication Mix , 2004, Manag. Sci..

[56]  Danyel Fisher,et al.  Visualizing the Signatures of Social Roles in Online Discussion Groups , 2007, J. Soc. Struct..

[57]  Freimut Bodendorf,et al.  Opinion and Relationship Mining in Online Forums , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[58]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[59]  Yun Chi,et al.  Identifying opinion leaders in the blogosphere , 2007, CIKM '07.

[60]  Rui Ma,et al.  OOLAM: an opinion oriented link analysis model for influence persona discovery , 2011, WSDM '11.

[61]  P. Chatterjee,et al.  Online Reviews: Do Consumers Use Them? , 2006 .

[62]  Thomas W. Valente Network models of the diffusion of innovations , 1996, Comput. Math. Organ. Theory.

[63]  E. Rogers Diffusion of Innovations , 1962 .

[64]  P. Lazarsfeld,et al.  Personal Influence: The Part Played by People in the Flow of Mass Communications , 1956 .

[65]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[66]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[67]  Erin M. Steffes,et al.  Social ties and online word of mouth , 2009, Internet Res..

[68]  Danyel Fisher,et al.  Picturing Usenet: Mapping Computer-Mediated Collective Action , 2005, J. Comput. Mediat. Commun..