Classifying online corporate reputation with machine learning: a study in the banking domain

User-generated social media comments can be a useful source of information for understanding online corporate reputation. However, the manual classification of these comments is challenging due to their high volume and unstructured nature. The purpose of this paper is to develop a classification framework and machine learning model to overcome these limitations.,The authors create a multi-dimensional classification framework for the online corporate reputation that includes six main dimensions synthesized from prior literature: quality, reliability, responsibility, successfulness, pleasantness and innovativeness. To evaluate the classification framework’s performance on real data, the authors retrieve 19,991 social media comments about two Finnish banks and use a convolutional neural network (CNN) to classify automatically the comments based on manually annotated training data.,After parameter optimization, the neural network achieves an accuracy between 52.7 and 65.2 percent on real-world data, which is reasonable given the high number of classes. The findings also indicate that prior work has not captured all the facets of online corporate reputation.,For practical purposes, the authors provide a comprehensive classification framework for online corporate reputation, which companies and organizations operating in various domains can use. Moreover, the authors demonstrate that using a limited amount of training data can yield a satisfactory multiclass classifier when using CNN.,This is the first attempt at automatically classifying online corporate reputation using an online-specific classification framework.

[1]  K. Lagus,et al.  Suomi24: muodonantoa aineistolle , 2016 .

[2]  Detlef Schoder,et al.  Listen to Your Customers: Insights into Brand Image Using Online Consumer-Generated Product Reviews , 2015, Int. J. Electron. Commer..

[3]  John M. T. Balmer,et al.  Managing Corporate Image and Corporate Reputation , 1998 .

[4]  José M. Molina López,et al.  Combining Machine Learning Techniques and Natural Language Processing to Infer Emotions Using Spanish Twitter Corpus , 2013, PAAMS.

[5]  Davide Aloini,et al.  Big Data-enabled Customer Relationship Management: A holistic approach , 2018, Inf. Process. Manag..

[6]  Björn W. Schuller,et al.  New Avenues in Opinion Mining and Sentiment Analysis , 2013, IEEE Intelligent Systems.

[7]  Mohand Boughanem,et al.  Using language models to improve opinion detection , 2018, Inf. Process. Manag..

[8]  Paola Barbara Floreddu,et al.  Inside your social media ring: How to optimize online corporate reputation , 2014 .

[9]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[10]  Jason Bell,et al.  Machine Learning: Hands-On for Developers and Technical Professionals , 2014 .

[11]  G. Tellis,et al.  Mining Marketing Meaning from Online Chatter: Strategic Brand Analysis of Big Data Using Latent Dirichlet Allocation , 2014 .

[12]  Bernard J. Jansen,et al.  Twitter power: Tweets as electronic word of mouth , 2009, J. Assoc. Inf. Sci. Technol..

[13]  Bernard J. Jansen,et al.  Business engagement on Twitter: a path analysis , 2011, Electron. Mark..

[14]  Terry Anthony Byrd,et al.  Big data analytics: Understanding its capabilities and potential benefits for healthcare organizations , 2018 .

[15]  G. Puth,et al.  Towards a Conceptual Model of the Relationship between Corporate Trust and Corporate Reputation , 2014 .

[16]  Juyoung Kang,et al.  Analyzing the discriminative attributes of products using text mining focused on cosmetic reviews , 2018, Inf. Process. Manag..

[17]  Elnaz Jahani Heravi,et al.  Guide to Convolutional Neural Networks , 2017 .

[18]  Shintaro Okazaki,et al.  How to mine brand Tweets: Procedural guidelines and pretest , 2014 .

[19]  Sangwon Lee,et al.  Understanding the majority opinion formation process in online environments: An exploratory approach to Facebook , 2018, Inf. Process. Manag..

[20]  Arnold Picot,et al.  Reflections on societal and business model transformation arising from digitization and big data analytics: A research agenda , 2015, J. Strateg. Inf. Syst..

[21]  Stuart Roper,et al.  A Corporate Character Scale to Assess Employee and Customer Views of Organization Reputation , 2004 .

[22]  Jun Zhao,et al.  Recurrent Convolutional Neural Networks for Text Classification , 2015, AAAI.

[23]  Leonard J. Ponzi,et al.  Stakeholder Tracking and Analysis: The RepTrak® System for Measuring Corporate Reputation , 2015 .

[24]  Ronen Feldman,et al.  Techniques and applications for sentiment analysis , 2013, CACM.

[25]  Anselm L. Strauss,et al.  Basics of qualitative research : techniques and procedures for developing grounded theory , 1998 .

[26]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[27]  R. Chun Corporate Reputation: Meaning and Measurement , 2005 .

[28]  Ding Xiao,et al.  Coupled matrix factorization and topic modeling for aspect mining , 2018, Inf. Process. Manag..

[29]  Rui Vinhas da Silva,et al.  Online and Offline Corporate Brand Images: Do They Differ? , 2007 .

[30]  R. Abratt,et al.  Corporate identity, corporate branding and corporate reputations: Reconciliation and integration , 2012 .

[31]  G. Dowling Defining and Measuring Corporate Reputations , 2016 .

[32]  Kristina Heinonen Consumer activity in social media: Managerial approaches to consumers' social media behavior , 2011 .

[33]  Sunil Erevelles,et al.  Big Data consumer analytics and the transformation of marketing , 2016 .

[34]  Andrew N. Smith,et al.  How Does Brand-related User-generated Content Differ across YouTube, Facebook, and Twitter? , 2012 .

[35]  R. S. Zaharna,et al.  Going for the jugular in public diplomacy: How adversarial publics using social media are challenging state legitimacy , 2016 .

[36]  Doug Terry,et al.  Replicated data consistency explained through baseball , 2013, CACM.

[37]  V. Dutot,et al.  Designing a Measurement Scale for E-Reputation , 2015, Corporate Reputation Review.

[38]  C. Fombrun,et al.  The Reputation QuotientSM: A multi-stakeholder measure of corporate reputation , 2000 .

[39]  Mohamed M. Mostafa,et al.  More than words: Social networks' text mining for consumer brand sentiments , 2013, Expert Syst. Appl..

[40]  David W. Versailles,et al.  CSR communications strategies through social media and influence on e-reputation , 2016 .

[41]  T. C. Melewar,et al.  Measuring reputation in global markets—A comparison of reputation measures’ convergent and criterion validities , 2013 .

[42]  Suhang Wang,et al.  Fake News Detection on Social Media: A Data Mining Perspective , 2017, SKDD.

[43]  Erin M. Steffes,et al.  Social ties and online word of mouth , 2009, Internet Res..

[44]  Bernard J. Jansen,et al.  Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media , 2018, ICWSM.

[45]  Yung-Ming Li,et al.  Deriving market intelligence from microblogs , 2013, Decis. Support Syst..

[46]  Alberto Costa,et al.  RBFOpt: an open-source library for black-box optimization with costly function evaluations , 2018, Mathematical Programming Computation.

[47]  Tapio Salakoski,et al.  Building the essential resources for Finnish: the Turku Dependency Treebank , 2013, Language Resources and Evaluation.

[48]  William Y. Degbey,et al.  Social Media Espionage — A Strategic Grid , 2015 .

[49]  Jacob Goldenberg,et al.  Mine Your Own Business: Market-Structure Surveillance Through Text Mining , 2012, Mark. Sci..

[50]  Leslie de Chernatony,et al.  Dimensionalising on‐ and offline brands' composite equity , 2004 .

[51]  Imoh Antai,et al.  Which UGC features drive web purchase intent? A spike-and-slab Bayesian Variable Selection Approach , 2016, Internet Res..

[52]  Gianfranco Walsh,et al.  Customer-based corporate reputation of a service firm: scale development and validation , 2007 .

[53]  Ana Isabel Canhoto,et al.  ‘We (don’t) know how you feel’ – a comparative study of automated vs. manual analysis of social media conversations , 2015 .

[54]  J. Aaker,et al.  Dimensions of Brand Personality , 1997 .

[55]  Vicenta Sierra,et al.  How does the Perceived Ethicality of Corporate Services Brands Influence Loyalty and Positive Word-of-Mouth? Analyzing the Roles of Empathy, Affective Commitment, and Perceived Quality , 2018 .

[56]  H. Boateng,et al.  Consumers’ attitude towards social media advertising and their behavioural response: The moderating role of corporate reputation , 2015 .

[57]  Josef Steinberger,et al.  Supervised sentiment analysis in Czech social media , 2014, Inf. Process. Manag..

[58]  Filippo Menczer,et al.  BotOrNot: A System to Evaluate Social Bots , 2016, WWW.

[59]  Natalie Lee-San Pang,et al.  Responding to the haze: information cues and incivility in the online small world , 2014, Inf. Res..

[60]  P. Argenti,et al.  Reputation and the Corporate Brand , 2003 .

[61]  Rosa M. Carro,et al.  Sentiment analysis in Facebook and its application to e-learning , 2014, Comput. Hum. Behav..

[62]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[63]  H. Russell Bernard,et al.  Analyzing Qualitative Data: Systematic Approaches , 2009 .

[64]  Bernard J. Jansen,et al.  From 2, 772 segments to five personas: Summarizing a diverse online audience by generating culturally adapted personas , 2018, First Monday.

[65]  Ian Ruthven,et al.  The language of information need: Differentiating conscious and formalized information needs , 2019, Inf. Process. Manag..