Automated feature extraction from social media for systematic lead user identification

ABSTRACT Manufacturers strive to rapidly develop novel products and offer solutions that meet the emerging customer needs. The Lead User Method, emerging from studies on sources of innovation by the scientific community, offers a validated approach to identify users with innovation ideas to support rapid and successful new product development process. The approach has been more recently applied on online communities, where collection and analysis of rich user data are performed by expert practitioners. In this paper, feature extraction techniques are outlined, that enable automated classification and identification of lead users that are present in online communities. The authors describe two case studies to construct a classification model that is then used to identify online lead users for confectionery products, and to evaluate the outlined feature extraction techniques. The presented research points to opportunities in automated identification within the lead user approach that further reduce the resource and time costs.

[1]  Cornelius Herstatt,et al.  The Lead User Method: An Outline of Empirical Findings and Issues for Future Research , 2004 .

[2]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[3]  Reinhard Prügl,et al.  Extending Lead User Theory: Antecedents and Consequences of Consumers' Lead Userness , 2007 .

[4]  Lars Frederiksen,et al.  Why Do Users Contribute to Firm-Hosted User Communities? The Case of Computer-Controlled Music Instruments , 2006, Organ. Sci..

[5]  J. Füller,et al.  Innovation creation by online basketball communities , 2007 .

[6]  Sybille V. Reichart Kundenorientierung im Innovationsprozess , 2002 .

[7]  Christopher Lettl,et al.  The Social Network Position of Lead Users , 2016 .

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  Kerstin Denecke,et al.  Using SentiWordNet for multilingual sentiment analysis , 2008, 2008 IEEE 24th International Conference on Data Engineering Workshop.

[10]  Sonali K. Shah,et al.  How Communities Support Innovative Activities: An Exploration of Assistance and Sharing Among End-Users , 2003 .

[11]  Johann Füller,et al.  Community based innovation: How to integrate members of virtual communities into new product development , 2006, Electron. Commer. Res..

[12]  Daniel A. Levinthal,et al.  Strategy making in novel and complex worlds: the power of analogy , 2005 .

[13]  Gert Sabidussi,et al.  The centrality index of a graph , 1966 .

[14]  J. Anthonisse The rush in a directed graph , 1971 .

[15]  S. Kotha Mass Customization: The New Frontier in Business Competition , 1992 .

[16]  Erik L. Olson,et al.  Implementing the lead user method in a high technology firm: A longitudinal study of intentions versus actions , 2001 .

[17]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[18]  Frank-Martin Belz,et al.  Netnography as a Method of Lead User Identification , 2010 .

[19]  Robert Tibshirani,et al.  Classification by Pairwise Coupling , 1997, NIPS.

[20]  F. Piller,et al.  Leading Edge Users and Latent Consumer Needs in Electromobility: Findings from a Nethnographic Study of User Innovation in High-Tech Online Communities , 2014, SSRN Electronic Journal.

[21]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[22]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[23]  Robert V. Kozinets,et al.  Click to Connect: Netnography and Tribal Advertising , 2006, Journal of Advertising Research.

[24]  Rok Sosic,et al.  SNAP , 2016, ACM Trans. Intell. Syst. Technol..

[25]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[26]  M. Rocío Martínez-Torres,et al.  Analysis of open innovation communities from the perspective of social network analysis , 2014, Technol. Anal. Strateg. Manag..

[27]  R. Kozinets On Netnography: Initial Reflections on Consumer Research Investigations of Cyberculture , 1998 .

[28]  Volker Bilgram,et al.  User-Centric Innovations in New Product Development - Systematic Identification of Lead Users Harnes , 2008 .

[29]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[30]  Karim Lakhani,et al.  Broadcast Search in Problem Solving: Attracting Solutions From the Periphery , 2006, 2006 Technology Management for the Global Future - PICMET 2006 Conference.

[31]  Alex Bavelas,et al.  Communication Patterns in Task‐Oriented Groups , 1950 .

[32]  Din J. Wasem,et al.  Mining of Massive Datasets , 2014 .

[33]  Andrea Hemetsberger,et al.  Fostering Cooperation on the Internet: Social Exchange Processes in Innovative Virtual Consumer Communities , 2002 .

[34]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[35]  Joost Duflou,et al.  SYSTEMATIC ONLINE LEAD USER IDENTIFICATION - CASE STUDY FOR ELECTRICAL INSTALLATIONS , 2015 .

[36]  Jon W Beard,et al.  The Management of Technological Innovation: An International and Strategic Approach , 2002 .

[37]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[38]  Marko Robnik-Sikonja,et al.  An adaptation of Relief for attribute estimation in regression , 1997, ICML.

[39]  S. Sudman Efficient Screening Methods for the Sampling of Geographically Clustered Special Populations , 1985 .

[40]  V. Latora,et al.  A measure of centrality based on network efficiency , 2004, cond-mat/0402050.

[41]  Karim Lakhani,et al.  Student Paper Award Winner; "Broadcast Search in Problem Solving: Attracting Solutions From the Periphery" , 2006, 2006 Technology Management for the Global Future - PICMET 2006 Conference.

[42]  Christopher Lettl,et al.  A Social Network Perspective of Lead Users and Creativity: An Empirical Study among Children , 2008 .

[43]  Rod Coombs,et al.  The Management of Technological Innovation , 2002 .

[44]  S. Sathiya Keerthi,et al.  Improvements to Platt's SMO Algorithm for SVM Classifier Design , 2001, Neural Computation.

[45]  Joost Duflou,et al.  Lead User Identification through Twitter: Case Study for Camera Lens Products , 2014 .

[46]  R. Kozinets E-tribalized Marketing?: The Strategic Implications of Virtual Communities of Consumption , 1999 .

[47]  E. von Hippel,et al.  Sources of Innovation , 2016 .

[48]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[49]  John Scott Social Network Analysis , 1988 .

[50]  F. J. Arenas-Márqueza,et al.  Electronic word-of-mouth communities from the perspective of social network analysis , 2014 .

[51]  Yicheng Zhang,et al.  Identifying influential nodes in complex networks , 2012 .

[52]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[53]  R. Weisberg,et al.  Following the wrong footsteps: fixation effects of pictorial examples in a design problem-solving task. , 2005, Journal of experimental psychology. Learning, memory, and cognition.

[54]  T. J. Allen,et al.  Positive and Negative Biasing Sets: The Effects of Prior Experience on Research Performance , 2015 .

[55]  Sonali Shah Sources and Patterns of Innovation in a Consumer Products Field: Innovations in Sporting Equipment , 2000 .