Mining knowledge-sharing sites for viral marketing

Viral marketing takes advantage of networks of influence among customers to inexpensively achieve large changes in behavior. Our research seeks to put it on a firmer footing by mining these networks from data, building probabilistic models of them, and using these models to choose the best viral marketing plan. Knowledge-sharing sites, where customers review products and advise each other, are a fertile source for this type of data mining. In this paper we extend our previous techniques, achieving a large reduction in computational cost, and apply them to data from a knowledge-sharing site. We optimize the amount of marketing funds spent on each customer, rather than just making a binary decision on whether to market to him. We take into account the fact that knowledge of the network is partial, and that gathering that knowledge can itself have a cost. Our results show the robustness and utility of our approach.
