Which Brand Purchasers Are Lost to Counterfeiters? An Application of New Data Fusion Approaches

Firms and organizations often need to collect and analyze sensitive consumer data. A common problem encountered in such evidence-based research is that they cannot collect all essential information from one sample, and they may need to link nonoverlapping data items across independent samples. We propose an automated nonparametric data fusion solution to this problem. The proposed methods are not restricted to specific types of variables and distributions. They require no prior knowledge about how data at hand may behave differently from standard theoretical distributions, and they automate the process of generating suitable distributions that match data, therefore making our methods particularly useful for linking data with complex distributional shapes. In addition, these methods have strong theoretical support; permit highly efficient direct fusion to relate a mixture of continuous, semicontinuous, and discrete variables; and enable nonparametric identification of entire distributions of fusion variables, including higher moments and tail percentiles. These novel and promising features overcome important limitations of existing methods and have the potential to increase fusion effectiveness. We apply the proposed methods to overcome data constraints in a study of counterfeiting. By combining data sets from multiple sources, data fusion provides a feasible approach to studying the relationship between counterfeit purchases and various marketing elements, such as consumers' purchase motivations, behaviors, and attitudes; brand marketing channels; promotions; and advertisements. Therefore, data fusion sheds light on counterfeit purchase behaviors and suggests ways to counter counterfeits that would not be available if these data sets were analyzed separately.

[1]  S. Duane,et al.  Hybrid Monte Carlo , 1987 .

[2]  Susanne Rässler,et al.  Statistical Matching: "A Frequentist Theory, Practical Applications, And Alternative Bayesian Approaches" , 2002 .

[3]  Yi Qian,et al.  No Customer Left Behind: A Distribution-Free Bayesian Approach to Accounting for Missing Xs in Marketing Models , 2011, Mark. Sci..

[4]  Carl F. Mela,et al.  Choice Models and Customer Relationship Management , 2005 .

[5]  Donald B. Rubin,et al.  Characterizing the Estimation of Parameters in Incomplete-Data Problems , 1974 .

[6]  Yi Qian,et al.  Impacts of Entry by Counterfeiters , 2008 .

[7]  Yi Qian,et al.  Counterfeiters: Foes or Friends? How Counterfeits Affect Sales by Product Quality Tier , 2014, Manag. Sci..

[8]  Maurizio Dapor Monte Carlo Strategies , 2020, Transport of Energetic Electrons in Solids.

[9]  D. Rubin,et al.  Multiple Imputation for Nonresponse in Surveys , 1989 .

[10]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[11]  H. Y. Chen Nonparametric and Semiparametric Models for Missing Covariates in Parametric Regression , 2004 .

[12]  Tim Hesterberg,et al.  Monte Carlo Strategies in Scientific Computing , 2002, Technometrics.

[13]  Jerry Nedelman,et al.  Book review: “Bayesian Data Analysis,” Second Edition by A. Gelman, J.B. Carlin, H.S. Stern, and D.B. Rubin Chapman & Hall/CRC, 2004 , 2005, Comput. Stat..

[14]  Robert C. Blattberg,et al.  Database Marketing: Analyzing and Managing Customers , 2008 .

[15]  Yi Qian,et al.  Counterfeiters: Foes or Friends , 2010 .

[16]  R. Winer A Framework for Customer Relationship Management , 2001 .

[17]  Peter E. Rossi,et al.  A Direct Approach to Data Fusion , 2004 .

[18]  J. M. Sanz-Serna,et al.  Optimal tuning of the hybrid Monte Carlo algorithm , 2010, 1001.4460.

[19]  Statistical Matching , 2004 .

[20]  Michel Wedel,et al.  Factor Analysis and Missing Data , 2000 .

[21]  Fred M. Feinberg,et al.  Reality Check: Combining Choice Experiments with Market Data to Estimate the Importance of Product Attributes , 2010, Manag. Sci..

[22]  Eija Pouta,et al.  Attitude and belief questions as a source of context effect in a contingent valuation survey , 2004 .

[23]  Wagner A. Kamakura,et al.  Statistical Data Fusion for Cross-Tabulation , 1997 .