On Generating Dominators of Customer Preferences

Manufacturing decisions on how to design new products have tremendous impact on the profitability of the manufacturer. This problem has recently attracted extensive research interests and motivated highly productive activities in developing the microeconomic framework for data mining and finding skyline objects in high-dimensional data. In this paper, we investigate a basic designing problem: designing products that satisfy the preferences of all customers. We formalize this problem as generating dominators (products) that dominate the preference dataset. The problem is naturally related to the microeconomic framework of data mining and the problem of finding skyline objects. The designing problem can be optimized from either the manufacturer’s perspective or the customer’s perspective. Our framework integrates these two perspectives and achieves optimization in a single effort. We show that this problem is NP-complete and study its computational properties. A deterministic greedy algorithm and a randomized greedy algorithm are developed. Extensive experimental evaluation on both real and simulated datasets demonstrates the effectiveness and efficiency of the proposed algorithms.

[1]  S. Khuller,et al.  Approximation algorithms for partial covering problems , 2001, J. Algorithms.

[2]  Anthony K. H. Tung,et al.  Finding k-dominant skylines in high dimensional space , 2006, SIGMOD Conference.

[3]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[4]  Werner Kießling,et al.  Foundations of Preferences in Database Systems , 2002, VLDB.

[5]  Anthony K. H. Tung,et al.  Microeconomic analysis using dominant relationship analysis , 2010, Knowledge and Information Systems.

[6]  Anthony K. H. Tung,et al.  Categorical skylines for streaming data , 2008, SIGMOD Conference.

[7]  Raymond Chi-Wing Wong,et al.  Finding top-k profitable products , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[8]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[9]  Jon M. Kleinberg,et al.  A Microeconomic View of Data Mining , 1998, Data Mining and Knowledge Discovery.

[10]  Raymond Chi-Wing Wong,et al.  Creating Competitive Products , 2009, Proc. VLDB Endow..

[11]  Muhammed Miah Most Popular Package Design , 2012 .

[12]  Anthony K. H. Tung,et al.  Understanding the meaning of a shifted sky: a general framework on extending skyline query , 2010, The VLDB Journal.

[13]  Jon M. Kleinberg,et al.  Segmentation problems , 2004, JACM.

[14]  Rong Ge,et al.  A microeconomic data mining problem: customer-oriented catalog segmentation , 2004, KDD.

[15]  Xuemin Lin,et al.  Selecting Stars: The k Most Representative Skyline Operator , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[16]  Jan Chomicki,et al.  Discovering Relative Importance of Skyline Attributes , 2009, Proc. VLDB Endow..

[17]  Anthony K. H. Tung,et al.  Call to order: a hierarchical browsing approach to eliciting users' preference , 2010, SIGMOD Conference.

[18]  Anthony K. H. Tung,et al.  On domination game analysis for microeconomic data mining , 2009, TKDD.