Generating Realistic Online Auction Data

To combat online auction fraud, researchers have developed fraud detection and prevention methods. However, it is difficult to effectively evaluate these methods using commercial or synthetic auction data. For commercial data, it is not possible to accurately identify cases of fraud. For synthetic auction data, the conclusions drawn may not extend to the real world. The availability of realistic synthetic auction data, which models real auction data, will be invaluable for effective evaluation of fraud detection algorithms. We present an agent-based simulator that is capable of generating realistic English auction data. The agents and model are based on data collected from the TradeMe online auction site. We evaluate the generated data in two ways to show that it is similar to the TradeMe data. Evaluation of individual features show that correlation is greater than 0.9 for 8 of the 10 features, and evaluation using multiple features gives a median accuracy of 0.87.

[1]  Byungtae Lee,et al.  An Empirical Analysis of Fraud Detection in Online Auctions: Credit Card Phantom Transaction , 2007, 2007 40th Annual Hawaii International Conference on System Sciences (HICSS'07).

[2]  Christos Faloutsos,et al.  Detecting Fraudulent Personalities in Networks of Online Auctioneers , 2006, PKDD.

[3]  Bharat K. Bhargava,et al.  Counteracting shill bidding in online english auction , 2005, Int. J. Cooperative Inf. Syst..

[4]  Jaideep Srivastava,et al.  WEBKDD 2002 - Mining Web Data for Discovering Usage Patterns and Profiles , 2003, Lecture Notes in Computer Science.

[5]  Joachim M. Buhmann,et al.  Stability-Based Validation of Clustering Solutions , 2004, Neural Computation.

[6]  Jarrod Trevathan,et al.  Detecting Collusive Shill Bidding , 2007, Fourth International Conference on Information Technology (ITNG'07).

[7]  Michael J. North,et al.  Agent-based modeling and simulation , 2009, Proceedings of the 2009 Winter Simulation Conference (WSC).

[8]  Ashish Sureka,et al.  Mining eBay: Bidding Strategies and Shill Detection , 2002, WEBKDD.

[9]  Christos Faloutsos,et al.  Netprobe: a fast and scalable system for fraud detection in online auction networks , 2007, WWW '07.

[10]  Johannes Fürnkranz,et al.  Knowledge Discovery in Databases: PKDD 2006, 10th European Conference on Principles and Practice of Knowledge Discovery in Databases, Berlin, Germany, September 18-22, 2006, Proceedings , 2006, PKDD.

[11]  P. Monteiro,et al.  An Introduction to Auction Theory , 2004 .

[12]  Yousef Saad,et al.  Farthest Centroids Divisive Clustering , 2008, 2008 Seventh International Conference on Machine Learning and Applications.

[13]  Kenneth Steiglitz,et al.  Agent-based simulation of dynamic online auctions , 2000, 2000 Winter Simulation Conference Proceedings (Cat. No.00CH37165).

[14]  Alok Gupta,et al.  Simulating online Yankee auctions to optimize sellers revenue , 2001, Proceedings of the 34th Annual Hawaii International Conference on System Sciences.