A Reinforcement Learning Approach to Optimize Discount and Reputation Tradeoffs in E-commerce Systems

Feedback-based reputation systems are widely deployed in e-commerce systems. Evidence shows that earning a reputable label (for sellers in such systems) may take a substantial amount of time, and t...
