Colloquial region discovery for retail products: discovery and application

Consumer behavior is of quintessential importance to the retail industry. Purchase trends for retail products are often affected by consumer location. Region-specific purchase trends, such as the French love wine, are typically supported by anecdotal evidence. An automated technique for the discovery of such trends from retail data has so far been absent. In this paper, we address the challenge of colloquial region discovery for retail products. More specifically, we target the problem of examining product sales across a chain of stores to extract the geographic regions that characterize a product. We introduce DICE, a diffusion-based technique to uncover all such regions for a given product, when they exist. In contrast to the current state of the art, DICE involves minimal usage of parameters and shows remarkable tolerance to noise that is often ubiquitous in retail data. We present results of experiments conducted on real datasets from a supermarket chain in France. Empirical evaluation and user studies establish that the proposed technique significantly outperforms the natural baseline and previous state-of-the-art approaches. Further, we study the impact of time and product category on DICE and discuss use cases for application of DICE in the retail world.

[1]  Bart Thomee,et al.  Uncovering locally characterizing regions within geotagged data , 2013, WWW.

[2]  Nicola Barbieri,et al.  Influence-Based Network-Oblivious Community Detection , 2013, 2013 IEEE 13th International Conference on Data Mining.

[3]  Nicola Barbieri,et al.  Influence Maximization with Viral Product Design , 2014, SDM.

[4]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[5]  Henriette Cramer,et al.  Describing and Understanding Neighborhood Characteristics through Online Social Media , 2015, WWW.

[6]  Tao Ye,et al.  A recursive random search algorithm for large-scale network parameter configuration , 2003, SIGMETRICS '03.

[7]  Mor Naaman,et al.  Towards automatic extraction of event and place semantics from flickr tags , 2007, SIGIR.

[8]  Mao Ye,et al.  Exploring social influence for recommendation: a generative model approach , 2012, SIGIR '12.

[9]  Éva Tardos,et al.  Influential Nodes in a Diffusion Model for Social Networks , 2005, ICALP.

[10]  Kristina Lerman,et al.  Learning boundaries of vague places from noisy annotations , 2011, GIS.

[11]  Chuan Zhou,et al.  Personalized influence maximization on social networks , 2013, CIKM.

[12]  Mor Naaman,et al.  Methods for extracting place semantics from Flickr tags , 2009, TWEB.

[13]  Kevin Lynch,et al.  The Image of the City , 1960 .

[14]  Laks V. S. Lakshmanan,et al.  Viral Marketing Meets Social Advertising: Ad Allocation with Minimum Regret , 2014, Proc. VLDB Endow..

[15]  Laks V. S. Lakshmanan,et al.  SIMPATH: An Efficient Algorithm for Influence Maximization under the Linear Threshold Model , 2011, 2011 IEEE 11th International Conference on Data Mining.

[16]  Jon M. Kleinberg,et al.  Mapping the world's photos , 2009, WWW '09.

[17]  Shivam Srivastava,et al.  Geo-Social Clustering of Places from Check-in Data , 2015, 2015 IEEE International Conference on Data Mining.

[18]  Michael F. Goodchild,et al.  Where's Downtown?: Behavioral Methods for Determining Referents of Vague Spatial Queries , 2003 .

[19]  Norman M. Sadeh,et al.  The Livehoods Project: Utilizing Social Media to Understand the Dynamics of a City , 2012, ICWSM.

[20]  Yifei Yuan,et al.  Scalable Influence Maximization in Social Networks under the Linear Threshold Model , 2010, 2010 IEEE International Conference on Data Mining.

[21]  Li Guo,et al.  UBLF: An Upper Bound Based Approach to Discover Influential Nodes in Social Networks , 2013, 2013 IEEE 13th International Conference on Data Mining.