Demand driven store site selection via multiple spatial-temporal data

Choosing a good location when opening a new store is crucial for the future success of a business. Traditional methods include offline manual survey, analytic models based on census data, which are either unable to adapt to the dynamic market or very time consuming. The rapid increase of the availability of big data from various types of mobile devices, such as online query data and offline positioning data, provides us with the possibility to develop automatic and accurate data- driven prediction models for business store site selection. In this paper, we propose a Demand Driven Store Site Selection (DD3S) framework for business store site selection by mining search query data from Baidu Maps. DD3S first detects the spatial-temporal distributions of customer demands on different business services via query data from Baidu Maps, the largest online map search engine in China, and detects the gaps between demand and supply. Then we determine candidate locations via clustering such gaps. In the final stage, we solve the location optimization problem by predicting and ranking the number of customers. We not only deploy supervised regression models to predict the number of customers, but also use learning-to-rank model to directly rank the locations. We evaluate our framework on various types of businesses in real-world cases, and the experiment results demonstrate the effectiveness of our methods. DD3S as the core function for store site selection has already been implemented as a core component of our business analytics platform and could be potentially used by chain store merchants on Baidu Nuomi.

[1]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[2]  Adee Athiyaman Location Decision Making: The Case of Retail Service Development in a Closed Population , 2011 .

[3]  Alexander Kubis,et al.  Analysis of Location of Large-area Shopping Centres. A Probabilistic Gravity Model for the Halle–Leipzig Area , 2007 .

[4]  T. Hernández,et al.  The art and science of retail location decisions , 2000 .

[5]  Pablo Jensen Network-based predictions of retail store commercial categories and optimal locations. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[6]  J. Reynolds,et al.  Leveraging locational insights within retail store development? Assessing the use of location planners’ knowledge in retail marketing , 2012 .

[7]  Oded Berman,et al.  The generalized maximal covering location problem , 2002, Comput. Oper. Res..

[8]  Hui Xiong,et al.  Exploiting geographic dependencies for real estate appraisal: a mutual perspective of ranking and clustering , 2014, KDD.

[9]  Farnoush Banaei Kashani,et al.  Optimal network location queries , 2010, GIS '10.

[10]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Cheng Long,et al.  Efficient algorithms for optimal location queries in road networks , 2014, SIGMOD Conference.

[12]  Yuhong Li,et al.  Location selection for ambulance stations: a data-driven approach , 2015, SIGSPATIAL/GIS.

[13]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[14]  Christopher J. C. Burges,et al.  From RankNet to LambdaRank to LambdaMART: An Overview , 2010 .

[15]  Haishan Wu,et al.  Predict User In-World Activity via Integration of Map Query and Mobility Trace , 2015 .

[16]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[17]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[18]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[19]  Jun Zhang,et al.  Trade area analysis using user generated mobile location data , 2013, WWW '13.

[20]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[21]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[22]  Jian Li,et al.  Consensus answers for queries over probabilistic databases , 2008, PODS.

[23]  Cecilia Mascolo,et al.  Geo-spotting: mining online location-based services for optimal retail store placement , 2013, KDD.

[24]  Hui Xiong,et al.  Sparse Real Estate Ranking with Online User Reviews and Offline Moving Behaviors , 2014, 2014 IEEE International Conference on Data Mining.

[25]  Feifei Li,et al.  Optimal location queries in road network databases , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[26]  D. Rogers Retail location analysis in practice , 1992 .