Cost exploration of data sharings in the cloud

Enabling data sharing among mobile apps hosted in the same cloud infrastructure can provide a competitive advantage to the mobile apps by giving them access to rich information as well as increasing the revenue for the cloud provider. We introduce a costing tool that allows application owners (i.e., consumers) and the cloud service provider to assess the cost of a desired data sharing. The costing tool enables the consumers to effectively explore the cost space by choosing between alternative configurations of varying data qualities, specified by the staleness and the accuracy of the data sharing. In other words, staleness and accuracy requirements on the data sharing are used as levers for controlling costs. These capabilities are implemented in a What-if analysis tool, which has been integrated with a large data-sharing platform. We conducted extensive experiments on the integrated platform with a sharing ecosystem created around Twitter data and show the effectiveness of the results produced by the What-if tool.

[1]  Kaisa Miettinen,et al.  Nonlinear multiobjective optimization , 1998, International series in operations research and management science.

[2]  Rajeev Motwani,et al.  On random sampling over joins , 1999, SIGMOD '99.

[3]  Dean Daniels,et al.  Query Processing in R* , 1985, Query Processing in Database Systems.

[4]  Kenneth A. Ross,et al.  Materialized view maintenance and integrity constraint checking: trading space for time , 1996, SIGMOD '96.

[5]  Verena Kantere,et al.  Predicting cost amortization for query services , 2011, SIGMOD '11.

[6]  Surajit Chaudhuri,et al.  An overview of query optimization in relational systems , 1998, PODS.

[7]  Timos K. Sellis,et al.  Multiple-query optimization , 1988, TODS.

[8]  P. K. Kannan,et al.  Pricing of Information Products on Online Servers: Issues, Models, and Analysis , 2002, Manag. Sci..

[9]  Laurent d'Orazio,et al.  Cost models for view materialization in the cloud , 2012, EDBT-ICDT '12.

[10]  Dan Suciu,et al.  A Discussion on Pricing Relational Data , 2013, In Search of Elegance in the Theory and Practice of Computation.

[11]  Hakan Hacigümüs,et al.  COSMOS: A Platform for Seamless Mobile Services in the Cloud , 2011, 2011 IEEE 12th International Conference on Mobile Data Management.

[12]  Val Tannen,et al.  ORCHESTRA: facilitating collaborative data sharing , 2007, SIGMOD '07.

[13]  Dan Suciu,et al.  How to Price Shared Optimizations in the Cloud , 2012, Proc. VLDB Endow..

[14]  Krithi Ramamritham,et al.  Materialized view selection and maintenance using multi-query optimization , 2000, SIGMOD '01.

[15]  Michael J. Franklin,et al.  Cache investment: integrating query optimization and distributed data placement , 2000, TODS.

[16]  Dan Suciu,et al.  Data Markets in the Cloud: An Opportunity for the Database Community , 2011, Proc. VLDB Endow..

[17]  Alfons Kemper,et al.  Extensibility and Data Sharing in evolving multi-tenant databases , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[18]  Jennifer Widom,et al.  View maintenance in a warehousing environment , 1995, SIGMOD '95.

[19]  Ambuj K. Singh,et al.  Efficient view maintenance at data warehouses , 1997, SIGMOD '97.

[20]  Bruce G. Lindsay,et al.  How to roll a join: asynchronous incremental view maintenance , 2000, SIGMOD '00.

[21]  Yue Zhuge,et al.  The Strobe algorithms for multi-source warehouse consistency , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[22]  Bingsheng He,et al.  Distributed Systems Meet Economics: Pricing in the Cloud , 2010, HotCloud.

[23]  A. Messac,et al.  The normalized normal constraint method for generating the Pareto frontier , 2003 .

[24]  Alexandros Labrinidis,et al.  Reduction of Materialized View Staleness Using Online Updates , 1998 .

[25]  B Praveen Kumar,et al.  Mariposa a Wide-Area Distributed Database System , 2010, ICCA 2010.