Global workload characterization of a large scale satellite image distribution system

Online content distribution systems, which store incredibly large amounts of information and provide service to large numbers of users, are becoming increasingly commonplace. To fulfill the wide range of requests sent by different users, these systems must ensure efficient handling of massive amount of data. To achieve this goal, the in-depth analysis and comprehensive understanding of user behaviors are critical. However, analyzing the behaviors of worldwide users with different needs is a very challenging task. This is especially true when historical user behaviors evolve over time or may be affected by unpredictable events. In this paper, we present a number of workload characterization techniques applied to one of the world's largest online satellite image distribution systems operated by the U.S. Geological Survey (USGS) and NASA.

[1]  Nimrod Megiddo,et al.  ARC: A Self-Tuning, Low Overhead Replacement Cache , 2003, FAST.

[2]  Xiao Qin,et al.  Performance Evaluation of Traditional Caching Policies on a Large System with Petabytes of Data , 2012, 2012 IEEE Seventh International Conference on Networking, Architecture, and Storage.

[3]  Mais Nijim,et al.  Case study of visualizing global user download patterns using Google Earth and NASA World Wind , 2012 .

[4]  Xuesong Zhang,et al.  Geospatial visualization of global satellite images with Vis-EROS , 2011, Environ. Model. Softw..

[5]  David J. DeWitt,et al.  An evaluation of buffer management strategies for relational database systems , 1986, Algorithmica.

[6]  Yuanyuan Zhou,et al.  The Multi-Queue Replacement Algorithm for Second Level Buffer Caches , 2001, USENIX Annual Technical Conference, General Track.

[7]  Richard B. Bunt,et al.  Disk cache replacement policies for network fileservers , 1993, [1993] Proceedings. The 13th International Conference on Distributed Computing Systems.

[8]  Divesh Srivastava,et al.  Semantic Data Caching and Replacement , 1996, VLDB.