ROCK: A Characterization Methodology for Web Multimedia Services Based on an Informational Hierarchy

Despite the increasing popularity of multimedia content in the Brazilian Web, we still do not understand much about the dynamics of how such content is consumed. The majority of previous characterization strategies focuses on data access, employing dimensions such as users and sessions and ignoring content-related information that may be potentially relevant. In this work we present ROCK, a novel characterization methodology of multimedia web services that has four parts: Request, Object, Content, and Knowledge. Besides quantitative characterization criteria, the methodology employs a segmentation-based analysis to grasp the semantic characteristics of the delivered content. We apply our methodology to real data from a distributor of corporate's multimedia content, demonstrating its applicability and usefulness.

[1]  Peter Parnes,et al.  Characterizing user access to videos on the World Wide Web , 1999, Electronic Imaging.

[2]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[3]  Alec Wolman,et al.  Measurement and Analysis of a Streaming Media Workload , 2001, USITS.

[4]  Zongpeng Li,et al.  Youtube traffic characterization: a view from the edge , 2007, IMC '07.

[5]  Pablo Rodriguez,et al.  I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system , 2007, IMC '07.

[6]  Daniel A. Menascé,et al.  Scaling for E-Business: Technologies, Models, Performance, and Capacity Planning , 2000 .

[7]  Virgílio A. F. Almeida,et al.  Characterization and Analysis of User Profiles in Online Video Sharing Systems , 2010, J. Inf. Data Manag..

[8]  Martin F. Arlitt,et al.  Web server workload characterization: the search for invariants , 1996, SIGMETRICS '96.

[9]  Virgílio A. F. Almeida,et al.  A hierarchical characterization of a live streaming media workload , 2006 .

[10]  Bruce M. Maggs,et al.  An analysis of live streaming workloads on the internet , 2004, IMC '04.

[11]  Ítalo S. Cunha,et al.  Analyzing client interactivity in streaming media , 2004, WWW '04.

[12]  Christopher Olston,et al.  Building a HighLevel Dataflow System on top of MapReduce: The Pig Experience , 2009, Proc. VLDB Endow..

[13]  Ian Witten,et al.  Data Mining , 2000 .

[14]  Mary K. Vernon,et al.  Analysis of educational media server workloads , 2001, NOSSDAV '01.

[15]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[16]  Virgílio A. F. Almeida,et al.  Traffic Characteristics and Communication Patterns in Blogosphere , 2006, ICWSM.