Profiling and Predicting User Activity on a Home Network

This paper reports a study on the characterization of in-home Internet activity behavior based on application usage logs. We collected online activity data from 86 Belgium households for 60 days. We analyzed the activity traces to gain insights on the temporal traffic distribution, interaction regularity, and activity correlations. This analysis is then used to develop a generic method to segment households into designated groups showing similar behavioral profiles. Our technique combines interaction frequencies and regularities across activities for segmentation, and is able to reveal interesting time-slotted profile for each segment. These profiles aim to show the strength of routine behaviors in Internet usage, based on which we present a novel algorithm to predict future Internet activities of a household. Our algorithm shows that 60% of the households online activities can be predicted accurately 70% of times.

[1]  Ravi Kumar,et al.  A characterization of online browsing behavior , 2010, WWW '10.

[2]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[3]  Sharad Goel,et al.  Who Does What on the Web: A Large-Scale Study of Browsing Behavior , 2012, ICWSM.

[4]  M. Falch Handbook of Research on Global Diffusion of Broadband Data Transmission , 2008 .

[5]  Christos Faloutsos,et al.  Identifying Web Browsing Trends and Patterns , 2001, Computer.

[6]  C. Tappert,et al.  A Survey of Binary Similarity and Distance Measures , 2010 .

[7]  Thad Starner,et al.  Using GPS to learn significant locations and predict movement across multiple users , 2003, Personal and Ubiquitous Computing.

[8]  David Poole,et al.  Predicting Future User Actions by Observing Unmodified Applications , 2000, AAAI/IAAI.

[9]  Eszter Hargittai,et al.  Whose Space? Differences Among Users and Non-Users of Social Network Sites , 2007, J. Comput. Mediat. Commun..

[10]  Rebecca E. Grinter,et al.  Why is my internet slow?: making network speeds visible , 2011, CHI.

[11]  David W. McDonald,et al.  Social matching: A framework and research agenda , 2005, TCHI.

[12]  Eric Horvitz,et al.  Predestination: Inferring Destinations from Partial Trajectories , 2006, UbiComp.

[13]  Anna Monreale,et al.  WhereNext: a location predictor on trajectory pattern mining , 2009, KDD.

[14]  Robert R. Sokal,et al.  A statistical method for evaluating systematic relationships , 1958 .

[15]  Robert E. Kraut,et al.  The HomeNet field trial of residential Internet services , 1996, CACM.

[16]  Matthew Richardson,et al.  Yes, there is a correlation: - from social networks to personal behavior on the web , 2008, WWW.

[17]  Susan T. Dumais,et al.  Resonance on the web: web dynamics and revisitation patterns , 2009, CHI.

[18]  Xing Xie,et al.  Mining user similarity based on location history , 2008, GIS '08.

[19]  Haym Hirsh,et al.  Learning to Predict Rare Events in Categorical Time-Series Data , 1998 .

[20]  Thomas Beauvisage,et al.  Computer usage in daily life , 2009, CHI.

[21]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[22]  Shanton Chang,et al.  The Adoption of Broadband Internet in Australia and Canada , 2008 .

[23]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[24]  Lars Backstrom,et al.  ePluribus: Ethnicity on Social Networks , 2010, ICWSM.

[25]  James E. Pitkow,et al.  Characterizing Browsing Strategies in the World-Wide Web , 1995, Comput. Networks ISDN Syst..