Handling big data on agent-based modeling of Online Social Networks with MapReduce

There is an increasing interest on using Online Social Networks (OSNs) in a wide range of applications. Two interesting problems that have received a lot of attention in OSNs is how to provide effective ways to understand and predict how users behave, and how to build accurate models for specific domains (e.g., marketing campaigns). In this context, stochastic multi-agent based simulation can be employed to reproduce the behavior observed in OSNs. Nevertheless, the first step to build an accurate behavior model is to create an agent-based system. Hence, a modeler needs not only to be effective, but also to scale up given the huge volume of streaming graph data. To tackle the above challenges, this paper proposes a MapReduce-based method to build a modeler to handle big data. We demonstrate in our experiments how efficient and effective our proposal is using the Obama's Twitter network on the 2012 U.S. presidential election.

[1]  Magdalena Balazinska,et al.  Scalable Clustering Algorithm for N-Body Simulations in a Shared-Nothing Cluster , 2010, SSDBM.

[2]  Christos Faloutsos,et al.  V-SMART-Join: A Scalable MapReduce Framework for All-Pair Similarity Joins of Multisets and Vectors , 2012, Proc. VLDB Endow..

[3]  E. Rogers,et al.  Diffusion of innovations , 1964, Encyclopedia of Sport Management.

[4]  Krishna P. Gummadi,et al.  A measurement-driven analysis of information propagation in the flickr social network , 2009, WWW '09.

[5]  Antony I. T. Rowstron,et al.  Scale-up vs scale-out for Hadoop: time to rethink? , 2013, SoCC.

[6]  Ana Paula Appel,et al.  Reaction times for user behavior models in microblogging online social networks , 2013, DUBMOD '13.

[7]  David J. DeWitt,et al.  Parallel database systems: the future of high performance database systems , 1992, CACM.

[8]  Ana Paula Appel,et al.  Large-Scale Multi-agent-Based Modeling and Simulation of Microblogging-Based Online Social Network , 2013, MABS.

[9]  Michael Stonebraker,et al.  A comparison of approaches to large-scale data analysis , 2009, SIGMOD Conference.

[10]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[11]  Carlos Maltzahn,et al.  A framework for an in-depth comparison of scale-up and scale-out , 2013, International Symposium on Design and Implementation of Symbolic Computation Systems.

[12]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[13]  Alin Deutsch,et al.  ASTERIX: towards a scalable, semistructured data platform for evolving-world models , 2011, Distributed and Parallel Databases.

[14]  Jure Leskovec,et al.  Planetary-scale views on a large instant-messaging network , 2008, WWW.

[15]  R. Hill,et al.  A SIMULATION-BASED APPROACH TO ANALYZE THE INFORMATION DIFFUSION IN MICROBLOGGING ONLINE SOCIAL NETWORK , 2013 .

[16]  F. Al-Shamali,et al.  Author Biographies. , 2015, Journal of social work in disability & rehabilitation.

[17]  NICHOLAS R. JENNINGS,et al.  An agent-based approach for building complex software systems , 2001, CACM.

[18]  Jimeng Sun,et al.  Social influence analysis in large-scale networks , 2009, KDD.

[19]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[20]  Pablo Rodriguez,et al.  The little engine(s) that could: scaling online social networks , 2010, SIGCOMM '10.

[21]  Kathleen C. Schwartzman,et al.  DIFFUSION IN ORGANIZATIONS AND SOCIAL MOVEMENTS: From Hybrid Corn to Poison Pills , 2007 .

[22]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.