A social content delivery network for e‐Science

We are in the midst of a scientific data explosion in which the rate of data growth is rapidly increasing. While large‐scale research projects have developed sophisticated data distribution networks to share their data with researchers globally, there is no such support for the many millions of research projects generating data of interest to much smaller audiences (as exemplified by the long tail scientist). In data‐oriented research, every aspect of the research process is influenced by data access. However, sharing and accessing data efficiently as well as lowering access barriers are difficult. In the absence of dedicated large‐scale storage, many have noted that there is an enormous storage capacity available via connected peers, none more so than the storage resources of many research groups. With widespread usage of the content delivery network model for disseminating web content, we believe a similar model can be applied to distributing, sharing, and accessing long tail research data in an e‐Science context. We describe the vision and architecture of a social content delivery network – a model that leverages the social networks of researchers to automatically share and replicate data on peers' resources based upon shared interests and trust. Using this model, we describe a simulator and investigate how aspects such as user activity, geographic distribution, trust, and replica selection algorithms affect data access and storage performance. From these results, we show that socially informed replication strategies are comparable with more general strategies in terms of availability and outperform them in terms of spatial efficiency. Copyright © 2016 John Wiley & Sons, Ltd.

[1]  Carole A. Goble,et al.  The design and realisation of the myExperiment Virtual Research Environment for social sharing of workflows , 2009, Future Gener. Comput. Syst..

[2]  David P. Anderson,et al.  BOINC: a system for public-resource computing and storage , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[3]  Liviu Iftode,et al.  Social Butterfly: Social Caches for Distributed Social Networks , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[4]  Ian T. Foster,et al.  Networking Materials Data: Accelerating Discovery at Experimental Facilities , 2014, High Performance Computing Workshop.

[5]  Daniel S. Katz,et al.  Constructing a Social Content Delivery Network for eScience , 2013, 2013 IEEE 9th International Conference on e-Science.

[6]  Aravind Srinivasan,et al.  Predicting Trust and Distrust in Social Networks , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[7]  Christian Haas,et al.  Co-operative Infrastructures: An Economic Model for Providing Infrastructures for Social Cloud Computing , 2013, 2013 46th Hawaii International Conference on System Sciences.

[8]  Krzysztof Rzadca,et al.  Replica Placement in P2P Storage: Complexity and Game Theoretic Analyses , 2010, 2010 IEEE 30th International Conference on Distributed Computing Systems.

[9]  Karl Aberer,et al.  My3: A highly-available P2P-based online social network , 2011, 2011 IEEE International Conference on Peer-to-Peer Computing.

[10]  Zahir Tari,et al.  MetaCDN: Harnessing 'Storage Clouds' for high performance content delivery , 2009, J. Netw. Comput. Appl..

[11]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[12]  Michael Ley,et al.  The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives , 2002, SPIRE.

[13]  Xiaoming Fu,et al.  Gemstone: Empowering Decentralized Social Networking with High Data Availability , 2011, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[14]  Alexandros G. Dimakis,et al.  Network Coding for Distributed Storage Systems , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[15]  Ling Liu,et al.  PeerTrust: supporting reputation-based trust for peer-to-peer electronic communities , 2004, IEEE Transactions on Knowledge and Data Engineering.

[16]  Ian T. Foster,et al.  Globus Nexus: Research Identity, Profile, and Group Management as a Service , 2014, 2014 IEEE 10th International Conference on e-Science.

[17]  Akbar Ghaffarpour Rahbar,et al.  PowerTrust: A Robust and Scalable Reputation System for Trusted Peer-to-Peer Computing , 2007, IEEE Transactions on Parallel and Distributed Systems.

[18]  Daniel S. Katz,et al.  On Replica Placement in a Social CDN for e-Science , 2014, 2014 IEEE 10th International Conference on e-Science.

[19]  Ian T. Foster,et al.  Efficient and Secure Transfer, Synchronization, and Sharing of Big Data , 2014, IEEE Cloud Computing.

[20]  William E. Allcock,et al.  The Globus Striped GridFTP Framework and Server , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[21]  Raúl Gracia Tinedo,et al.  FriendBox: A Hybrid F2F Personal Storage Application , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[22]  Christian Haas,et al.  Foundations of Trust: Contextualising Trust in Social Clouds , 2012, 2012 Second International Conference on Cloud and Green Computing.

[23]  Gerhard Klimeck,et al.  nanoHUB.org: Advancing Education and Research in Nanotechnology , 2008, Computing in Science & Engineering.

[24]  Ramesh K. Sitaraman,et al.  The Akamai network: a platform for high-performance internet applications , 2010, OPSR.

[25]  Athanasios V. Vasilakos,et al.  CoDaaS: An experimental cloud-centric content delivery platform for user-generated contents , 2012, 2012 International Conference on Computing, Networking and Communications (ICNC).

[26]  Raúl Gracia Tinedo,et al.  F2Box: Cloudifying F2F Storage Systems with High Availability Correlation , 2012, IEEE CLOUD.

[27]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[28]  Daniel S. Katz,et al.  A Social Content Delivery Network for Scientific Cooperation: Vision, Design, and Architecture , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[29]  Ian T. Foster,et al.  Big Data Remote Access Interfaces for Light Source Science , 2015, 2015 IEEE/ACM 2nd International Symposium on Big Data Computing (BDC).

[30]  Kyle Chard,et al.  Social Cloud Computing: A Vision for Socially Motivated Resource Sharing , 2012, IEEE Transactions on Services Computing.

[31]  Jinyang Li,et al.  Friendstore: cooperative online backup using trusted nodes , 2008, SocialNets '08.

[32]  C. Haas Incentives and Two-Sided Matching - Engineering Coordination Mechanisms for Social Clouds , 2014 .

[33]  Ian T. Foster,et al.  Globus Online: Accelerating and Democratizing Science through Cloud-Based Services , 2011, IEEE Internet Computing.

[34]  A. Pentland,et al.  Eigenbehaviors: identifying structure in routine , 2009, Behavioral Ecology and Sociobiology.

[35]  Cuong Pham,et al.  S-CLONE: Socially-aware data replication for social networks , 2012, Comput. Networks.

[36]  John C. Tang,et al.  Work rhythms: analyzing visualizations of awareness histories of distributed groups , 2002, CSCW '02.

[37]  Liviu Iftode,et al.  SocialCDN: Caching techniques for distributed social networks , 2012, 2012 IEEE 12th International Conference on Peer-to-Peer Computing (P2P).

[38]  Shaowen Wang,et al.  Interoperation of world‐wide production e‐Science infrastructures , 2009, Concurr. Comput. Pract. Exp..

[39]  Christian Haas,et al.  A Social Compute Cloud: Allocating and Sharing Infrastructure Resources via Social Networks , 2014, IEEE Transactions on Services Computing.

[40]  Kai Hwang,et al.  PowerTrust: A Robust and Scalable Reputation System for Trusted Peer-to-Peer Computing , 2007, IEEE Transactions on Parallel and Distributed Systems.

[41]  Refik Molva,et al.  Safebook: A privacy-preserving online social network leveraging on real-life trust , 2009, IEEE Communications Magazine.

[42]  Martin Florian,et al.  A socio- and locality-aware overlay for user-centric networking , 2014, 2014 International Conference on Computing, Networking and Communications (ICNC).

[43]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[44]  Barbara Carminati,et al.  A Multi-dimensional and Event-Based Model for Trust Computation in the Social Web , 2012, SocInfo.

[45]  Anna Monreale,et al.  Classifying Trust/Distrust Relationships in Online Social Networks , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[46]  Erwin Laure,et al.  Interoperation of world-wide production e-Science infrastructures , 2009 .

[47]  Sonja Buchegger,et al.  PeerSoN: P2P social networking: early experiences and insights , 2009, SNS '09.