Elastic Scaling of e-Infrastructures to Support Data-Intensive Research Collaborations

For many research endeavours, e-Infrastructures need to provide predictable, on-demand access to large-scale computational resources with high data availability. These need to scale with the research communities requirements and use. One example of such an e-Infrastructure is the Australian Urban Research Infrastructure Network (AURIN -- www.aurin.org.au) project, which supports Australia-wide research in and across the urban and built environment. This paper describes the architecture of the AURIN infrastructure and its support for access to distributed (federated) and highly heterogeneous data sets from a wide range of providers. We present how this architecture solution leverages the intersection of high throughput computing (HTC), infrastructure as a service (IaaS) Cloud services and big data technologies including use of NoSQL resources. The driving concept in this architecture and the focus of this paper is the ability for scaling up or down depending on resource demands at any given time. This is done automatically and on demand avoiding either under-or over-utilization of resources. This resource-optimization-driven infrastructure has been designed to ensure that peak loads can be predicted and successfully coped with, as well as avoid wasting resources during non-peak times. This overall management strategy has resulted in an e-Infrastructure that provides a flexible, evolving research environment that scales with research needs, rather than providing a rigid (static) end product.

[1]  Richard O. Sinnott,et al.  A data-driven urban research environment for Australia , 2012, 2012 IEEE 8th International Conference on E-Science.

[2]  Thomas Eickermann,et al.  Design and evaluation of a collaborative online visualization and steering framework implementation for computational grids , 2007, 2007 8th IEEE/ACM International Conference on Grid Computing.

[3]  Bo Gao,et al.  A Framework for Native Multi-Tenancy Application Development and Management , 2007, The 9th IEEE International Conference on E-Commerce Technology and The 4th IEEE International Conference on Enterprise Computing, E-Commerce and E-Services (CEC-EEE 2007).

[4]  Aman Kansal,et al.  Q-clouds: managing performance interference effects for QoS-aware clouds , 2010, EuroSys '10.

[5]  Laura Díaz,et al.  Service-oriented applications for environmental models: Reusable geospatial services , 2010, Environ. Model. Softw..

[6]  Marius Hillenbrand,et al.  High performance cloud computing , 2013, Future Gener. Comput. Syst..

[7]  Jean-Baptiste Filippi,et al.  JDEVS: an implementation of a DEVS based formal framework for environmental modelling , 2004, Environ. Model. Softw..

[8]  Calvin J. Ribbens,et al.  Hybrid Computing - Where HPC meets grid and Cloud Computing , 2011, Future Gener. Comput. Syst..

[9]  Anne E. Trefethen,et al.  The Data Deluge: An e-Science Perspective , 2003 .

[10]  Richard O. Sinnott,et al.  Decentralized orchestration of data-centric workflows in Cloud environments , 2013, Future Gener. Comput. Syst..

[11]  Paul Marshall,et al.  Elastic Site: Using Clouds to Elastically Extend Site Resources , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[12]  Achim Streit,et al.  Research advances by using interoperable e-science infrastructures , 2009, Cluster Computing.

[13]  Douglas C. Schmidt,et al.  Model-driven auto-scaling of green cloud computing infrastructure , 2012, Future Gener. Comput. Syst..

[14]  Frank Leymann,et al.  Variability modeling to support customization and deployment of multi-tenant-aware Software as a Service applications , 2009, 2009 ICSE Workshop on Principles of Engineering Service Oriented Systems.

[15]  Cor-Paul Bezemer,et al.  Multi-tenant SaaS applications: maintenance dream or nightmare? , 2010, IWPSE-EVOL '10.

[16]  José A. B. Fortes,et al.  Sky Computing , 2009, IEEE Internet Computing.

[17]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[18]  Eric Bouillet,et al.  Efficient resource provisioning in compute clouds via VM multiplexing , 2010, ICAC '10.

[19]  Thomas Kwok,et al.  A Software as a Service with Multi-tenancy Support for an Electronic Contract Management Application , 2008, 2008 IEEE International Conference on Services Computing.

[20]  Anne E. Trefethen,et al.  Cyberinfrastructure for e-Science , 2005, Science.

[21]  Ed Zaluska,et al.  Load Index Metrics for an Optimized Management of Web Services: A Systematic Evaluation , 2013, PloS one.

[22]  Yong Zhao,et al.  Cloud Computing and Grid Computing 360-Degree Compared , 2008, GCE 2008.

[23]  Rubén S. Montero,et al.  Scheduling strategies for optimal service deployment across multiple clouds , 2013, Future Gener. Comput. Syst..

[24]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[25]  Alexandru Iosup,et al.  On the Performance Variability of Production Cloud Services , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[26]  Petraq J. Papajorgji,et al.  An architecture for developing service-oriented and component-based environmental models , 2004 .

[27]  Martin Molina,et al.  A tenant-based resource allocation model for scaling Software-as-a-Service applications over cloud computing infrastructures , 2013, Future Gener. Comput. Syst..