A Comparative Analysis of Scheduling Mechanisms for Virtual Screening Workflow in a Shared Resource Environment

Traditional High-Throughput Computing (HTC) consists of running many loosely-coupled tasks that are independent but requires a large amount of computing power during significant period of time. However, recent emerging applications requiring millions or even billions of tasks to be processed within a relatively short period of time have expanded the traditional HTC into Many-Task Computing (MTC).In silico drug discovery offers an efficient alternative to reduce the cost of drug development and discovery process. For this purpose, virtual screening is used to select the most promising candidate drugs for in vitro testing from millions of chemical compounds. This process requires a substantial amount of computing resources and high-performance processing of docking simulations, which shows the typical characteristics of MTC applications. As the number of users performing this virtual screening process increases with limited available computing resources, it becomes crucial to devise an effective scheduling policy that can ensure a certain degree of fairness and user satisfaction. In this paper, we present a comparative analysis of scheduling mechanisms for the virtual screening workflow where multiple users in the system are sharing a common service infrastructure. To effectively support these multiple users, the underlying system should be able to consider fairness, user response time and overall system throughput. We have implemented two different scheduling algorithms which can address fairness and user response time respectively in a common middleware stack called HTCaaS which is a pilot-job based multi-level scheduling system running on top of a dedicated production-level cluster. Throughout comparative analysis of two different scheduling mechanisms targeting different metrics on top of a single H/W and S/W system, we can give an insight to the research community on the design and implementation of a scheduling mechanism that can trade-off user fairness and overall system performance which's crucial to support challenging MTC applications.

[1]  Yong Zhao,et al.  Falkon: a Fast and Light-weight tasK executiON framework , 2007, Proceedings of the 2007 ACM/IEEE Conference on Supercomputing (SC '07).

[2]  N. Jacq,et al.  Grid-Enabled High-Throughput In Silico Screening Against Influenza A Neuraminidase , 2006, IEEE Transactions on NanoBioscience.

[3]  Emmanuel Medernach,et al.  Workload Analysis of a Cluster in a Grid Environment , 2005, JSSPP.

[4]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[5]  D S Goodsell,et al.  Automated docking of flexible ligands: Applications of autodock , 1996, Journal of molecular recognition : JMR.

[6]  Soonwook Hwang,et al.  Abstract: HTCaaS: A Large-Scale High-Throughput Computing by Leveraging Grids, Supercomputers and Cloud , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[7]  Ian Foster,et al.  GT4 GRAM: A Functionality and Performance Study , 2007 .

[8]  Zhou Lei,et al.  The portable batch scheduler and the maui scheduler on linux clusters , 2000 .

[9]  V. Srinivasa Rao,et al.  Modern drug discovery process: An in silico approach , 2011 .

[10]  Johan Montagnat,et al.  Grid-enabled Virtual Screening Against Malaria , 2006, Journal of Grid Computing.

[11]  Zhao Zhang,et al.  Towards Loo on , 2008 .

[12]  Ryan G. Coleman,et al.  ZINC: A Free Tool to Discover Chemistry for Biology , 2012, J. Chem. Inf. Model..

[13]  Ryan G. Coleman,et al.  Protein Pockets: Inventory, Shape, and Comparison , 2010, J. Chem. Inf. Model..

[14]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[15]  R. Capuzzo-Dolcetta PRACE: Partnership for Advanced Computing in Europe , 2010 .

[16]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[17]  Allen B. Downey,et al.  A parallel workload model and its implications for processor allocation , 1996, Proceedings. The Sixth IEEE International Symposium on High Performance Distributed Computing (Cat. No.97TB100183).

[18]  David P. Anderson,et al.  SETI@home: an experiment in public-resource computing , 2002, CACM.

[19]  Wolfgang Gentzsch,et al.  Sun Grid Engine: towards creating a compute power grid , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[20]  Dror G. Feitelson,et al.  Packing Schemes for Gang Scheduling , 1996, JSSPP.

[21]  Martin Hofmann-Apitius,et al.  WISDOM-II: Screening against multiple targets implicated in malaria using computational grid infrastructures , 2009, Malaria Journal.

[22]  B. Allen,et al.  Designing a Runtime System for Volunteer Computing , 2006, ACM/IEEE SC 2006 Conference (SC'06).

[23]  Douglas Thain,et al.  Distributed computing in practice: the Condor experience , 2005, Concurr. Pract. Exp..

[24]  David Abramson,et al.  Mixing Grids and Clouds: High-Throughput Science Using the Nimrod Tool Family , 2010, Cloud Computing.

[25]  Seoyoung Kim,et al.  HTCaaS : Leveraging Distributed Supercomputing Infrastructures for Large-Scale Scientific Computing , 2013 .

[26]  Shantenu Jha,et al.  P∗: A model of pilot-abstractions , 2012, 2012 IEEE 8th International Conference on E-Science.

[27]  Vincent Breton,et al.  Optimizing the Stretch for Virtual Screening Application on Pilot-agent Platforms on Grid/Cloud by using Multi-level Queue-based Scheduling , 2014, CLOSER.

[28]  David P. Anderson,et al.  BOINC: a system for public-resource computing and storage , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[29]  Edward Walker,et al.  Creating personal adaptive clusters for managing scientific jobs in a distributed computing environment , 2006, 2006 IEEE Challenges of Large Applications in Distributed Environments.

[30]  Yong Zhao,et al.  Many-task computing for grids and supercomputers , 2008, 2008 Workshop on Many-Task Computing on Grids and Supercomputers.

[31]  Soonwook Hwang,et al.  Poster: HTCaaS: A Large-Scale High-Throughput Computing by Leveraging Grids, Supercomputers and Cloud , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[32]  Soonwook Hwang,et al.  Towards effective science cloud provisioning for a large-scale high-throughput computing , 2014, Cluster Computing.

[33]  Joel Closier,et al.  DIRAC: a community grid solution , 2008 .

[34]  Alexander S. Szalay,et al.  Accelerating large-scale data exploration through data diffusion , 2008, DADC '08.

[35]  Nicolas Jacq,et al.  In silico drug discovery services in computing grid environments against neglected and emerging infectious diseases. (Recherche de médicaments in silico sur grilles de calcul contre des maladies négligées et émergentes) , 2006 .