ReMot Reputation and Resource-Based Model to Estimate the Reliability of the Host Machines in Volunteer Cloud Environment

Due to the need of green computing and low cost, emerging paradigms such as edge and fog computing, and volunteer cloud have recently been introduced. In general, the volunteer cloud model targets globally distributed volunteer, highly heterogeneous, and non-dedicated machines. The inherent high degree of resource heterogeneity leads to varying levels of hardware and software failures and configuration faults on the unreliable and volatile volunteer hosts. As a result, the performance of deployed tasks is detrimentally impacted and is a key challenge, particularly in the case of scheduling algorithms. Most of the reputation models that have been used for reliability evaluation only evaluate the reliability of host machines by simply using the ratio of successfully completed tasks to total tasks requested. These models do not consider the resource utilization and the daily or weekly patterns of job behaviors or characteristics (e.g. priority of a job). Thus, the performance of tasks that run on these resources suffers and may take a substantial time to complete. Therefore, there is a need to proactively consider the reliability of host machines for effective and efficient management of resources in highly heterogeneous and distributed cloud environments. To address these challenges, this paper proposes a reputation and resource-based reliability model called ReMot. ReMot is an intelligent machine learning based model that utilizes historical data of the tasks and host machines to extract their resource usage patterns, in addition to other metrics such as task failure rate and resource utilization to predict the reliability of host machines. To validate ReMots approach, the researchers have utilized a large usage trace of real world applications made available by Google Inc. The results indicate that ReMot obtained more accurate reliability estimation than existing models and dynamically adapts to workload variations.

[1]  Abhishek Chandra,et al.  Adaptive Reputation-Based Scheduling on Unreliable Distributed Infrastructures , 2007, IEEE Transactions on Parallel and Distributed Systems.

[2]  Antonio Puliafito,et al.  Cloud@Home: Bridging the Gap between Volunteer and Cloud Computing , 2009, ICIC.

[3]  Hector Garcia-Molina,et al.  The Eigentrust algorithm for reputation management in P2P networks , 2003, WWW '03.

[4]  Kaizar Amin,et al.  Reputation-Based Grid Resource Selectionpdfauthor , 2003 .

[5]  Rajkumar Buyya,et al.  Next generation cloud computing: New trends and research directions , 2017, Future Gener. Comput. Syst..

[6]  Rajkumar Buyya,et al.  Optimizing the makespan and reliability for workflow applications with reputation and a look-ahead genetic algorithm , 2011, Future Gener. Comput. Syst..

[7]  Hanan Lutfiyya,et al.  Decentralized approach to resource availability prediction using group availability in a P2P desktop grid , 2012, Future Gener. Comput. Syst..

[8]  Xiao Qin,et al.  An Availability-Aware Task Scheduling Strategy for Heterogeneous Systems , 2008, IEEE Transactions on Computers.

[9]  Francesco Tiezzi,et al.  Reputation-Based Cooperation in the Clouds , 2014, IFIPTM.

[10]  Kashi Venkatesh Vishwanath,et al.  Characterizing cloud computing hardware reliability , 2010, SoCC '10.

[11]  Dunren Che,et al.  cuCloud: Volunteer Computing as a Service (VCaaS) System , 2018, CLOUD.

[12]  David P. Anderson,et al.  SETI@home: an experiment in public-resource computing , 2002, CACM.

[13]  Jon B. Weissman,et al.  A quantitative comparison of reputation systems in the grid , 2005, The 6th IEEE/ACM International Workshop on Grid Computing, 2005..

[14]  David E. Culler,et al.  PlanetLab: an overlay testbed for broad-coverage services , 2003, CCRV.

[15]  Venkata U. B. Challagulla,et al.  A Machine Learning-Based Reliability Assessment Model for Critical Software Systems , 2007, 31st Annual International Computer Software and Applications Conference (COMPSAC 2007).

[16]  Luís Moura Silva,et al.  Predicting Machine Availabilities in Desktop Pools , 2006, 2006 IEEE/IFIP Network Operations and Management Symposium NOMS 2006.

[17]  Malcolm P. Atkinson,et al.  Ad Hoc Cloud Computing , 2015, 2015 IEEE 8th International Conference on Cloud Computing.

[18]  Niki Pissinou,et al.  Cluster-Based Reputation and Trust for Wireless Sensor Networks , 2007, 2007 4th IEEE Consumer Communications and Networking Conference.

[19]  Fukushi Masaru,et al.  Adaptive Group-Based Job Scheduling for High Performance and Reliable Volunteer Computing (特集 分散処理とネットワークサービス) -- (クラウドコンピューティング・グリッド) , 2011 .

[20]  Masaru Fukushi,et al.  Generalized Spot-Checking for Reliable Volunteer Computing , 2010, IEICE Trans. Inf. Syst..

[21]  Aniruddha S. Gokhale,et al.  Intelligent, Performance Interference-Aware Resource Management for IoT Cloud Backends , 2016, 2016 IEEE First International Conference on Internet-of-Things Design and Implementation (IoTDI).

[22]  Yuguang Fang,et al.  A Fine-Grained Reputation System for Reliable Service Selection in Peer-to-Peer Networks , 2007, IEEE Transactions on Parallel and Distributed Systems.

[23]  Dunren Che,et al.  A "No Data Center" Solution to Cloud Computing , 2017, 2017 IEEE 10th International Conference on Cloud Computing (CLOUD).

[24]  David P. Anderson,et al.  BOINC: a system for public-resource computing and storage , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[25]  Abhishek Chandra,et al.  Ridge: combining reliability and performance in open grid platforms , 2007, HPDC '07.

[26]  Garth V. Crosby,et al.  SaRa: A Stochastic Model to Estimate Reliability of Edge Resources in Volunteer Cloud , 2018, 2018 IEEE International Conference on Edge Computing (EDGE).

[27]  Aniruddha S. Gokhale,et al.  iOverbook: Intelligent Resource-Overbooking to Support Soft Real-Time Applications in the Cloud , 2014, 2014 IEEE 7th International Conference on Cloud Computing.

[28]  Audun Jøsang,et al.  A survey of trust and reputation systems for online service provision , 2007, Decis. Support Syst..

[29]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.