Improving energy efficiency and security in cluster computing systems

Cluster computing systems are widely used in parallel and distributed computing research. Besides performance, energy cost and security must also be carefully considered in large-scale cluster computing systems, both to reduce operating budgets and to avoid information leaks. In this dissertation, I proposed a Time Aware Dynamic Voltage Scaling scheduling algorithm to reduce the energy consumed by processors in parallel computing systems, and a design for an energy-efficient I/O system that uses write buffer disks to reduce the energy consumed by large-scale storage systems. To explain when energy consumption can be reduced in cluster computing systems, I analyzed CPU and I/O system performance in a security-aware storage system. Security is another issue that has not been well explored in cluster computing systems. I implemented a transparent encryption/decryption layer in MPICH2, a popular Message Passing Interface implementation, and then quantitatively evaluated system performance on two cluster computing systems.
