APPLICATION-AWARE OPTIMIZATIONS FOR BIG DATA ACCESS

Yanlong Yin

[1] David R. O'Hallaron,et al. Computer Systems: A Programmer's Perspective , 1991 .

[2] Marios C. Papaefthymiou,et al. Performance Evaluation of Gang Scheduling for Parallel and Distributed Multiprogramming , 1997, JSSPP.

[3] Xian-He Sun,et al. A cost-intelligent application-specific data layout scheme for parallel file systems , 2011, HPDC '11.

[4] Andrea C. Arpaci-Dusseau,et al. The interaction of parallel and sequential workloads on a network of workstations , 1995, SIGMETRICS '95/PERFORMANCE '95.

[5] Jun He,et al. Pattern-aware file reorganization in MPI-IO , 2011, PDSW '11.

[6] Hui Lei,et al. An analytical approach to file prefetching , 1997 .

[7] John Bent,et al. PLFS: a checkpoint filesystem for parallel applications , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[8] Larry Rudolph,et al. Gang Scheduling Performance Benefits for Fine-Grain Synchronization , 1992, J. Parallel Distributed Comput..

[9] Friedhelm Meyer auf der Heide,et al. Dynamic and Redundant Data Placement , 2007, 27th International Conference on Distributed Computing Systems (ICDCS '07).

[10] Dawei Wang,et al. Concurrent Average Memory Access Time , 2014, Computer.

[11] Daniel A. Reed,et al. Learning to Classify Parallel Input/Output Access Patterns , 2002, IEEE Trans. Parallel Distributed Syst..

[12] Surendra Byna,et al. Parallel I/O prefetching using MPI file caching and I/O signatures , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[13] Dean M. Tullsen,et al. Symbiotic jobscheduling for a simultaneous multithreading processor , 2000, SIGP.

[14] Andrea C. Arpaci-Dusseau,et al. Implicit coscheduling: coordinated scheduling with implicit information in distributed systems , 2001, TOCS.

[15] Rajeev Thakur,et al. Data sieving and collective I/O in ROMIO , 1998, Proceedings. Frontiers '99. Seventh Symposium on the Frontiers of Massively Parallel Computation.

[16] Yale N. Patt,et al. Scheduling algorithms for modern disk drives , 1994, SIGMETRICS 1994.

[17] Song Jiang,et al. Making resonance a common case: A high-performance implementation of collective I/O on parallel file systems , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[18] Robert Ross,et al. Implementation and performance of a parallel file system for high performance distributed applications , 1996, Proceedings of 5th IEEE International Symposium on High Performance Distributed Computing.

[19] Karthik Vijayakumar,et al. Scalable I/O tracing and analysis , 2009, PDSW '09.

[20] John Wilkes,et al. An introduction to disk drive modeling , 1994, Computer.

[21] Raju Rangaswami,et al. I/O Deduplication: Utilizing content similarity to improve I/O performance , 2010, TOS.

[22] Dror G. Feitelson,et al. Paired Gang Scheduling , 2003, IEEE Trans. Parallel Distributed Syst..

[23] John May,et al. Parallel I/O for High Performance Computing , 2000 .

[24] Gregory R. Ganger,et al. Towards higher disk head utilization: extracting free bandwidth from busy disk drives , 2000, OSDI.

[25] Dror G. Feitelson,et al. Packing Schemes for Gang Scheduling , 1996, JSSPP.

[26] Alan Jay Smith,et al. The automatic improvement of locality in storage systems , 2005, TOCS.

[27] Samuel Lang,et al. A Segment-Level Adaptive Data Layout Scheme for Improved Load Balance in Parallel File Systems , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[28] Daniel A. Reed,et al. Automatic ARIMA time series modeling for adaptive I/O prefetching , 2004, IEEE Transactions on Parallel and Distributed Systems.

[29] Srinivasan Seshan,et al. Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems , 2008, FAST.

[30] Rajeev Thakur,et al. Improving Parallel I/O Performance with Data Layout Awareness , 2010, 2010 IEEE International Conference on Cluster Computing.

[31] Howard Gobioff,et al. The Google file system , 2003, SOSP '03.

[32] Kang G. Shin,et al. FS2: dynamic data replication in free disk space for improving disk performance and energy consumption , 2005, SOSP '05.

[33] Wei-keng Liao,et al. An Implementation and Evaluation of Client-Side File Caching for MPI-IO , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[34] Jun Wang,et al. MRAP: a novel MapReduce-based framework to support HPC analytics applications with access patterns , 2010, HPDC '10.

[35] Xian-He Sun,et al. Data layout optimization for petascale file systems , 2009, PDSW '09.

[36] Frank B. Schmuck,et al. GPFS: A Shared-Disk File System for Large Computing Clusters , 2002, FAST.

[37] Song Jiang,et al. IOrchestrator: Improving the Performance of Multi-node I/O Systems via Inter-Server Coordination , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.

[38] Laxmikant V. Kalé,et al. Periodic hierarchical load balancing for large supercomputers , 2011, Int. J. High Perform. Comput. Appl..

[39] Philip C. Roth,et al. Characterizing the I/O behavior of scientific applications on the Cray XT , 2007, PDSW '07.

[40] Robert B. Ross,et al. Noncontiguous I/O accesses through MPI-IO , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[41] William Gropp,et al. Toward Scalable Performance Visualization with Jumpshot , 1999, Int. J. High Perform. Comput. Appl..

[42] Arif Merchant,et al. Proportional-Share Scheduling for Distributed Storage Systems , 2007, FAST.

[43] Mary K. Vernon,et al. The performance of multiprogrammed multiprocessor scheduling algorithms , 1990, SIGMETRICS '90.

[44] Tao Yang,et al. The Panasas ActiveScale Storage Cluster - Delivering Scalable High Bandwidth Storage , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[45] David Kotz,et al. Disk-directed I/O for MIMD multiprocessors , 1994, OSDI '94.

[46] Gang Peng,et al. Multi-dimensional storage virtualization , 2004, SIGMETRICS '04/Performance '04.

[47] Allen D. Malony,et al. The Tau Parallel Performance System , 2006, Int. J. High Perform. Comput. Appl..

[48] Jianwei Li,et al. Parallel netCDF: A High-Performance Scientific I/O Interface , 2003, ACM/IEEE SC 2003 Conference (SC'03).

[49] Surendra Byna,et al. Hiding I/O latency with pre-execution prefetching for parallel applications , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[50] Song Jiang,et al. InterferenceRemoval: removing interference of disk access for MPI programs through data replication , 2010, ICS '10.

[51] Bin Zhou,et al. Scalable Performance of the Panasas Parallel File System , 2008, FAST.

[52] Patrick Sobalvarro,et al. Demand-Based Coscheduling of Parallel Jobs on Multiprogrammed Multiprocessors , 1995, JSSPP.

[53] Kai Shen,et al. Multi-Layer Event Trace Analysis for Parallel I/O Performance Tuning , 2007, 2007 International Conference on Parallel Processing (ICPP 2007).

[54] Carlos Maltzahn,et al. Ceph: a scalable, high-performance distributed file system , 2006, OSDI '06.

[55] Margo I. Seltzer,et al. Disk Scheduling Revisited , 1990 .

[56] Karsten Schwan,et al. ...and eat it too: high read performance in write-optimized HPC I/O middleware file formats , 2009, PDSW '09.

[57] Vagelis Hristidis,et al. BORG: Block-reORGanization for Self-optimizing Storage Systems , 2009, FAST.

[58] Irfan Ahmad,et al. PARDA: Proportional Allocation of Resources for Distributed Storage Access , 2009, FAST.

[59] Zhiwei Xu,et al. Grid replication coherence protocol , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[60] Scott A. Brandt,et al. Horizon: efficient deadline-driven disk I/O management for distributed storage systems , 2010, HPDC '10.

[61] Marios C. Papaefthymiou,et al. A Gang Scheduling Design for Multiprogrammed Parallel Computing Environments , 1996, JSSPP.

[62] Anthony Skjellum,et al. Using MPI - portable parallel programming with the message-passing interface , 1994 .

[63] Cyril U. Orji,et al. Write-only disk caches , 1990, SIGMOD '90.

[64] Victor Lee,et al. Implications of I/O for Gang Scheduled Workloads , 1997, JSSPP.

[65] Bernd Mohr,et al. KOJAK - A Tool Set for Automatic Performance Analysis of Parallel Programs , 2003, Euro-Par.

[66] Jian Xu,et al. Performance virtualization for large-scale storage systems , 2003, 22nd International Symposium on Reliable Distributed Systems, 2003. Proceedings..

[67] Dror G. Feitelson,et al. Improved Utilization and Responsiveness with Gang Scheduling , 1997, JSSPP.

[68] Shubhashis Sengupta,et al. Scalable and Distributed Mechanisms for Integrated Scheduling and Replication in Data Grids , 2008, ICDCN.

[69] Jun Wang,et al. Bridging the Gap Between Parallel File Systems and Local File Systems: A Case Study with PVFS , 2008, 2008 37th International Conference on Parallel Processing.

[70] Richard A. Golding,et al. Zygaria: Storage Performance as a Managed Resource , 2006, 12th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS'06).