DANBI: Dynamic scheduling of irregular stream programs for many-core systems
[1] Michael I. Gordon. Compiler techniques for scalable performance of stream programs on multicore architectures, 2010.
[2] Leslie G. Valiant,et al. A bridging model for parallel computation , 1990, CACM.
[3] Michael Stumm,et al. FlexSC: Flexible System Call Scheduling with Exception-Less System Calls , 2010, OSDI.
[4] David E. Culler,et al. SEDA: an architecture for well-conditioned, scalable internet services , 2001, SOSP.
[5] Scott A. Mahlke,et al. Flextream: Adaptive Compilation of Streaming Applications for Heterogeneous Architectures , 2009, 2009 18th International Conference on Parallel Architectures and Compilation Techniques.
[6] Vivek Sarkar,et al. X10: an object-oriented approach to non-uniform cluster computing , 2005, OOPSLA '05.
[7] Dirk Grunwald,et al. Generating, optimizing, and scheduling a compiler level representation of stream parallelism, 2011.
[8] Georg Hager,et al. Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes , 2009, 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing.
[9] Matteo Frigo,et al. The implementation of the Cilk-5 multithreaded language , 1998, PLDI.
[10] Michael L. Scott,et al. Algorithms for scalable synchronization on shared-memory multiprocessors , 1991, TOCS.
[11] Robert Tappan Morris,et al. An Analysis of Linux Scalability to Many Cores , 2010, OSDI.
[12] Navendu Jain,et al. Adaptive Control of Extreme-scale Stream Processing Systems , 2006, 26th IEEE International Conference on Distributed Computing Systems (ICDCS'06).
[13] Kevin Skadron,et al. Rodinia: A benchmark suite for heterogeneous computing , 2009, 2009 IEEE International Symposium on Workload Characterization (IISWC).
[14] Kun-Lung Wu,et al. Elastic scaling of data parallel operators in stream processing , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.
[15] Kun-Lung Wu,et al. Auto-parallelizing stateful distributed streaming applications , 2012, 2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT).
[16] Selim G. Akl,et al. Optimal Parallel Merging and Sorting Without Memory Conflicts , 1987, IEEE Transactions on Computers.
[17] Luca P. Carloni,et al. Flexible filters: load balancing through backpressure for stream programs , 2009, EMSOFT '09.
[18] Sriram Krishnamoorthy,et al. Solving Large, Irregular Graph Problems Using Adaptive Work-Stealing , 2008, 2008 37th International Conference on Parallel Processing.
[19] Keshav Pingali,et al. The tao of parallelism in algorithms , 2011, PLDI '11.
[20] William Thies,et al. An empirical characterization of stream programs and its implications for language and compiler design , 2010, 2010 19th International Conference on Parallel Architectures and Compilation Techniques (PACT).
[21] Christoforos E. Kozyrakis,et al. Dynamic Fine-Grain Scheduling of Pipeline Parallelism , 2011, 2011 International Conference on Parallel Architectures and Compilation Techniques.
[22] Thomas E. Anderson,et al. The Performance of Spin Lock Alternatives for Shared-Memory Multiprocessors , 1990, IEEE Trans. Parallel Distributed Syst.
[23] Scott A. Mahlke,et al. Sponge: portable stream programming on graphics engines , 2011, ASPLOS XVI.
[24] E.A. Lee,et al. Synchronous data flow , 1987, Proceedings of the IEEE.
[25] D. Bednárek,et al. The Bobox Project Parallelization Framework and Server for Data Processing, 2011.
[26] Scott A. Mahlke,et al. Orchestrating the execution of stream programs on multicore platforms , 2008, PLDI '08.
[27] Ying Xing,et al. The Design of the Borealis Stream Processing Engine , 2005, CIDR.
[28] William J. Dally,et al. Buffer-space efficient and deadlock-free scheduling of stream applications on multi-core architectures , 2010, SPAA '10.
[29] Anjul Patney,et al. Task management for irregular-parallel workloads on the GPU , 2010, HPG '10.
[30] Pat Hanrahan,et al. GRAMPS: A programming model for graphics pipelines , 2009, ACM Trans. Graph.
[31] Shreekant S. Thakkar,et al. Synchronization algorithms for shared-memory multiprocessors , 1990, Computer.
[32] Leonardo Neumeyer,et al. S4: Distributed Stream Computing Platform , 2010, 2010 IEEE International Conference on Data Mining Workshops.