Fairness-oriented and location-aware NUCA for many-core SoC
[1] Huawei Li, et al. Address Remapping for Static NUCA in NoC-Based Degradable Chip-Multiprocessors, 2010, 2010 IEEE 16th Pacific Rim International Symposium on Dependable Computing.
[2] Smruti R. Sarangi, et al. FP-NUCA: A Fast NOC Layer for Implementing Large NUCA Caches, 2015, IEEE Transactions on Parallel and Distributed Systems.
[3] Simon W. Moore, et al. A communication characterisation of Splash-2 and Parsec, 2009, 2009 IEEE International Symposium on Workload Characterization (IISWC).
[4] Kai Li, et al. The PARSEC benchmark suite: Characterization and architectural implications, 2008, 2008 International Conference on Parallel Architectures and Compilation Techniques (PACT).
[5] Ji Wu, et al. CCAS: Contention and congestion aware switch allocation for network-on-chips, 2016, 2016 IEEE 34th International Conference on Computer Design (ICCD).
[6] Jaehyuk Huh, et al. A NUCA Substrate for Flexible CMP Cache Sharing, 2007, IEEE Transactions on Parallel and Distributed Systems.
[7] William J. Dally, et al. Principles and Practices of Interconnection Networks, 2004.
[8] Emilio Luque, et al. A new method to make communication latency uniform: distributed routing balancing, 1999, ICS '99.
[9] Rajeev Balasubramonian, et al. Interconnect design considerations for large NUCA caches, 2007, ISCA '07.
[10] Mahmut T. Kandemir, et al. A novel migration-based NUCA design for Chip Multiprocessors, 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.
[11] Per Stenström, et al. An Adaptive Shared/Private NUCA Cache Partitioning Scheme for Chip Multiprocessors, 2007, 2007 IEEE 13th International Symposium on High Performance Computer Architecture.
[12] José Duato, et al. Achieving balanced buffer utilization with a proper co-design of flow control and routing algorithm, 2014, 2014 Eighth IEEE/ACM International Symposium on Networks-on-Chip (NoCS).
[13] Mor Harchol-Balter, et al. Thread Cluster Memory Scheduling: Exploiting Differences in Memory Access Behavior, 2010, 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture.
[14] Yu Zhang, et al. Non-uniform fat-meshes for chip multiprocessors, 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.
[15] Chita R. Das, et al. Aérgia: exploiting packet latency slack in on-chip networks, 2010, ISCA.
[16] Doug Burger, et al. An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches, 2002, ASPLOS X.
[17] David A. Wood, et al. Managing Wire Delay in Large Chip-Multiprocessor Caches, 2004, 37th International Symposium on Microarchitecture (MICRO-37'04).
[18] Nan Jiang, et al. A detailed and flexible cycle-accurate Network-on-Chip simulator, 2013, 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).
[19] Mahmut T. Kandemir, et al. Addressing End-to-End Memory Access Latency in NoC-Based Multicores, 2012, 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture.