Order Free Consistency: Towards a Fully Asynchronous Memory Model

Computer architects are now studying a new generation of multi-core chip architectures that may integrate hundreds of processing cores and memory banks on a single chip employing a shared memory organization. A system may consist of many such chips (nodes) and an increasing demand to support high bandwidth and shared-address space between nodes. Furthermore, the inter-chip and intra-chip interconnections are also fast progressing with optical inter-chip and photonics intra-chip technology promising unprecedented bandwidth as well as multi-channel reordering capabilities. This paper focuses on the following fundamental question: can we have a memory model that is truly asynchronous that is: (1) memory operations can be issued freely from the processors without blocking by any memory based data dependence, and (2) the memory transmissions can travel through the interconnection network freely without worrying that they may arrive at the destination out of order. Furthermore, such a memory model must be realizable: that is, we can define an operational model 1 (hence construct an abstract machine) that can fully explore the above features during program execution. The terms “operational model” and “operational semantics” can be exchangeable in this paper.

[1]  Guang R. Gao,et al.  Optimizing the Fast Fourier Transform on a Multi-core Architecture , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[2]  Vivek Sarkar,et al.  Location Consistency-A New Memory Model and Cache Consistency Protocol , 2000, IEEE Trans. Computers.

[3]  Brian N. Bershad,et al.  The Midway distributed shared memory system , 1993, Digest of Papers. Compcon Spring.

[4]  Larry Rudolph,et al.  Commit-reconcile & fences (CRF): a new memory model for architects and compiler writers , 1999, ISCA.

[5]  Mark D. Hill,et al.  A Unified Formalization of Four Shared-Memory Models , 1993, IEEE Trans. Parallel Distributed Syst..

[6]  Vivek Sarkar,et al.  Location consistency: stepping beyond the barriers of memory coherence and serializability , 1993 .

[7]  Matteo Frigo,et al.  The weakest reasonable memory model , 1998 .

[8]  Leslie Lamport,et al.  Time, clocks, and the ordering of events in a distributed system , 1978, CACM.

[9]  Alan L. Cox,et al.  Lazy release consistency for software distributed shared memory , 1992, ISCA '92.

[10]  Friedemann Mattern,et al.  Virtual Time and Global States of Distributed Systems , 2002 .

[11]  Guang R. Gao,et al.  Synchronization state buffer: supporting efficient fine-grain synchronization on many-core architectures , 2007, ISCA '07.

[12]  Anoop Gupta,et al.  Memory consistency and event ordering in scalable shared-memory multiprocessors , 1990, [1990] Proceedings. The 17th Annual International Symposium on Computer Architecture.

[13]  Radha Jagadeesan,et al.  A theory of memory models , 2007, PPOPP.

[14]  Jeremy Manson,et al.  The Java memory model , 2005, POPL '05.

[15]  Maurice Herlihy,et al.  Transactional Memory: Architectural Support For Lock-free Data Structures , 1993, Proceedings of the 20th Annual International Symposium on Computer Architecture.

[16]  L. Schares,et al.  Terabus: Terabit/Second-Class Card-Level Optical Interconnect Technologies , 2006, IEEE Journal of Selected Topics in Quantum Electronics.

[17]  K. S. Weng AN ABSTRACT IMPLEMENTATION FOR A GENERALIZED DATA FLOW LANGUAGE , 1980 .

[18]  J.A. Kash IntraChip Optical Networks for a Future Supercomputer-on-a-Chip , 2007, 2007 Photonics in Switching.