Operating system support for warehouse-scale computing

[1]  Donald Beaver,et al.  Dapper, a Large-Scale Distributed Systems Tracing Infrastructure , 2010 .

[2]  Willy Zwaenepoel,et al.  Implementation and performance of Munin , 1991, SOSP '91.

[3]  Henry M. Levy,et al.  Capability-Based Computer Systems , 1984 .

[4]  Henri E. Bal,et al.  Orca: A Language For Parallel Programming of Distributed Systems , 1992, IEEE Trans. Software Eng..

[5]  Zheng Shao,et al.  Data warehousing and analytics infrastructure at Facebook , 2010, SIGMOD Conference.

[6]  Christoforos E. Kozyrakis,et al.  Flash storage disaggregation , 2016, EuroSys.

[7]  Douglas Ross A personal view of the personal work station: some firsts in the Fifties , 1986, HPW '86.

[8]  Galen C. Hunt,et al.  Helios: heterogeneous multiprocessing with satellite kernels , 2009, SOSP '09.

[9]  Peter G. Neumann,et al.  The CHERI capability model: Revisiting RISC in an age of risk , 2014, 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA).

[10]  Timothy Roscoe,et al.  An Evaluation of Capabilities for a Multikernel , 2012 .

[11]  Michael Norrish,et al.  seL4: formal verification of an OS kernel , 2009, SOSP '09.

[12]  James R. Bulpin Operating system support for simultaneous multithreaded processors , 2005 .

[13]  Margo Seltzer,et al.  VINO: The 1994 Fall Harvest , 1994 .

[14]  Andrew R. Cherenson,et al.  The Sprite network operating system , 1988, Computer.

[15]  John Allen,et al.  Scuba: Diving into Data at Facebook , 2013, Proc. VLDB Endow..

[16]  Srikanth Kandula,et al.  Jockey: guaranteed job latency in data parallel clusters , 2012, EuroSys '12.

[17]  Michael J. Fischer,et al.  The architecture of the Eden system , 1981, SOSP.

[18]  Partha Dasgupta,et al.  Linking consistency with object/thread semantics: an approach to robust computation , 1989, [1989] Proceedings. The 9th International Conference on Distributed Computing Systems.

[19]  Thomas Anderson,et al.  The case for application-specific operating systems , 1992, [1992] Proceedings Third Workshop on Workstation Operating Systems.

[20]  Silas Boyd-Wickizer,et al.  Securing Distributed Systems with Information Flow Control , 2008, NSDI.

[21]  Miguel Castro,et al.  No compromises: distributed transactions with consistency, availability, and performance , 2015, SOSP.

[22]  Virendra J. Marathe,et al.  Callisto: co-scheduling parallel runtime systems , 2014, EuroSys '14.

[23]  Antti Ylä-Jääski,et al.  Exploiting Hardware Heterogeneity within the Same Instance Type of Amazon EC2 , 2012, HotCloud.

[24]  Rob Pike,et al.  Security in Plan 9 , 2002, USENIX Security Symposium.

[25]  Jun Li,et al.  Wormhole: Reliable Pub-Sub to Support Geo-replicated Internet Services , 2015, NSDI.

[26]  J. LaFountain Inc. , 2013, American Art.

[27]  Nancy P. Kronenberg,et al.  VAXcluster: a closely-coupled distributed system , 1986, TOCS.

[28]  Liang Lin,et al.  Tenzing: a SQL implementation on the MapReduce framework , 2011, Proc. VLDB Endow..

[29]  Raghotham Murthy,et al.  Peregrine: Low-latency queries on Hive warehouse data , 2012, XRDS.

[30]  Muli Ben-Yehuda,et al.  The nonkernel: a kernel designed for the cloud , 2013, APSys.

[31]  Pete Wyckoff,et al.  Hive - A Warehousing Solution Over a Map-Reduce Framework , 2009, Proc. VLDB Endow..

[32]  Steven Hand,et al.  A down-to-earth look at the cloud host OS , 2012, HotCDP '12.

[33]  Barbara Liskov,et al.  Implementation of Argus , 1987, SOSP '87.

[34]  Christina Delimitrou,et al.  Tarcil: reconciling scheduling speed and quality in large shared clusters , 2015, SoCC.

[35]  Christoforos E. Kozyrakis,et al.  Heracles: Improving resource efficiency at scale , 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).

[36]  Jack B. Dennis,et al.  Programming semantics for multiprogrammed computations , 1966, CACM.

[37]  F. Shapiro,et al.  The Yale book of quotations , 2006 .

[38]  Alon Itai,et al.  On the complexity of time table and multi-commodity flow problems , 1975, 16th Annual Symposium on Foundations of Computer Science (sfcs 1975).

[39]  Hermann Härtig,et al.  Measuring energy consumption for short code paths using RAPL , 2012, PERV.

[40]  Andrew V. Goldberg,et al.  Finding Minimum-Cost Circulations by Successive Approximation , 1990, Math. Oper. Res..

[41]  Jochen Liedtke,et al.  On micro-kernel construction , 1995, SOSP.

[42]  Brian N. Bershad,et al.  Extensibility, safety and performance in the SPIN operating system , 1995, SOSP.

[43]  Sanjeev Kumar,et al.  Finding a Needle in Haystack: Facebook's Photo Storage , 2010, OSDI.

[44]  Abraham Silberschatz,et al.  Operating Systems Concepts , 2005 .

[45]  Rajesh Gupta,et al.  SleepServer: A Software-Only Approach for Reducing the Energy Consumption of PCs within Enterprise Environments , 2010, USENIX Annual Technical Conference.

[46]  Aart J. C. Bik,et al.  Pregel: a system for large-scale graph processing , 2010, SIGMOD Conference.

[47]  Nir Shavit,et al.  Software transactional memory , 1995, PODC '95.

[48]  Thomas F. Wenisch,et al.  The Mystery Machine: End-to-end Performance Analysis of Large-scale Internet Services , 2014, OSDI.

[49]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[50]  J. Ousterhout,et al.  Medusa: an experiment in distributed operating system structure , 1980, CACM.

[51]  Leonardo Neumeyer,et al.  S4: Distributed Stream Computing Platform , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[52]  Christina Delimitrou,et al.  Quasar: resource-efficient and QoS-aware cluster management , 2014, ASPLOS.

[53]  Yawei Li,et al.  Megastore: Providing Scalable, Highly Available Storage for Interactive Services , 2011, CIDR.

[54]  Steven M. Hand,et al.  Self-paging in the Nemesis operating system , 1999, OSDI '99.

[55]  Jochen Liedtke,et al.  The Mungi single‐address‐space operating system , 1998, Softw. Pract. Exp..

[56]  Keith A. Lantz,et al.  Rochester's intelligent gateway , 1982, Computer.

[57]  Harvey Bratman,et al.  The Share 709 System: Supervisory Control , 1959, JACM.

[58]  A. Löbel Solving Large-Scale Real-World Minimum-Cost Flow Problems by a Network Simplex Method , 1996 .

[59]  Anthony D. Joseph,et al.  Virtics: A System for Privilege Separation of Legacy Desktop Applications , 2010 .

[60]  Andrew Vladislav Goldberg,et al.  Efficient graph algorithms for sequential and parallel computers , 1987 .

[61]  Claude Kaiser,et al.  Overview of the CHORUS ® Distributed Operating Systems , 1991 .

[62]  Andrew S. Tanenbaum,et al.  A Comparison of Two Distributed Systems: Amoeba and Sprite , 1991, Comput. Syst..

[63]  Abhishek Verma,et al.  Large-scale cluster management at Google with Borg , 2015, EuroSys.

[64]  Donald E. Porter,et al.  Cooperation and security isolation of library OSes for multi-process applications , 2014, EuroSys '14.

[65]  Craig Chambers,et al.  FlumeJava: easy, efficient data-parallel pipelines , 2010, PLDI '10.

[66]  Maurice V. Wilkes Operating systems in a changing world , 1994, OPSR.

[67]  John L. Henning SPEC CPU2006 benchmark descriptions , 2006, CARN.

[68]  Andrew V. Goldberg,et al.  On Implementing Scaling Push-Relabel Algorithms for the Minimum-Cost Flow Problem , 1991, Network Flows And Matching.

[69]  Elaine Shi,et al.  GhostRider: A Hardware-Software System for Memory Trace Oblivious Computation , 2015, ASPLOS.

[70]  Andrew S. Tanenbaum,et al.  Group communication in the Amoeba distributed operating system , 1991, [1991] Proceedings. 11th International Conference on Distributed Computing Systems.

[71]  Chita R. Das,et al.  Towards characterizing cloud backend workloads: insights from Google compute clusters , 2010, PERV.

[72]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second Edition , 2013 .

[73]  Robin Fairbairns,et al.  The Design and Implementation of an Operating System to Support Distributed Multimedia Applications , 1996, IEEE J. Sel. Areas Commun..

[74]  Amar Phanishayee,et al.  FAWN: a fast array of wimpy nodes , 2009, SOSP '09.

[75]  Jack B. Dennis,et al.  Virtual memory, processes, and sharing in Multics , 1967, SOSP 1967.

[76]  Scott Shenker,et al.  Making Sense of Performance in Data Analytics Frameworks , 2015, NSDI.

[77]  P. Menage Adding Generic Process Containers to the Linux Kernel , 2010 .

[78]  Jennifer Rexford,et al.  Eliminating the hypervisor attack surface for a more secure cloud , 2011, CCS '11.

[79]  James B. Orlin A Faster Strongly Polynomial Minimum Cost Flow Algorithm , 1993, Oper. Res..

[80]  Andrew Birrell,et al.  Implementing Remote procedure calls , 1983, SOSP '83.

[81]  Mary Lou Soffa,et al.  Contention aware execution: online contention detection and response , 2010, CGO '10.

[82]  Luis Ceze,et al.  DDOS: taming nondeterminism in distributed systems , 2013, ASPLOS '13.

[83]  Carlo Curino,et al.  Apache Hadoop YARN: yet another resource negotiator , 2013, SoCC.

[84]  Georges Da Costa,et al.  2005 IEEE International Symposium on Cluster Computing and the Grid , 2005, CCGRID.

[85]  Cory Hill,et al.  f4: Facebook's Warm BLOB Storage System , 2014, OSDI.

[86]  Bo Dong,et al.  Hadoop high availability through metadata replication , 2009, CloudDB@CIKM.

[87]  Ken Thompson,et al.  The use of name spaces in Plan 9 , 1992, EW 5.

[88]  Robert C. Daley,et al.  An experimental time-sharing system , 1962, AIEE-IRE '62 (Spring).

[89]  Patrick Wendell,et al.  Sparrow: distributed, low latency scheduling , 2013, SOSP.

[90]  Herbert Bos,et al.  Pointless tainting?: evaluating the practicality of pointer tainting , 2009, EuroSys '09.

[91]  Jean Bacon Concurrent Systems: Operating Systems, Database and Distributed Systems: An Integrated Approach , 1997 .

[92]  Chita R. Das,et al.  Modeling and synthesizing task placement constraints in Google compute clusters , 2011, SoCC.

[93]  Dimitri P. Bertsekas,et al.  RELAX-IV: a faster version of the RELAX code for solving minimum cost flow problems , 1994 .

[94]  Robert J. Creasy,et al.  The Origin of the VM/370 Time-Sharing System , 1981, IBM J. Res. Dev..

[95]  Ali Ghodsi,et al.  Bolt-on causal consistency , 2013, SIGMOD '13.

[96]  Christoforos E. Kozyrakis,et al.  IX: A Protected Dataplane Operating System for High Throughput and Low Latency , 2014, OSDI.

[97]  Gernot Heiser,et al.  vNUMA: A Virtual Shared-Memory Multiprocessor , 2009, USENIX Annual Technical Conference.

[98]  Scott Shenker,et al.  Disk-Locality in Datacenter Computing Considered Irrelevant , 2011, HotOS.

[99]  Leslie Lamport,et al.  Time, clocks, and the ordering of events in a distributed system , 1978, CACM.

[100]  Guillermo J. Rozas,et al.  Translucent Procedures, Abstraction without Opacity , 1993 .

[101]  William A. Wulf,et al.  HYDRA , 1974, Commun. ACM.

[102]  Jon Crowcroft,et al.  Unikernels: library operating systems for the cloud , 2013, ASPLOS '13.

[103]  Jon Howell,et al.  Embassies: Radically Refactoring the Web , 2013, NSDI.

[104]  Chao Yang,et al.  Unicorn: A System for Searching the Social Graph , 2013, Proc. VLDB Endow..

[105]  Derek Gordon Murray,et al.  A distributed execution engine supporting data-dependent control flow , 2012 .

[106]  F. J. Corbató Introduction and Overview of the Multics System , 2010 .

[107]  Éva Tardos,et al.  A strongly polynomial minimum cost circulation algorithm , 1985, Comb..

[108]  Fred P. Brooks,et al.  The Mythical Man-Month , 1975, Reliable Software.

[109]  Andrew S. Tanenbaum,et al.  Operating systems - design and implementation, 3rd Edition , 2005 .

[110]  Roger M. Needham,et al.  The structure of a multiprogramming supervisor , 1968, Comput. J..

[111]  Tony Tung,et al.  Scaling Memcache at Facebook , 2013, NSDI.

[112]  Donald E. Porter,et al.  Rethinking the library OS from the top down , 2011, ASPLOS XVI.

[113]  Luigi Rizzo,et al.  netmap: A Novel Framework for Fast Packet I/O , 2012, USENIX ATC.

[114]  George F. Ryckman The IBM 701 computer at the General Motors Research Laboratories , 1983 .

[115]  Christina Delimitrou,et al.  Paragon: QoS-aware scheduling for heterogeneous datacenters , 2013, ASPLOS '13.

[116]  Lingjia Tang,et al.  The impact of memory subsystem resource sharing on datacenter applications , 2011, 2011 38th Annual International Symposium on Computer Architecture (ISCA).

[117]  William J. Bolosky,et al.  Mach: A New Kernel Foundation for UNIX Development , 1986, USENIX Summer.

[118]  Larry L. Peterson,et al.  Fbufs: a high-bandwidth cross-domain transfer facility , 1994, SOSP '93.

[119]  Christoforos E. Kozyrakis,et al.  Reconciling high server utilization and sub-millisecond quality-of-service , 2014, EuroSys '14.

[120]  Gavin Lowe,et al.  An Attack on the Needham-Schroeder Public-Key Authentication Protocol , 1995, Inf. Process. Lett..

[121]  David R. Cheriton,et al.  UIO: a uniform I/O system interface for distributed systems , 1987, TOCS.

[122]  Chris Hawblitzel,et al.  Cosh: Clear OS Data Sharing In An Incoherent World , 2014, TRIOS.

[123]  James B. Orlin,et al.  Max flows in O(nm) time, or better , 2013, STOC '13.

[124]  J. Duell The design and implementation of Berkeley Lab's Linux checkpoint/restart , 2005 .

[125]  Robert Griesemer,et al.  Paxos made live: an engineering perspective , 2007, PODC '07.

[126]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[127]  Daniel Mills,et al.  MillWheel: Fault-Tolerant Stream Processing at Internet Scale , 2013, Proc. VLDB Endow..

[128]  M. G. Land,et al.  A practical approach to operating systems , 1988 .

[129]  Adrian Schüpbach,et al.  Your computer is already a distributed system. Why isn't your OS? , 2009, HotOS.

[130]  Maurice V. Wilkes,et al.  Memoirs of a Computer Pioneer , 1985 .

[131]  Robert N. M. Watson,et al.  CamIO : declaring flexible and performant application , 2013 .

[132]  Werner Vogels,et al.  Dynamo: Amazon's highly available key-value store , 2007, SOSP.

[133]  Dutch T. Meyer,et al.  Strata: High-Performance Scalable Storage on Virtualized Non-volatile Memory , 2014, FAST 2014.

[134]  Zoltán Király,et al.  Efficient implementations of minimum-cost flow algorithms , 2012, ArXiv.

[135]  Mark Moir,et al.  SNZI: scalable NonZero indicators , 2007, PODC '07.

[136]  D. Bertsekas,et al.  The relax codes for linear minimum cost network flow problems , 1988 .

[137]  Jerome H. Saltzer,et al.  The protection of information in computer systems , 1975, Proc. IEEE.

[138]  Miguel Castro,et al.  MIKE: A Distributed object-oriented programming platform on top of the Mach micro-kernel , 1993, USENIX MACH Symposium.

[139]  Scott Shenker,et al.  Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling , 2010, EuroSys '10.

[140]  Jeffrey I. Schiller,et al.  An Authentication Service for Open Network Systems , 1998 .

[141]  Andrew V. Goldberg,et al.  An Efficient Implementation of a Scaling Minimum-Cost Flow Algorithm , 1997 .

[142]  Francisco Vilar Brasileiro,et al.  Long-term SLOs for reclaimed cloud computing resources , 2014, SoCC.

[143]  Komal Shringare,et al.  Apache Hadoop Goes Realtime at Facebook , 2015 .

[144]  Dirk Merkel,et al.  Docker: lightweight Linux containers for consistent development and deployment , 2014 .

[145]  Arvind Krishnamurthy,et al.  Customizable and Extensible Deployment for Mobile/Cloud Applications , 2014, OSDI.

[146]  Robert F. Rosin,et al.  Supervisory and Monitor Systems , 1969, CSUR.

[147]  John Rosenberg,et al.  Grasshopper: An Orthogonally Persistent Operating System , 1994, Comput. Syst..

[148]  Kurt Fuchel,et al.  Consideration in the design of a multiple computer system with extended core storage , 1967, SOSP 1967.

[149]  Daniel Price,et al.  Solaris Zones: Operating System Support for Consolidating Commercial Workloads , 2004, LISA.

[150]  Prashant Malik,et al.  Cassandra: a decentralized structured storage system , 2010, OPSR.

[151]  Xiao Zhang,et al.  CPI2: CPU performance isolation for shared compute clusters , 2013, EuroSys '13.

[152]  Lars Bergstrom,et al.  Measuring NUMA effects with the STREAM benchmark , 2011, ArXiv.

[153]  David E. Culler,et al.  Hierarchical scheduling for diverse datacenter workloads , 2013, SoCC.

[154]  Benjamin C. Lee,et al.  Market mechanisms for managing datacenters with heterogeneous microarchitectures , 2014, TOCS.

[155]  Robert N. M. Watson,et al.  Queues Don't Matter When You Can JUMP Them! , 2015, NSDI.

[156]  Kang G. Shin,et al.  LiteGreen: Saving Energy in Networked Desktops Using Virtualization , 2010, USENIX Annual Technical Conference.

[157]  Krste Asanovic,et al.  FireBox: A Hardware Building Block for 2020 Warehouse-Scale Computers , 2014 .

[158]  Jordi Torres,et al.  GreenHadoop: leveraging green energy in data-processing frameworks , 2012, EuroSys '12.

[159]  Bruce Walker,et al.  The LOCUS distributed operating system , 1983, SOSP '83.

[160]  Remzi H. Arpaci-Dusseau Operating Systems: Three Easy Pieces , 2015, login Usenix Mag..

[161]  Alan H. Karp,et al.  Using Split Capabilities for Access Control , 2003, IEEE Softw..

[162]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[163]  Xin Chen,et al.  F1: the fault-tolerant distributed RDBMS supporting Google's ad business , 2012, SIGMOD Conference.

[164]  Carlo Curino,et al.  Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters , 2015, USENIX Annual Technical Conference.

[165]  Richard F. Rashid From RIG to Accent to Mach: An Evolution of a Network Operating System , 1986, FJCC.

[166]  Arnar Birgisson,et al.  Macaroons: Cookies with Contextual Caveats for Decentralized Authorization in the Cloud , 2014, NDSS.

[167]  M. Abadi,et al.  Naiad: a timely dataflow system , 2013, SOSP.

[168]  Andrea C. Arpaci-Dusseau,et al.  Analysis of HDFS under HBase: a Facebook Messages case study , 2014, FAST.

[169]  John K. Ousterhout,et al.  In Search of an Understandable Consensus Algorithm , 2014, USENIX ATC.

[170]  Don Marti,et al.  OSv - Optimizing the Operating System for Virtual Machines , 2014, USENIX Annual Technical Conference.

[171]  Jon Howell,et al.  Flat Datacenter Storage , 2012, OSDI.

[172]  Yanpei Chen,et al.  Energy efficiency for large-scale MapReduce workloads with significant interactive analysis , 2012, EuroSys '12.

[173]  D. J. Howarth,et al.  The Atlas Supervisor , 1961 .

[174]  John Kubiatowicz,et al.  Tessellation: Refactoring the OS around explicit resource containers with continuous adaptation , 2013, 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC).

[175]  Tajana Rosing,et al.  Utilizing green energy prediction to schedule mixed batch and service jobs in data centers , 2011, OPSR.

[176]  Srinivas Devadas,et al.  Intel SGX Explained , 2016, IACR Cryptol. ePrint Arch..

[177]  Rob Pike,et al.  Systems Software Research is Irrelevant , 2000 .

[178]  Yang Zhang,et al.  Corey: An Operating System for Many Cores , 2008, OSDI.

[179]  Richard F. Rashid,et al.  Mach and Matchmaker: kernel and language support for object-oriented distributed systems , 1986, OOPSLA 1986.

[180]  Jennifer Rexford,et al.  NoHype: virtualized cloud infrastructure without the virtualization , 2010, ISCA.

[181]  Robert N. M. Watson,et al.  Jails: confining the omnipotent root , 2000 .

[182]  Paul Tseng,et al.  Relaxation Methods for Minimum Cost Ordinary and Generalized Network Flow Problems , 1988, Oper. Res..

[183]  P. G. Neumann,et al.  A general-purpose file system for secondary storage , 1965, AFIPS '65 (Fall, part I).

[184]  Michael Isard,et al.  DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language , 2008, OSDI.

[185]  Robert Tappan Morris,et al.  An Analysis of Linux Scalability to Many Cores , 2010, OSDI.

[186]  Mark J. Clement,et al.  Core Algorithms of the Maui Scheduler , 2001, JSSPP.

[187]  Hui Ding,et al.  TAO: Facebook's Distributed Data Store for the Social Graph , 2013, USENIX Annual Technical Conference.

[188]  Hakim Weatherspoon,et al.  The Xen-Blanket: virtualize once, run everywhere , 2012, EuroSys '12.

[189]  Albert G. Greenberg,et al.  Scarlett: coping with skewed content popularity in MapReduce clusters , 2011, EuroSys '11.

[190]  Austin T. Clements,et al.  The scalable commutativity rule: designing scalable software for multicore processors , 2013, SOSP.

[191]  M. Malik,et al.  Operating Systems , 1992, Lecture Notes in Computer Science.

[192]  Robert E. Tarjan,et al.  A faster deterministic maximum flow algorithm , 1992, SODA '92.

[193]  Benjamin Hindman,et al.  Dominant Resource Fairness: Fair Allocation of Multiple Resource Types , 2011, NSDI.

[194]  L. J. Zillman Housman, 1897-1936 , 1944 .

[195]  Christoforos E. Kozyrakis,et al.  Dune: Safe User-level Access to Privileged CPU Features , 2012, OSDI.

[196]  Benjamin Reed,et al.  A simple totally ordered broadcast protocol , 2008, LADIS '08.

[197]  Sergey Melnik,et al.  Dremel: interactive analysis of web-scale datasets , 2011, Commun. ACM.

[198]  Wei Lin,et al.  Apollo: Scalable and Coordinated Scheduling for Cloud-Scale Computing , 2014, OSDI.

[199]  J. Saltzer,et al.  A hardware architecture for implementing protection rings , 1972, OPSR.

[200]  Anne-Marie Kermarrec,et al.  Hawk: Hybrid Datacenter Scheduling , 2015, USENIX Annual Technical Conference.

[201]  James Reinders,et al.  Intel Xeon Phi Coprocessor High Performance Programming , 2013 .

[202]  Jean-Philippe Martin,et al.  Dandelion: a compiler and runtime for heterogeneous systems , 2013, SOSP.

[203]  Randy H. Katz,et al.  Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center , 2011, NSDI.

[204]  Hyeontaek Lim,et al.  MICA: A Holistic Approach to Fast In-Memory Key-Value Storage , 2014, NSDI.

[205]  Andrew V. Goldberg,et al.  The Partial Augment-Relabel Algorithm for the Maximum Flow Problem , 2008, ESA.

[206]  Binoy Ravindran,et al.  Popcorn: bridging the programmability gap in heterogeneous-ISA platforms , 2015, EuroSys.

[207]  Christoph Lameter,et al.  NUMA (Non-Uniform Memory Access): An Overview , 2013, ACM Queue.

[208]  Anant Agarwal,et al.  An operating system for multicore and clouds: mechanisms and implementation , 2010, SoCC '10.

[209]  Dawson R. Engler,et al.  Exokernel: an operating system architecture for application-level resource management , 1995, SOSP.

[210]  David K. Smith Network Flows: Theory, Algorithms, and Applications , 1994 .

[211]  Adrian Schüpbach,et al.  The multikernel: a new OS architecture for scalable multicore systems , 2009, SOSP '09.

[212]  Randy H. Katz,et al.  Heterogeneity and dynamicity of clouds at scale: Google trace analysis , 2012, SoCC '12.

[213]  Ken Thompson,et al.  The UNIX time-sharing system , 1974, CACM.

[214]  Alan M. Turing Proposal for development in the Mathematics Division of an Automatic Computing Engine (ACE) , 2011 .

[215]  Richard E. Kessler,et al.  The Alpha 21264 microprocessor , 1999, IEEE Micro.

[216]  Steven Hand,et al.  Musketeer: all for one, one for all in data processing systems , 2015, EuroSys.

[217]  Alexander Hall,et al.  Processing a Trillion Cells per Mouse Click , 2012, Proc. VLDB Endow..

[218]  George G. Robertson,et al.  Accent: A communication oriented network operating system kernel , 1981, SOSP.

[219]  Eric A. Brewer,et al.  Towards robust distributed systems (abstract) , 2000, PODC '00.

[220]  Partha Dasgupta,et al.  The Clouds distributed operating system: functional description, implementation details and related work , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[221]  Johanna D. Moore,et al.  A nested transaction mechanism for LOCUS , 1983, SOSP '83.

[222]  Andrew P. Black,et al.  The Eden System: A Technical Review , 1985, IEEE Transactions on Software Engineering.

[223]  Andrew V. Goldberg,et al.  Finding minimum-cost circulations by canceling negative cycles , 1989, JACM.

[224]  L. S. Shapley,et al.  College Admissions and the Stability of Marriage , 2013, Am. Math. Mon..

[225]  P. R. Radford,et al.  Operating systems for digital computers , 1968 .

[226]  James B. Orlin,et al.  A polynomial time primal network simplex algorithm for minimum cost flows , 1996, SODA '96.

[227]  Scott Shenker,et al.  Tachyon: Reliable, Memory Speed Storage for Cluster Computing Frameworks , 2014, SoCC.

[228]  Ken Thompson,et al.  Plan 9 from Bell Labs , 1995 .

[229]  Antonio Frangioni,et al.  A Computational Study of Cost Reoptimization for Min-Cost Flow Problems , 2006, INFORMS J. Comput..

[230]  Andrew V. Goldberg,et al.  Quincy: fair scheduling for distributed computing clusters , 2009, SOSP '09.

[231]  Gang Ren,et al.  Google-Wide Profiling: A Continuous Profiling Infrastructure for Data Centers , 2010, IEEE Micro.

[232]  Scott Shenker,et al.  The Case for Tiny Tasks in Compute Clusters , 2013, HotOS.

[233]  Kevin Klues,et al.  Improving per-node efficiency in the datacenter with new OS abstractions , 2011, SoCC.

[234]  Michael Isard,et al.  TidyFS: A Simple and Small Distributed File System , 2011, USENIX Annual Technical Conference.

[235]  Simon Peter,et al.  Resource management in a multicore operating system , 2012 .

[236]  Mahadev Konar,et al.  ZooKeeper: Wait-free Coordination for Internet-scale Systems , 2010, USENIX ATC.

[237]  Eric S. Chung,et al.  A reconfigurable fabric for accelerating large-scale datacenter services , 2014, 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA).

[238]  Steven Hand,et al.  New wine in old skins: the case for distributed operating systems in the data center , 2013, APSys.

[239]  Mendel Rosenblum,et al.  Fast crash recovery in RAMCloud , 2011, SOSP.

[240]  Gabriel H. Loh,et al.  Dynamic Classification of Program Memory Behaviors in CMPs , 2008 .

[241]  Sanjay Ghemawat,et al.  The Google file system , 2003 .

[242]  M. Frans Kaashoek,et al.  RadixVM: scalable address spaces for multithreaded applications , 2013, EuroSys '13.

[243]  Maurice V. Wilkes,et al.  The Cambridge CAP computer and its operating system (Operating and programming systems series) , 1979 .

[244]  Mark Handley,et al.  Network stack specialization for performance , 2015, SIGCOMM 2015.

[245]  Robert Morris,et al.  Optimizing MapReduce for Multicore Architectures , 2010 .

[246]  Jignesh M. Patel,et al.  Twitter Heron: Stream Processing at Scale , 2015, SIGMOD Conference.

[247]  Andrew V. Goldberg,et al.  Efficient maximum flow algorithms , 2014, CACM.

[248]  Peter G. Neumann,et al.  CHERI: A Hybrid Capability-System Architecture for Scalable Software Compartmentalization , 2015, 2015 IEEE Symposium on Security and Privacy.

[249]  Joseph Gonzalez,et al.  PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs , 2012, OSDI.

[250]  Marius Eriksen Your server as a function , 2014, ACM SIGOPS Oper. Syst. Rev..

[251]  Brian F. Cooper Spanner: Google's globally-distributed database , 2013, SYSTOR '13.

[252]  Yuan Yu,et al.  Dryad: distributed data-parallel programs from sequential building blocks , 2007, EuroSys '07.

[253]  Frank Dabek,et al.  Large-scale Incremental Processing Using Distributed Transactions and Notifications , 2010, OSDI.

[254]  Jignesh M. Patel,et al.  Storm@twitter , 2014, SIGMOD Conference.

[255]  Reuben Olinsky,et al.  Composing OS extensions safely and efficiently with Bascule , 2013, EuroSys '13.

[256]  Robert N. M. Watson,et al.  Firmament: Fast, Centralized Cluster Scheduling at Scale , 2016, OSDI.

[257]  Gary Brown,et al.  Denver: Nvidia's First 64-bit ARM Processor , 2015, IEEE Micro.

[258]  Robert Tappan Morris,et al.  Improving network connection locality on multicore systems , 2012, EuroSys '12.

[259]  Timothy Roscoe,et al.  Arrakis , 2014, OSDI.

[260]  Brett D. Fleisch,et al.  The Chubby lock service for loosely-coupled distributed systems , 2006, OSDI '06.

[261]  Luis Ceze,et al.  Deterministic Process Groups in dOS , 2010, OSDI.

[262]  Dheeraj Reddy,et al.  Bias scheduling in heterogeneous multi-core architectures , 2010, EuroSys '10.

[263]  J. Shapiro,et al.  EROS: a fast capability system , 2000, OPSR.

[264]  Eddie Kohler,et al.  Making information flow explicit in HiStar , 2006, OSDI '06.

[265]  Jonathan M. Smith,et al.  From Lone Dwarfs to Giant Superclusters: Rethinking Operating System Abstractions for the Cloud , 2015, HotOS.

[266]  Jason Mars Rethinking the Architecture of Warehouse-Scale Computers , 2012 .

[267]  Byung-Gon Chun,et al.  MegaPipe: A New Programming Interface for Scalable Network I/O , 2012, OSDI.

[268]  Ion Stoica,et al.  The Power of Choice in Data-Aware Cluster Scheduling , 2014, OSDI.

[269]  Raouf Boutaba,et al.  Characterizing Task Usage Shapes in Google Compute Clusters , 2011 .

[270]  Jochen Liedtke,et al.  The performance of μ-kernel-based systems , 1997, SOSP.

[271]  Yanpei Chen,et al.  Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads , 2012, Proc. VLDB Endow..

[272]  Scott Shenker,et al.  Choosy: max-min fair sharing for datacenter jobs with constraints , 2013, EuroSys '13.

[273]  V. A. Vyssotsky,et al.  Structure of the Multics supervisor , 1965, AFIPS '65 (Fall, part I).

[274]  Yin Wang,et al.  Bistro: Scheduling Data-Parallel Jobs Against Live Production Systems , 2015, USENIX Annual Technical Conference.

[275]  Steven Hand,et al.  The Seven Deadly Sins of Cloud Computing Research , 2012, HotCloud.

[276]  Gordon Bell,et al.  C.mmp: a multi-mini-processor , 1972, AFIPS '72 (Fall, part II).

[277]  Elaine Shi,et al.  PHANTOM: practical oblivious computation in a secure processor , 2013, CCS.

[278]  G. T. Jones ‘Surely You're Joking, Mr Feynman!’ Adventures of a Curious Character , 1985 .

[279]  Mendel Rosenblum,et al.  It's Time for Low Latency , 2011, HotOS.