Making a Packet: Cost-Effective Communication for a Parallel Graph Reducer

This paper studies critical runtime-system issues encountered when packing data for transmission in a lazy, parallel graph reduction system. In particular, we aim to answer two questions: How much graph should go into a packet? How aggressively should a processor look for work after requesting remote data?