Large-grain dataflow computation and its architectural support