Design and Implementation of a Compiler and Runtime System for Composite Tree Parallelism