Mobile computing network is envisioned as a powerful framework to support the growing computation-intensive applications in the era of the Internet of Things (IoT). In this paper, we exploit the potential of a multi-layer network via a two-dimensional (2-D) task offloading scheme, which enables horizontal cooperations among the edge nodes. To minimize the average task offloading delay for all the mobile users, we formulate a mixed non-linear programming (MINLP) by jointly optimizing the 2-D offloading decisions and communication/computational resource allocation. To address this very challenging problem, we exploit the unique algorithmic structure of the optimal branch-and-bound (B&B) algorithm, and propose a novel Gaussian process imitation learning (GPIL) method to learn how to discover the shortcut for node searching in the B&B enumeration tree and significantly accelerate the B&B algorithm. When the network key parameters change, we further propose a novel recursive GPIL (RGPIL) method to agilely adapt to the new scenario with a fast policy update, where the new posterior distribution can be recursively updated based on a few new training data. Our simulation results show that the proposed method can achieve a near optimal solution with a significantly reduced complexity (e.g., a reduction of 98.7% in the number of searched nodes for a typical case). On this basis, the advantage of 2-D offloading scheme over the conventional schemes is also verified.