Scalable Implementation of the Two-Dimensional Triangular Discrete Element Method on a GPU Platform