A Comparison of Highly Configurable CPU- and GPU-Based Convolution Engines