Accelerating Depthwise Convolution and Pooling Operations on z-First Storage CNN Architectures