Fine-grained Hardware Acceleration for Efficient Batteryless Intermittent Inference on the Edge