Latency-constrained DNN architecture learning for edge systems using zerorized batch normalization