A gradient estimator via L1-randomization for online zero-order optimization with two point feedback