Nonparametric adaptive control for discrete-time Markov processes with unbounded costs under average criterion