Optimal trajectory tracking control for a class of nonlinear nonaffine systems via generalized N‐step value gradient learning