Choice of approximator and design of penalty function for an approximate dynamic programming based control approach