TD Learning with Neural Networks - Study of the Leakage Propagation Problem