Thermal neutron beam optimization for PGNAA applications using Q-learning algorithm and neural network