Towards Molecule Generation with Heterogeneous States via Reinforcement Learning