Reinforcement-Learning-Based Task Planning for Self-Reconfiguration of Cellular Satellites