FasterMoE: modeling and optimizing training of large-scale dynamic pre-trained models