A Statistical Language Model for Pre-Trained Sequence Labeling: A Case Study on Vietnamese