The Design and Implementation of a Tibetan Word Segmentation System

Word segmentation for Tibetan has received little study to date. This paper presents a Tibetan word segmentation system that we designed and implemented, and describes its system architecture, knowledge bases, segmentation strategy, and algorithms. In preliminary experiments, the system demonstrates relatively high accuracy and good domain independence.