Language Segmentation of Twitter Tweets using Weakly Supervised Language Model Induction

This paper presents early results from a weakly supervised language model induction approach to language segmentation of multilingual texts, with a special focus on short texts.