Collecting Natural SMS and Chat Conversations in Multiple Languages: The BOLT Phase 2 Corpus
暂无分享,去创建一个
Zhiyi Song | Kevin Walker | Stephanie Strassel | Haejoong Lee | Jonathan Wright | Thomas Thomas | Ann Sawyer | Brendan Callahan | Brian Gainor | Jennifer Garland | Dana Fore | Preston Cabe
[1] Cédrick Fairon,et al. A translated corpus of 30,000 French SMS , 2006, LREC.
[2] Christopher Cieri,et al. Resources for new research directions in speaker recognition: the mixer 3, 4 and 5 corpora , 2007, INTERSPEECH.
[3] David Palfreyman,et al. "A Funky Language for Teenzz to Use": Representing Gulf Arabic in Instant Messaging , 2006, J. Comput. Mediat. Commun..
[4] Caroline Tagg,et al. A corpus linguistics study of SMS text messaging , 2009 .
[5] Eric Sanders. Collecting and Analysing Chats and Tweets in SoNaR , 2012, LREC 2012.
[6] Orphée De Clercq,et al. Collecting a corpus of Dutch SMS , 2012, LREC 2012.
[7] Mohammad Ali Yaghan,et al. Arabizi: A Contemporary Style of Arabic Slang , 2008, Design Issues.
[8] Stephanie Strassel,et al. Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure , 2012, LREC.
[9] Christopher Cieri,et al. Speaker Recognition: Building the Mixer 4 and 5 Corpora , 2008, LREC.
[10] Nizar Habash,et al. Conventional Orthography for Dialectal Arabic , 2012, LREC.
[11] Stephanie Strassel,et al. Linguistic Resources for Genre-Independent Language Technologies : User-Generated Content in BOLT , 2012 .
[12] Tao Chen,et al. Creating a live, public short message service corpus: the NUS SMS corpus , 2011, Lang. Resour. Evaluation.