Corpus Linguistics and South Asian Languages: Corpus Creation and Tool Development
暂无分享,去创建一个
Kalina Bontcheva | Diana Maynard | Tony McEnery | Andrew Hardie | Valentin Tablan | Hamish Cunningham | Robert J. Gaizauskas | Richard Xiao | Paul Baker | Oana Hamza | Cristian Ursu | B. D. Jayaram | Mark Leisher
[1] Tony McEnery,et al. A new agenda for corpus linguistics - working with all of the world's languages , 2000 .
[2] Andrew Hardie,et al. The computational analysis of morphosyntactic categories in Urdu , 2004 .
[3] Hamish Cunningham,et al. GATE-a General Architecture for Text Engineering , 1996, COLING.
[4] Miriam Butt. The Structure of Complex Predicates in Urdu , 1995 .
[5] Anthony McEnery,et al. Building a corpus of spoken sylheti. , 1999 .
[6] Colin P. Masica. The Indo-Aryan Languages , 1991 .
[7] Kalina Bontcheva,et al. A Unicode-based Environment for Creation and Use of Language Resources , 2002, LREC.
[8] Signe Oksefjell,et al. A description of the English-Norwegian parallel corpus : Compilation and further developments , 1999 .
[9] Geoffrey Leech,et al. Standards for Tagsets. , 1999 .
[10] Bernard Comrie,et al. The Major languages of South Asia, the Middle East and Africa , 1990 .
[11] Tony McEnery,et al. EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation , 2002, LREC.
[12] Bidyut B. Chaudhuri,et al. Computer recognition of printed Bangla script , 1995 .
[13] Akira Nakanishi,et al. Writing Systems of the World , 1980 .
[14] Michael C. Shapiro. An introduction to Hindi and Urdu , 1980 .