SimpleNLG-ZH: a Linguistic Realisation Engine for Mandarin

We introduce SimpleNLG-ZH, a realisation engine for Mandarin that follows the software design paradigm of SimpleNLG (Gatt and Reiter, 2009). We explain the core grammar (morphology and syntax) and the lexicon of SimpleNLG-ZH, which is very different from English and other languages for which SimpleNLG engines have been built. The system was evaluated by regenerating expressions from a body of test sentences and a corpus of human-authored expressions. Human evaluation was conducted to estimate the quality of regenerated sentences.

[1]  Marcel Bollmann Adapting SimpleNLG to German , 2011, ENLG.

[2]  Albert Gatt,et al.  Automatic generation of textual summaries from neonatal intensive care data , 2009 .

[3]  J. Packard The Morphology of Chinese: A Linguistic and Cognitive Approach , 2000 .

[4]  Ding Xu Functional categories in Mandarin Chinese , 1997 .

[5]  Irene Langkilde Forest-Based Statistical Sentence Generation , 2000, ANLP.

[6]  Chen Bo,et al.  Investigating the content and form of referring expressions in Mandarin: introducing the Mtuna corpus , 2017, INLG.

[7]  Y.-H. Audrey Li,et al.  The Syntax of Chinese , 2009 .

[8]  John Lee,et al.  Towards Universal Dependencies for Learner Chinese , 2017, UDW@NoDaLiDa.

[9]  Ehud Reiter,et al.  Book Reviews: Building Natural Language Generation Systems , 2000, CL.

[10]  Kees van Deemter Computational Models of Referring: A Study in Cognitive Science , 2016 .

[11]  D. Terence Langendoen,et al.  Topic Structures in Chinese , 1985 .

[12]  John Thayer Jensen Morphology: Word structure in generative grammar , 1990 .

[13]  Emiel Krahmer,et al.  Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation , 2017, J. Artif. Intell. Res..

[14]  Shou-hsin Teng,et al.  Remarks on Cleft Sentences in Chinese. , 1979 .

[15]  Wei He,et al.  Dependency Based Chinese Sentence Realization , 2009, ACL/IJCNLP.

[16]  Albert Gatt,et al.  SimpleNLG: A Realisation Engine for Practical Applications , 2009, ENLG.

[17]  石 毓智 汉语语法 = Chinese grammar , 2010 .

[18]  Alberto Bugarín,et al.  Adapting SimpleNLG to Spanish , 2017, INLG.

[19]  Waltraud Paul,et al.  Adjectives in Mandarin Chinese: The rehabilitation of a much ostracized category , 2010 .

[20]  Guy Lapalme Natural Language Generation and Summarization at RALI , 2013, ENLG.

[21]  Sasi Raja Sekhar Dokkara,et al.  A Simple Surface Realization Engine for Telugu , 2015, ENLG.

[22]  Michael White,et al.  Towards broad coverage surface realization with CCG , 2007, MTSUMMIT.

[23]  Cristina Bosco,et al.  SimpleNLG-IT: adapting SimpleNLG to Italian , 2016, INLG.

[24]  Rodrigo de Oliveira,et al.  Adapting SimpleNLG for Brazilian Portuguese realisation , 2014, INLG.

[25]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[26]  Ethel Ong,et al.  A Simple Surface Realizer for Filipino , 2011, PACLIC.

[27]  John A. Bateman,et al.  The Chinese Aspect Generation Based on Aspect Selection Functions , 2009, ACL.

[28]  Guy Lapalme,et al.  Adapting SimpleNLG for Bilingual English-French Realisation , 2013, ENLG.

[29]  Yen-hui Audrey Li,et al.  Argument Determiner Phrases and Number Phrases , 1998, Linguistic Inquiry.