Genie: a generator of natural language semantic parsers for virtual assistant commands

To understand diverse natural language commands, virtual assistants today are trained with numerous labor-intensive, manually annotated sentences. This paper presents a methodology and the Genie toolkit that can handle new compound commands with significantly less manual effort. We advocate formalizing the capability of virtual assistants with a Virtual Assistant Programming Language (VAPL) and using a neural semantic parser to translate natural language into VAPL code. Genie needs only a small, realistic set of input sentences for validating the neural model. Developers write templates to synthesize data; Genie uses crowdsourced paraphrases and data augmentation, along with the synthesized data, to train a semantic parser. We also propose design principles that make VAPL languages amenable to natural language translation. We apply these principles to revise ThingTalk, the language used by the Almond virtual assistant. We use Genie to build the first semantic parser that can support compound virtual assistant commands with unquoted free-form parameters. Genie achieves 62% accuracy on realistic user inputs. We demonstrate Genie’s generality by showing a 19% and 31% improvement over the previous state of the art on a music skill, aggregate functions, and access control.
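
For context, the sketch below illustrates the synthesis idea at the heart of this methodology: developer-written templates compose natural-language fragments with VAPL code fragments to produce (sentence, program) training pairs. This is a toy Python illustration only; the templates, skill names, function signatures, and ThingTalk-like program strings are assumptions for exposition and do not reproduce Genie's actual template language.

    # Toy sketch of template-driven synthesis of (sentence, program) pairs.
    # Illustrative only: the templates, skill names (@com.nytimes, @com.twitter),
    # and ThingTalk-like syntax are assumptions, not Genie's real template language.
    import itertools

    # Hypothetical primitive templates: (utterance fragment, code fragment).
    QUERIES = [("the latest New York Times headline", "@com.nytimes.get_front_page()")]
    ACTIONS = [("tweet {x}", "@com.twitter.post(status={x})")]

    def synthesize():
        """Compose every query template with every action template."""
        for (q_nl, q_code), (a_nl, a_code) in itertools.product(QUERIES, ACTIONS):
            sentence = "get " + q_nl + " and " + a_nl.format(x="it")
            program = "now => " + q_code + " => " + a_code.format(x="title") + ";"
            yield sentence, program

    for sentence, program in synthesize():
        print(sentence + "\t" + program)

In the full pipeline described in the abstract, such synthesized pairs are combined with crowdsourced paraphrases and further data augmentation before the neural semantic parser is trained.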
