论文信息 - Refer-iTTS: A System for Referring in Spoken Installments to Objects in Real-World Images

Refer-iTTS: A System for Referring in Spoken Installments to Objects in Real-World Images

Current referring expression generation systems mostly deliver their output as one-shot, written expressions. We present on-going work on incremental generation of spoken expressions referring to objects in real-world images. This approach extends upon previous work using the words-as-classifier model for generation. We implement this generator in an incremental dialogue processing framework such that we can exploit an existing interface to incremental text-to-speech synthesis. Our system generates and synthesizes referring expressions while continuously observing non-verbal user reactions.

David Schlangen | Sina Zarrieß | Soledad López Gambino | David Schlangen | Sina Zarrieß

[1] Matthew W. Crocker,et al. Using listener gaze to augment speech generation in a virtual 3D environment , 2012, CogSci.

[2] David Schlangen,et al. The InproTK 2012 release , 2012, SDCTD@NAACL-HLT.

[3] Stefan Kopp,et al. Referring in Installments: A Corpus Study of Spoken Object References in an Interactive Virtual Environment , 2012, INLG.

[4] H. H. Clark,et al. Speaking while monitoring addressees for understanding , 2004 .

[5] Vicente Ordonez,et al. ReferItGame: Referring to Objects in Photographs of Natural Scenes , 2014, EMNLP.

[6] David Schlangen,et al. INPRO_iSS: A Component for Just-In-Time Incremental Speech Synthesis , 2012, ACL.

[7] Joyce Yue Chai,et al. Collaborative Models for Referring Expression Generation in Situated Dialogue , 2014, AAAI.

[8] Gabriel Skantze,et al. A General, Abstract Model of Incremental Dialogue Processing , 2011 .

[9] David DeVault,et al. An Information-State Approach to Collaborative Reference , 2005, ACL.

[10] H. H. Clark,et al. Referring as a collaborative process , 1986, Cognition.

[11] David Schlangen,et al. Easy Things First: Installments Improve Referring Expression Generation for Objects in Photographs , 2016, ACL.