Task Learning through Visual Demonstration and Situated Dialogue

To enable effective collaboration between humans and cognitive robots, it is important for robots to continuously acquire task knowledge from their human partners. To address this need, we are developing a framework that supports task learning through visual demonstration and natural language dialogue. One core component of this framework is dialogue-driven integration of language and vision for learning task knowledge. This paper describes our ongoing effort, in particular, grounded task learning through joint processing of video and dialogue using And-Or Graphs.