Learning visually grounded words and syntax for a scene description task

[1]  Thomas Rist,et al.  Generating coherent presentations employing textual and visual material , 1995, Artificial Intelligence Review.

[2]  Gerd Herzog,et al.  VIsual TRAnslator: Linking perceptions and natural language descriptions , 1994, Artificial Intelligence Review.

[3]  Deb Roy,et al.  Grounded spoken language acquisition: experiments in word learning , 2003, IEEE Trans. Multim..

[4]  Alex Pentland,et al.  Learning words from sights and sounds: a computational model , 2002, Cogn. Sci..

[5]  Jeffrey Mark Siskind,et al.  Grounding the Lexical Semantics of Verbs in Visual Perception using Force Dynamics and Event Logic , 1999, J. Artif. Intell. Res..

[6]  Deb Roy,et al.  Grounded speech communication , 2000, INTERSPEECH.

[7]  Marilyn A. Walker,et al.  Learning Attribute Selections for Non-Pronominal Expressions , 2000, ACL.

[8]  D. Roy Learning Visually Grounded Words and Syntax of Natural Spoken Language , 2000 .

[9]  L. Barsalou,et al.  Whither structured representation? , 1999, Behavioral and Brain Sciences.

[10]  Robert Dale,et al.  Computational Interpretations of the Gricean Maxims in the Generation of Referring Expressions , 1995, Cogn. Sci..

[11]  R. Dale Generating referring expressions - constructing descriptions in a domain of objects and processes , 1995, ACL-MIT press series in natural language processing.

[12]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[13]  A. Gorin On automated language acquisition , 1989 .

[14]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[15]  E. Markman Categorization and naming in children , 1989 .

[16]  A. Dale Magoun,et al.  Decision, estimation and classification , 1989 .

[17]  G. Lakoff,et al.  Metaphors We Live by , 1982 .

[18]  I. Good THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .