Where is my forearm? Clustering of body parts from simultaneous tactile and linguistic input using sequential mapping

Humans and animals are constantly exposed to a continuous stream of sensory information from different modalities. At the same time, they form more compressed representations like concepts or symbols. In species that use language, this process is further structured by this interaction, where a mapping between the sensorimotor concepts and linguistic elements needs to be established. There is evidence that children might be learning language by simply disambiguating potential meanings based on multiple exposures to utterances in different contexts (cross-situational learning). In existing models, the mapping between modalities is usually found in a single step by directly using frequencies of referent and meaning co-occurrences. In this paper, we present an extension of this one-step mapping and introduce a newly proposed sequential mapping algorithm together with a publicly available Matlab implementation. For demonstration, we have chosen a less typical scenario: instead of learning to associate objects with their names, we focus on body representations. A humanoid robot is receiving tactile stimulations on its body, while at the same time listening to utterances of the body part names (e.g., hand, forearm and torso). With the goal at arriving at the correct "body categories", we demonstrate how a sequential mapping algorithm outperforms one-step mapping. In addition, the effect of data set size and noise in the linguistic input are studied.

[1]  Kenny Smith,et al.  Cross-Situational Learning: A Mathematical Approach , 2006, EELC.

[2]  P. Haggard,et al.  Please Scroll down for Article the Quarterly Journal of Experimental Psychology Segmenting the Body into Parts: Evidence from Biases in Tactile Perception , 2022 .

[3]  Manfred K. Warmuth,et al.  THE CMU SPHINX-4 SPEECH RECOGNITION SYSTEM , 2001 .

[4]  Giorgio Metta,et al.  YARP: Yet Another Robot Platform , 2006 .

[5]  Yiannis Demiris,et al.  Hierarchical action learning by instruction through interactive grounding of body parts and proto-actions , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[6]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[7]  G. Holmes,et al.  Sensory disturbances from cerebral lesions , 1911 .

[8]  Ellen M. Markman,et al.  Constraints Children Place on Word Meanings , 1990, Cogn. Sci..

[9]  Larissa K. Samuelson,et al.  Word learning emerges from the interaction of online referent selection and slow associative learning. , 2012, Psychological review.

[10]  Tadahiro Taniguchi,et al.  Bayesian body schema estimation using tactile information obtained through coordinated random movements , 2016, Adv. Robotics.

[11]  Matej Hoffmann,et al.  The encoding of proprioceptive inputs in the brain: knowns and unknowns from a robotic perspective , 2016, ArXiv.

[12]  D. Parisi,et al.  TRoPICALS: a computational embodied neuroscience model of compatibility effects. , 2010, Psychological review.

[13]  Alejandro Hernández Arieta,et al.  Body Schema in Robotics: A Review , 2010, IEEE Transactions on Autonomous Mental Development.

[14]  Michael P. Kaschak,et al.  Putting words in perspective , 2004, Memory & cognition.

[15]  J. Tenenbaum,et al.  Word learning as Bayesian inference. , 2007, Psychological review.

[16]  Giorgio Metta,et al.  Robotic Homunculus: Learning of Artificial Skin Representation in a Humanoid Robot Motivated by Primary Somatosensory Cortex , 2018, IEEE Transactions on Cognitive and Developmental Systems.

[17]  H. Branch Coslett,et al.  Evidence for Multiple, Distinct Representations of the Human Body , 2005, Journal of Cognitive Neuroscience.

[18]  Karla Stépánová,et al.  Estimating number of components in Gaussian mixture model using combination of greedy and merging algorithm , 2018, Pattern Analysis and Applications.

[19]  Chen Yu,et al.  Modeling cross-situational word-referent learning: prior questions. , 2012, Psychological review.

[20]  P. Wolff,et al.  Words and the mind : how words capture human experience , 2010 .

[21]  Bruno Lara,et al.  Exploration Behaviors, Body Representations, and Simulation Processes for the Development of Cognition in Artificial Agents , 2016, Front. Robot. AI.