Multimodal Categorization Based on Subjective Consistency for Robot Learning Word-Meaning from Human