Dietary Composition Perception Algorithm Using Social Robot Audition for Mandarin Chinese

As the problem of an aging population becomes more and more serious, social robots have an increasingly significant influence on human life. By employing regular question-and-answer conversations or wearable devices, some social robotics products can establish personal health archives. But those robots are unable to collect diet information automatically through robot vision or audition. A healthy diet can reduce a person’s risk of developing cancer, diabetes, heart disease, and other age-related diseases. In order to automatically perceive the dietary composition of the elderly by listening to people’s chatting, this paper proposed a chat-based automatic dietary composition perception algorithm (DCPA). DCPA uses social robot audition to understand the semantic information and percept dietary composition for Mandarin Chinese. Firstly, based on the Mel-frequency cepstrum coefficient and convolutional neural network, a speaker recognition method is designed to identify speech data. Based on speech segmentation and speaker recognition algorithm, an audio segment classification method is proposed to distinguish different speakers, store their identity information and the sequence of expression in a speech conversation. Secondly, a dietetic lexicon is established, and two kinds of dietary composition semantic understanding algorithms are proposed to understand the eating semantics and sensor dietary composition information. To evaluate the performance of the proposed DCPA algorithm, we implemented the proposed DCPA in our social robot platform. Then we established two categories of test datasets relating to a one-person and a multi-person chat. The test results show that DCPA is capable of understanding users’ dietary compositions, with an F1 score of 0.9505, 0.8940 and 0.8768 for one-person talking, a two-person chat and a three-person chat, respectively. DCPA has good robustness for obtaining dietary information.

[1]  Tanvir Hossain,et al.  Temporal Information Extraction from Textual Data using Long Short Term Memory Recurrent Neural Network , 2018 .

[2]  Maurice van Keulen,et al.  Information Extraction for Social Media , 2014, SWAIE@COLING.

[3]  Raúl Gutiérrez,et al.  Automatic Synthesis of Logical Models for Order-Sorted First-Order Theories , 2018, Journal of Automated Reasoning.

[4]  Rekha P. Nair,et al.  Voiceprint Recognition Systems for Remote Authentication-A Survey , 2011 .

[5]  Nando de Freitas,et al.  A Deep Architecture for Semantic Parsing , 2014, ACL 2014.

[6]  Yang Li,et al.  Rapid Relocation Method for Mobile Robot Based on Improved ORB-SLAM2 Algorithm , 2019, Remote. Sens..

[7]  Gökhan Tür,et al.  Multi-Domain Joint Semantic Frame Parsing Using Bi-Directional RNN-LSTM , 2016, INTERSPEECH.

[8]  Siddhartha R. Jonnalagadda,et al.  Text Mining of the Electronic Health Record: An Information Extraction Approach for Automated Identification and Subphenotyping of HFpEF Patients for Clinical Trials , 2017, Journal of Cardiovascular Translational Research.

[9]  Haixun Wang,et al.  Probase: a probabilistic taxonomy for text understanding , 2012, SIGMOD Conference.

[10]  Pavel Smrž,et al.  Design of the Human-Robot Interaction for a Semi-Autonomous Service Robot to Assist Elderly People , 2015 .

[11]  Rongrong Fu,et al.  Study of the Home-Auxiliary Robot Based on BCI , 2018, Sensors.

[12]  Perry Share,et al.  Preparing for a Robot Future? Social Professions, Social Robotics and the Challenges Ahead , 2018 .

[13]  Albert Ali Salah,et al.  Fisher vectors with cascaded normalization for paralinguistic analysis , 2015, INTERSPEECH.

[14]  Weihua Sheng,et al.  Cognitive orientation assessment for older adults using social robots , 2017, 2017 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[15]  Haixun Wang,et al.  Short text understanding through lexical-semantic analysis , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[16]  Wanxiang Che,et al.  LTP: A Chinese Language Technology Platform , 2010, COLING.

[17]  Douglas E. Appelt,et al.  Introduction to Information Extraction , 1999, AI Commun..

[18]  E. Newport,et al.  WORD SEGMENTATION : THE ROLE OF DISTRIBUTIONAL CUES , 1996 .

[19]  Kun Li,et al.  Personalizing a Service Robot by Learning Human Habits from Behavioral Footprints , 2015 .

[20]  J. Cavanaugh,et al.  The Bayesian information criterion: background, derivation, and applications , 2012 .

[21]  Gang Chen,et al.  A New Remote Health-Care System Based on Moving Robot Intended for the Elderly at Home , 2018, Journal of healthcare engineering.

[22]  Fabien Ringeval,et al.  I Hear You Eat and Speak: Automatic Recognition of Eating Condition and Food Type, Use-Cases, and Impact on ASR Performance , 2016, PloS one.

[23]  Thomas Fang Zheng,et al.  Comparison of different implementations of MFCC , 2001, Journal of Computer Science and Technology.

[24]  Xin Chen,et al.  ChineseFoodNet: A large-scale Image Dataset for Chinese Food Recognition , 2017, ArXiv.

[25]  Ryo Kurazume,et al.  Service robot system with an informationally structured environment , 2015, Robotics Auton. Syst..

[26]  Alexander I. Rudnicky,et al.  Unsupervised induction and filling of semantic slots for spoken dialogue systems using frame-semantic parsing , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[27]  Zheng Fang,et al.  Comparison of different implementations of MFCC , 2001 .

[28]  Christer Samuelsson,et al.  A Statistical Theory of Dependency Syntax , 2000, COLING.

[29]  M Jin,et al.  [A customized method for information extraction from unstructured text data in the electronic medical records]. , 2018, Beijing da xue xue bao. Yi xue ban = Journal of Peking University. Health sciences.

[30]  Guohui Tian,et al.  A Selective Attention Guided Initiative Semantic Cognition Algorithm for Service Robot , 2018, Int. J. Autom. Comput..

[31]  Antonis A. Argyros,et al.  A Robot-based Application for Physical Exercise Training , 2016, ICT4AgeingWell.

[32]  Fredric C. Gey,et al.  The relationship between recall and precision , 1994 .

[33]  Yanbin Liu,et al.  Information extraction method and its application in Chinese equipment technical manual based on rule-matching , 2017, ICSCA '17.

[34]  Dan Yang,et al.  RiSH: A robot-integrated smart home for elderly care , 2018, Robotics Auton. Syst..

[35]  Eran Elinav,et al.  You are what you eat: diet, health and the gut microbiota , 2018, Nature Reviews Gastroenterology & Hepatology.

[36]  Weihua Sheng,et al.  Convolutional Neural Network-Based Embarrassing Situation Detection under Camera for Social Robot in Smart Homes , 2018, Sensors.

[37]  Roger S. Brown,et al.  Linguistic determinism and the part of speech. , 1957, Journal of abnormal psychology.