A Review on Human-Computer Interaction and Intelligent Robots

In the field of artificial intelligence, human–computer interaction (HCI) technology and its related intelligent robot technologies are essential and interesting contents of research. From the perspective of software algorithm and hardware system, these above-mentioned technologies study and try to build a natural HCI environment. The purpose of this research is to provide an overview of HCI and intelligent robots. This research highlights the existing technologies of listening, speaking, reading, writing, and other senses, which are widely used in human interaction. Based on these same technologies, this research introduces some intelligent robot systems and platforms. This paper also forecasts some vital challenges of researching HCI and intelligent robots. The authors hope that this work will help researchers in the field to acquire the necessary information and technologies to further conduct more advanced research.

[1]  Ian Lane,et al.  Recurrent Models for Auditory Attention in Multi-Microphone Distant Speech Recognition , 2016, INTERSPEECH.

[2]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[3]  Arun Ross,et al.  Long range iris recognition: A survey , 2017, Pattern Recognit..

[4]  Kristy Elizabeth Boyer,et al.  Unsupervised Classification of Student Dialogue Acts with Query-Likelihood Clustering , 2013, EDM.

[5]  Fuchun Sun,et al.  Visual–Tactile Fusion for Object Recognition , 2017, IEEE Transactions on Automation Science and Engineering.

[6]  Fei Liu,et al.  Automatic Summarization of Student Course Feedback , 2016, HLT-NAACL.

[7]  Amir Hussain,et al.  Applications of Deep Learning and Reinforcement Learning to Biological Data , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[8]  Xinyan Xiao,et al.  DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications , 2017, QA@ACL.

[9]  Fuji Ren,et al.  Predicting User-Topic Opinions in Twitter with Social and Topical Context , 2013, IEEE Transactions on Affective Computing.

[10]  Victor O. K. Li,et al.  Non-Autoregressive Neural Machine Translation , 2017, ICLR.

[11]  BengioYoshua,et al.  Using recurrent neural networks for slot filling in spoken language understanding , 2015 .

[12]  Masanori Morise,et al.  WORLD: A Vocoder-Based High-Quality Speech Synthesis System for Real-Time Applications , 2016, IEICE Trans. Inf. Syst..

[13]  H. Gardner,et al.  Frames of Mind: The Theory of Multiple Intelligences , 1983 .

[14]  Alex Graves,et al.  Sequence Transduction with Recurrent Neural Networks , 2012, ArXiv.

[15]  Francis R. Willett,et al.  Restoration of reaching and grasping in a person with tetraplegia through brain-controlled muscle stimulation: a proof-of-concept demonstration , 2017, The Lancet.

[16]  Alexander I. Rudnicky,et al.  Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken Language Understanding , 2015, ACL.

[17]  Shingo Kuroiwa,et al.  A Model of Mental State Transition Network , 2007 .

[18]  Danna Zhou,et al.  d. , 1934, Microbial pathogenesis.

[19]  Liu Ting,et al.  Topic Augmented Convolutional Neural Network for User Interest Recognition , 2018 .

[20]  Fuji Ren Member,et al.  TFSM-based dialogue management model framework for affective dialogue systems , 2015 .

[21]  Ruifeng Li,et al.  Interface Design of a Physical Human–Robot Interaction System for Human Impedance Adaptive Skill Transfer , 2018, IEEE Transactions on Automation Science and Engineering.

[22]  Richard Socher,et al.  A Deep Reinforced Model for Abstractive Summarization , 2017, ICLR.

[23]  Chen Xiao-mei,et al.  A research of improved algorithm for GMM voiceprint recognition model , 2016, 2016 Chinese Control and Decision Conference (CCDC).

[24]  Xiao Sun,et al.  Hybrid spatiotemporal models for sentiment classification via galvanic skin response , 2019, Neurocomputing.

[25]  Matthew Richardson,et al.  MCTest: A Challenge Dataset for the Open-Domain Machine Comprehension of Text , 2013, EMNLP.

[26]  Gordon Cheng,et al.  A Tactile-Based Framework for Active Object Learning and Discrimination using Multimodal Robotic Skin , 2017, IEEE Robotics and Automation Letters.

[27]  Gang Liu,et al.  Advanced LSTM: A Study About Better Time Dependency Modeling in Emotion Recognition , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[28]  Rama Chellappa,et al.  HyperFace: A Deep Multi-Task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  David A. Wagner,et al.  Audio Adversarial Examples: Targeted Attacks on Speech-to-Text , 2018, 2018 IEEE Security and Privacy Workshops (SPW).

[30]  Dong Yu,et al.  Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[31]  Jason Weston,et al.  Key-Value Memory Networks for Directly Reading Documents , 2016, EMNLP.

[32]  Tara N. Sainath,et al.  Deep convolutional neural networks for LVCSR , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[33]  Chun-Liang Hsu,et al.  EOG-based Human-Computer Interface system development , 2010, Expert Syst. Appl..

[34]  Zheru Chi,et al.  Facial Expression Recognition in Video with Multiple Feature Fusion , 2018, IEEE Transactions on Affective Computing.

[35]  Zhiyuan Liu,et al.  Denoising Distantly Supervised Open-Domain Question Answering , 2018, ACL.

[36]  Ya Zhang,et al.  Deep feature for text-dependent speaker verification , 2015, Speech Commun..

[37]  Matti Pietikäinen,et al.  Facial Micro-Expression Recognition Using Spatiotemporal Local Binary Pattern with Integral Projection , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[38]  Lantao Yu,et al.  SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient , 2016, AAAI.

[39]  Heng Yang,et al.  Facial feature point detection: A comprehensive survey , 2014, Neurocomputing.

[40]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[41]  Fuji Ren,et al.  Role-explicit query extraction and utilization for quantifying user intents , 2016, Inf. Sci..

[42]  Martin Wattenberg,et al.  Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation , 2016, TACL.

[43]  Stefanos Zafeiriou,et al.  Correlated-spaces regression for learning continuous emotion dimensions , 2013, MM '13.

[44]  Hang Li,et al.  Neural Responding Machine for Short-Text Conversation , 2015, ACL.

[45]  Yann Dauphin,et al.  Convolutional Sequence to Sequence Learning , 2017, ICML.

[46]  Ratko Grbić,et al.  Improving optical character recognition performance for low quality images , 2017, 2017 International Symposium ELMAR.

[47]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[48]  Nilanjan Dey,et al.  Developing residential wireless sensor networks for ECG healthcare monitoring , 2017, IEEE Transactions on Consumer Electronics.

[49]  Yann Dauphin,et al.  A Convolutional Encoder Model for Neural Machine Translation , 2016, ACL.

[50]  Changqin Quan,et al.  Examining Accumulated Emotional Traits in Suicide Blogs With an Emotion Topic Model , 2016, IEEE Journal of Biomedical and Health Informatics.

[51]  Rafael A. Calvo,et al.  Automated Detection of Engagement Using Video-Based Estimation of Facial Expressions and Heart Rate , 2017, IEEE Transactions on Affective Computing.

[52]  Cícero Nogueira dos Santos,et al.  Boosting Named Entity Recognition with Neural Character Embeddings , 2015, NEWS@ACL.

[53]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[54]  Guanglin Li,et al.  Development of Sensory-Motor Fusion-Based Manipulation and Grasping Control for a Robotic Hand-Eye System , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[55]  Aishan Wumaier,et al.  Bidirectional Long Short-Term Memory Network with a Conditional Random Field Layer for Uyghur Part-Of-Speech Tagging , 2017, Inf..

[56]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[57]  Ruslan Salakhutdinov,et al.  Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks , 2016, ICLR.

[58]  Yusheng Ji,et al.  Sleepy: Wireless Channel Data Driven Sleep Monitoring via Commodity WiFi Devices , 2020, IEEE Transactions on Big Data.

[59]  Richard Kittredge,et al.  Computer Generation of Marine Weather Forecast Text , 1988 .

[60]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[61]  Hong Zhang,et al.  Development of ergonomic posture recognition technique based on 2D ordinary camera for construction hazard prevention through view-invariant features in 2D skeleton motion , 2017, Adv. Eng. Informatics.

[62]  Yong-Lae Park,et al.  Design of a Lightweight Soft Robotic Arm Using Pneumatic Artificial Muscles and Inflatable Sleeves. , 2017, Soft robotics.

[63]  Marcin Mironczuk,et al.  A recent overview of the state-of-the-art elements of text classification , 2018, Expert Syst. Appl..

[64]  Daniel Marcu,et al.  Induction of Word and Phrase Alignments for Automatic Document Summarization , 2005, CL.

[65]  Angeliki Metallinou,et al.  Topic-based Evaluation for Conversational Bots , 2018, ArXiv.

[66]  Jianfeng Gao,et al.  Deep Reinforcement Learning for Dialogue Generation , 2016, EMNLP.

[67]  Erich Elsen,et al.  Deep Speech: Scaling up end-to-end speech recognition , 2014, ArXiv.

[68]  Yoshua Bengio,et al.  Drawing and Recognizing Chinese Characters with Recurrent Neural Network , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[69]  Yu Wang,et al.  TFSM‐based dialogue management model framework for affective dialogue systems , 2015 .

[70]  Heiga Zen,et al.  WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[71]  Yang Liu,et al.  A Multi-Task Learning Framework for Emotion Recognition Using 2D Continuous Space , 2017, IEEE Transactions on Affective Computing.

[72]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[73]  Mohamed Elhoseny,et al.  Emotion recognition using empirical mode decomposition and approximation entropy , 2018, Comput. Electr. Eng..

[74]  Marcus Liwicki,et al.  TAC-GAN - Text Conditioned Auxiliary Classifier Generative Adversarial Network , 2017, ArXiv.

[75]  Jason Weston,et al.  Reading Wikipedia to Answer Open-Domain Questions , 2017, ACL.

[76]  Tianmiao Wang,et al.  Current Researches and Future Development Trend of Intelligent Robot: A Review , 2018, Int. J. Autom. Comput..

[77]  Alan Ritter,et al.  Adversarial Learning for Neural Dialogue Generation , 2017, EMNLP.

[78]  Ting Liu,et al.  Attention-over-Attention Neural Networks for Reading Comprehension , 2016, ACL.

[79]  Emilio Serrano,et al.  Automatic Music Generation by Deep Learning , 2018, DCAI.

[80]  Grigoriy Sterling,et al.  Emotion Recognition From Speech With Recurrent Neural Networks , 2017, ArXiv.

[81]  Xu Sun,et al.  Cross-Domain and Semisupervised Named Entity Recognition in Chinese Social Media: A Unified Model , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[82]  Noah A. Smith,et al.  Toward Abstractive Summarization Using Semantic Representations , 2018, NAACL.

[83]  Holger Schwenk,et al.  Continuous Space Translation Models for Phrase-Based Statistical Machine Translation , 2012, COLING.

[84]  Alfred A. Rizzi,et al.  The LittleDog robot , 2011, Int. J. Robotics Res..

[85]  Thomas S. Huang,et al.  Survey of Face Detection on Low-Quality Images , 2018, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[86]  Xinlei Chen,et al.  Mind's eye: A recurrent visual representation for image caption generation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[87]  Rajendran Parthiban,et al.  Joint facial expression recognition and intensity estimation based on weighted votes of image sequences , 2017, Pattern Recognit. Lett..

[88]  Nenghai Yu,et al.  Deliberation Networks: Sequence Generation Beyond One-Pass Decoding , 2017, NIPS.

[89]  Hideki Kawahara,et al.  Implementation of realtime STRAIGHT speech manipulation system: Report on its first implementation , 2007 .

[90]  Fuji Ren,et al.  Emotion computing using Word Mover’s Distance features based on Ren_CECps , 2018, PloS one.

[91]  J. Schmidhuber,et al.  Framewise phoneme classification with bidirectional LSTM networks , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[92]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[93]  David Vandyke,et al.  Multi-domain Dialog State Tracking using Recurrent Neural Networks , 2015, ACL.

[94]  Richard Sproat,et al.  Multilingual Text-to-Speech Synthesis: The Bell Labs Approach , 1998, CL.

[95]  Guoying Zhao,et al.  A Main Directional Mean Optical Flow Feature for Spontaneous Micro-Expression Recognition , 2016, IEEE Transactions on Affective Computing.

[96]  A. Mehrabian Pleasure-arousal-dominance: A general framework for describing and measuring individual differences in Temperament , 1996 .

[97]  Sampo Pyysalo,et al.  Attending to Characters in Neural Sequence Labeling Models , 2016, COLING.

[98]  L. Gottfredson Mainstream science on intelligence: An editorial with 52 signatories, history, and bibliography , 1997 .

[99]  Fuji Ren,et al.  Affective Information Processing and Recognizing Human Emotion , 2006, MFCSIT.

[100]  Alex Waibel,et al.  Review of TDNN (time delay neural network) architectures for speech recognition , 1991, 1991., IEEE International Sympoisum on Circuits and Systems.

[101]  Sebastian Riedel,et al.  Constructing Datasets for Multi-hop Reading Comprehension Across Documents , 2017, TACL.

[102]  Sérgio F. Chevtchenko,et al.  Multi-objective optimization for hand posture recognition , 2018, Expert Syst. Appl..

[103]  Oliveira-SantosThiago,et al.  Facial expression recognition with Convolutional Neural Networks , 2017 .

[104]  Geoffrey Zweig,et al.  Recurrent neural networks for language understanding , 2013, INTERSPEECH.

[105]  L. F. Barrett Discrete Emotions or Dimensions? The Role of Valence Focus and Arousal Focus , 1998 .

[106]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[107]  Zhu Qiaoming,et al.  A Macro Discourse Primary and Secondary Relation Recognition Method Based on Topic Similarity , 2017 .

[108]  YoungSteve,et al.  The application of hidden Markov models in speech recognition , 2007 .

[109]  Yi Peng,et al.  Evaluation of clustering algorithms for financial risk analysis using MCDM methods , 2014, Inf. Sci..

[110]  Chuan Qin,et al.  How Images Inspire Poems: Generating Classical Chinese Poetry from Images with Memory Networks , 2018, AAAI.

[111]  Guillaume Lample,et al.  Neural Architectures for Named Entity Recognition , 2016, NAACL.

[112]  Christopher Clark,et al.  Simple and Effective Multi-Paragraph Reading Comprehension , 2017, ACL.

[113]  Yusheng Ji,et al.  MoSense: An RF-Based Motion Detection System via Off-the-Shelf WiFi Devices , 2017, IEEE Internet of Things Journal.

[114]  Yutaka Matsuo,et al.  Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder , 2018, INTERSPEECH.

[115]  Feng Ji,et al.  Memory-Augmented Dialogue Management for Task-Oriented Dialogue Systems , 2018, ACM Trans. Inf. Syst..

[116]  Zhenqi Li,et al.  A Review of Emotion Recognition Using Physiological Signals , 2018, Sensors.

[117]  James H. Carlisle Evaluating the impact of office automation on top management communication , 1976, AFIPS '76.

[118]  Siqi Liu,et al.  Improved Image Captioning via Policy Gradient optimization of SPIDEr , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[119]  Patrick Kenny,et al.  Joint Factor Analysis of Speaker and Session Variability: Theory and Algorithms , 2006 .

[120]  Danushka Bollegala,et al.  A Bottom-Up Approach to Sentence Ordering for Multi-Document Summarization , 2006, ACL.

[121]  Hsiao-Wuen Hon,et al.  Speaker-independent phone recognition using hidden Markov models , 1989, IEEE Trans. Acoust. Speech Signal Process..

[122]  Fuji Ren Member,et al.  Facial expression recognition based on AAM–SIFT and adaptive regional weighting , 2015 .

[123]  Lalit R. Bahl,et al.  Maximum mutual information estimation of hidden Markov model parameters for speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[124]  Chang-Hwan Im,et al.  Real-Time “Eye-Writing” Recognition Using Electrooculogram , 2017, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[125]  Fuji Ren,et al.  Exploring latent semantic information for textual emotion recognition in blog articles , 2018, IEEE/CAA Journal of Automatica Sinica.

[126]  Joseph E LeDoux Emotion circuits in the brain. , 2009, Annual review of neuroscience.

[127]  Enhong Chen,et al.  Chinese Poetry Generation with Planning based Neural Network , 2016, COLING.

[128]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[129]  Lalit R. Bahl,et al.  A Maximum Likelihood Approach to Continuous Speech Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[130]  Qin Lu,et al.  Applying regression models to query-focused multi-document summarization , 2011, Inf. Process. Manag..

[131]  Hao He,et al.  Integrated Approach of Dynamic Human Eye Movement Recognition and Tracking in Real Time , 2016, 2016 International Conference on Virtual Reality and Visualization (ICVRV).

[132]  Ren Fuji Enriching mental Engineering , 2010 .

[133]  Rui Yan,et al.  Learning to Respond with Deep Neural Networks for Retrieval-Based Human-Computer Conversation System , 2016, SIGIR.

[134]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[135]  Fuji Ren,et al.  Advanced Information Retrieval , 2006, MFCSIT.

[136]  Yoshua Bengio,et al.  SampleRNN: An Unconditional End-to-End Neural Audio Generation Model , 2016, ICLR.

[137]  Shogo Tokai,et al.  Point Clouds Based 3D Facial Expression Generation , 2017 .

[138]  Mika V. Mäntylä,et al.  The evolution of sentiment analysis - A review of research topics, venues, and top cited papers , 2016, Comput. Sci. Rev..

[139]  Shuohang Wang,et al.  Learning Natural Language Inference with LSTM , 2015, NAACL.

[140]  Edilson de Aguiar,et al.  Facial expression recognition with Convolutional Neural Networks: Coping with few data and the training sample order , 2017, Pattern Recognit..

[141]  Ngoc Thang Vu,et al.  Attentive Convolutional Neural Network Based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech , 2017, INTERSPEECH.

[142]  Anil K. Jain,et al.  Fingerprint Recognition of Young Children , 2017, IEEE Transactions on Information Forensics and Security.

[143]  P. Shaver,et al.  Emotion knowledge: further exploration of a prototype approach. , 1987, Journal of personality and social psychology.

[144]  Alexander Gutkin,et al.  Recent Advances in Google Real-Time HMM-Driven Unit Selection Synthesizer , 2016, INTERSPEECH.

[145]  Fuji Ren,et al.  Facial expression recognition based on AAM–SIFT and adaptive regional weighting , 2015 .

[146]  Xiaojun Wan,et al.  Towards Constructing Sports News from Live Text Commentary , 2016, ACL.

[147]  Changqin Quan,et al.  Feature-level sentiment analysis by using comparative domain corpora , 2016, Enterp. Inf. Syst..

[148]  Dragomir R. Radev,et al.  Coherent Citation-Based Summarization of Scientific Papers , 2011, ACL.

[149]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[150]  David Vandyke,et al.  A Network-based End-to-End Trainable Task-oriented Dialogue System , 2016, EACL.

[151]  Dilek Z. Hakkani-Tür,et al.  Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems , 2018, NAACL.

[152]  Jun Morimoto,et al.  EMG-Based Model Predictive Control for Physical Human–Robot Interaction: Application for Assist-As-Needed Control , 2018, IEEE Robotics and Automation Letters.

[153]  Tian Gao,et al.  Efficient Markov Blanket Discovery and Its Application , 2017, IEEE Transactions on Cybernetics.

[154]  Piji Li,et al.  Abstractive Multi-Document Summarization via Phrase Selection and Merging , 2015, ACL.

[155]  M. Shamim Hossain,et al.  Audio-visual emotion recognition using multi-directional regression and Ridgelet transform , 2016, Journal on Multimodal User Interfaces.

[156]  Sungroh Yoon,et al.  Polyphonic Music Generation with Sequence Generative Adversarial Networks , 2017 .

[157]  Shuohang Wang,et al.  Machine Comprehension Using Match-LSTM and Answer Pointer , 2016, ICLR.

[158]  Fuji Ren,et al.  Background Knowledge Based Multi-Stream Neural Network for Text Classification , 2018, Applied Sciences.

[159]  Jacek Gwizdka,et al.  Using Wireless EEG Signals to Assess Memory Workload in the $n$-Back Task , 2016, IEEE Transactions on Human-Machine Systems.

[160]  George Kurian,et al.  Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.

[161]  Jian Sun,et al.  Object Detection Networks on Convolutional Feature Maps , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[162]  Hua Li,et al.  Document Summarization Using Conditional Random Fields , 2007, IJCAI.

[163]  Tauhid Zaman,et al.  Predicting Performance Under Stressful Conditions Using Galvanic Skin Response , 2016, ArXiv.

[164]  Youcef Tabet,et al.  Speech synthesis techniques. A survey , 2011, International Workshop on Systems, Signal Processing and their Applications, WOSSPA.

[165]  Javier Hernando,et al.  Deep belief networks for i-vector based speaker recognition , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[166]  Mohammad Soleymani,et al.  Analysis of EEG Signals and Facial Expressions for Continuous Emotion Detection , 2016, IEEE Transactions on Affective Computing.

[167]  Alfredo Petrosino,et al.  Iris recognition through machine learning techniques: A survey , 2016, Pattern Recognit. Lett..

[168]  Anima Anandkumar,et al.  Deep Active Learning for Named Entity Recognition , 2017, Rep4NLP@ACL.

[169]  Alberto Del Bimbo,et al.  Natural Human–Computer Interaction , 2010 .

[170]  Yi-Hsuan Yang,et al.  Convolutional Generative Adversarial Networks with Binary Neurons for Polyphonic Music Generation , 2018, ISMIR.

[171]  Shuang Xu,et al.  First Step Towards End-to-End Parametric TTS Synthesis: Generating Spectral Parameters with Neural Attention , 2016, INTERSPEECH.

[172]  Geoffrey Zweig,et al.  Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[173]  Nathan Schneider,et al.  Association for Computational Linguistics: Human Language Technologies , 2011 .

[174]  Yi Peng,et al.  Evaluation of Classification Algorithms Using MCDM and Rank Correlation , 2012, Int. J. Inf. Technol. Decis. Mak..

[175]  P MoranThomas,et al.  The keystroke-level model for user performance time with interactive systems , 1980 .

[176]  Jon Dobson,et al.  Remote control of cellular behaviour with magnetic nanoparticles. , 2008, Nature nanotechnology.

[177]  Oliver Brock,et al.  A novel type of compliant and underactuated robotic hand for dexterous grasping , 2016, Int. J. Robotics Res..

[178]  Noriaki Horii,et al.  A multichannel convolutional neural network for cross-language dialog state tracking , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).

[179]  RossArun,et al.  Long range iris recognition , 2017 .

[180]  Xiaojun Wan,et al.  Phrase-Based Presentation Slides Generation for Academic Papers , 2017, AAAI.

[181]  Yu Wang,et al.  A new factored POMDP model framework for affective tutoring systems: A NEW FACTORED POMDP MODEL FRAMEWORK , 2018 .

[182]  Jian Zhang,et al.  SQuAD: 100,000+ Questions for Machine Comprehension of Text , 2016, EMNLP.

[183]  L. R. Rabiner,et al.  An introduction to the application of the theory of probabilistic functions of a Markov process to automatic speech recognition , 1983, The Bell System Technical Journal.

[184]  Patrick Kenny,et al.  Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[185]  Jun Zhao,et al.  Recurrent Convolutional Neural Networks for Text Classification , 2015, AAAI.

[186]  Yi Peng,et al.  A Group Decision Making Model for Integrating Heterogeneous Information , 2018, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[187]  Nicolas Y. Masse,et al.  Reach and grasp by people with tetraplegia using a neurally controlled robotic arm , 2012, Nature.

[188]  Wei Wu,et al.  Question Condensing Networks for Answer Selection in Community Question Answering , 2018, ACL.

[189]  Begoña García Zapirain,et al.  A Stress Sensor Based on Galvanic Skin Response (GSR) Controlled by ZigBee , 2012, Sensors.

[190]  Petri Nokelainen,et al.  Measuring Multiple Intelligences and Moral Sensitivities in Education , 2011 .

[191]  Jungwon Lee,et al.  Residual LSTM: Design of a Deep Recurrent Architecture for Distant Speech Recognition , 2017, INTERSPEECH.

[192]  Xiao Sun,et al.  Emotional Human-Machine Conversation Generation Based on Long Short-Term Memory , 2017, Cognitive Computation.

[193]  Changqin Quan,et al.  Textual emotion recognition for enhancing enterprise computing , 2016, Enterp. Inf. Syst..

[194]  Cícero Nogueira dos Santos,et al.  Learning Character-level Representations for Part-of-Speech Tagging , 2014, ICML.

[195]  David Grangier,et al.  Neural Text Generation from Structured Data with Application to the Biography Domain , 2016, EMNLP.

[196]  Mark J. F. Gales,et al.  The Application of Hidden Markov Models in Speech Recognition , 2007, Found. Trends Signal Process..

[197]  Richard D. Roberts,et al.  The science of emotional intelligence : knowns and unknowns , 2008 .

[198]  Eunsol Choi,et al.  Coarse-to-Fine Question Answering for Long Documents , 2016, ACL.

[199]  Pei-Hao Su,et al.  Sample Efficient Deep Reinforcement Learning for Dialogue Systems With Large Action Spaces , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[200]  Yuchi Huang,et al.  Interactive Generative Adversarial Networks for Facial Expression Generation in Dyadic Interactions , 2018, ArXiv.

[201]  Feature Extraction Of Optical Character Recognition : Survey , 2022 .

[202]  Frans Coenen,et al.  Driving posture recognition by convolutional neural networks , 2015, 2015 11th International Conference on Natural Computation (ICNC).

[203]  Mei Wang,et al.  Deep Face Recognition: A Survey , 2018, Neurocomputing.

[204]  Yang Wang,et al.  Flexible and Creative Chinese Poetry Generation Using Neural Memory , 2017, ACL.

[205]  Xuanjing Huang,et al.  Part-of-Speech Tagging for Twitter with Adversarial Neural Networks , 2017, EMNLP.

[206]  Dilek Z. Hakkani-Tür,et al.  Scalable multi-domain dialogue state tracking , 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).

[207]  E. O. Polat,et al.  Energy‐Autonomous, Flexible, and Transparent Tactile Skin , 2017 .

[208]  Wenpeng Yin,et al.  Attention-Based Convolutional Neural Network for Machine Comprehension , 2016, ArXiv.

[209]  Yang Chen,et al.  Pairwise comparison matrix in multiple criteria decision making , 2016 .

[210]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[211]  H. Critchley,et al.  The influence of physiological signals on cognition , 2018, Current Opinion in Behavioral Sciences.

[212]  Honglak Lee,et al.  Attribute2Image: Conditional Image Generation from Visual Attributes , 2015, ECCV.

[213]  Anil K. Jain,et al.  Automated Latent Fingerprint Recognition , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[214]  Satoshi Tojo,et al.  Neural-based Natural Language Generation in Dialogue using RNN Encoder-Decoder with Semantic Aggregation , 2017, SIGDIAL Conference.

[215]  Fan Lin,et al.  Chinese Character CAPTCHA Recognition and performance estimation via deep neural network , 2018, Neurocomputing.

[216]  Shuo Yang,et al.  Faceness-Net: Face Detection through Deep Facial Part Responses , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[217]  Honghai Liu,et al.  Gesture Recognition Based on Kinect and sEMG Signal Fusion , 2018, Mobile Networks and Applications.

[218]  Qun Liu,et al.  Syntax-based Deep Matching of Short Texts , 2015, IJCAI.

[219]  Changqin Quan,et al.  A novel factored POMDP model for affective dialogue management , 2016, J. Intell. Fuzzy Syst..

[220]  Sabine Van Huffel,et al.  Evaluation of a Multichannel Non-Contact ECG System and Signal Quality Algorithms for Sleep Apnea Detection and Monitoring , 2018, Sensors.

[222]  Yang Liu,et al.  Fast Joint Compression and Summarization via Graph Cuts , 2013, EMNLP.

[223]  O. A. Fakolujo,et al.  A survey of face recognition techniques , 2007 .

[224]  Changqin Quan,et al.  A blog emotion corpus for emotional expression analysis in Chinese , 2010, Comput. Speech Lang..

[225]  Harish Karnick,et al.  Text Summarization using Abstract Meaning Representation , 2017, ArXiv.

[226]  Yi Peng,et al.  Soft consensus cost models for group decision making and economic interpretations , 2019, Eur. J. Oper. Res..

[227]  Ming Zhou,et al.  Ranking with Recursive Neural Networks and Its Application to Multi-Document Summarization , 2015, AAAI.

[228]  Yannis Agiomyrgiannakis,et al.  Vocaine the vocoder and applications in speech synthesis , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[229]  Shuai Wang,et al.  Deep learning for sentiment analysis: A survey , 2018, WIREs Data Mining Knowl. Discov..

[230]  Mohammad H. Mahoor,et al.  Going deeper in facial expression recognition using deep neural networks , 2015, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[231]  Phil Blunsom,et al.  Teaching Machines to Read and Comprehend , 2015, NIPS.

[232]  Ming Zhou,et al.  Reinforced Mnemonic Reader for Machine Reading Comprehension , 2017, IJCAI.

[233]  Xuan Zou,et al.  Illumination Invariant Face Recognition: A Survey , 2007, 2007 First IEEE International Conference on Biometrics: Theory, Applications, and Systems.

[234]  Maxine Eskénazi,et al.  Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders , 2017, ACL.

[235]  Sercan Ömer Arik,et al.  Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning , 2017, ICLR.

[236]  Allen Newell,et al.  The keystroke-level model for user performance time with interactive systems , 1980, CACM.

[237]  Peng Zhou,et al.  Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme , 2017, ACL.

[238]  Kevin Lin,et al.  Adversarial Ranking for Language Generation , 2017, NIPS.

[239]  Xin Kang,et al.  Employing hierarchical Bayesian networks in simple and complex emotion topic analysis , 2013, Comput. Speech Lang..

[240]  Christine L. Lisetti,et al.  HapFACS 3.0: FACS-Based Facial Expression Generator for 3D Speaking Virtual Characters , 2015, IEEE Transactions on Affective Computing.

[241]  Eric Nichols,et al.  Named Entity Recognition with Bidirectional LSTM-CNNs , 2015, TACL.

[242]  Kazuyuki Matsumoto,et al.  Emotion Analysis on Social Big Data , 2020 .

[243]  Bangyan Zhou,et al.  A Robust Online Saccadic Eye Movement Recognition Method Combining Electrooculography and Video , 2017, IEEE Access.

[244]  Kevin Blankespoor,et al.  BigDog, the Rough-Terrain Quadruped Robot , 2008 .

[245]  Xiaoyan Zhu,et al.  Emotional Chatting Machine: Emotional Conversation Generation with Internal and External Memory , 2017, AAAI.

[246]  Richard Socher,et al.  DCN+: Mixed Objective and Deep Residual Coattention for Question Answering , 2017, ICLR.

[247]  K. P. Soman,et al.  Deep Learning Based Part-of-Speech Tagging for Malayalam Twitter Data (Special Issue: Deep Learning Techniques for Natural Language Processing) , 2019, J. Intell. Syst..

[248]  Rohit Prabhavalkar,et al.  Exploring architectures, data and units for streaming end-to-end speech recognition with RNN-transducer , 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).

[249]  Yong Yu,et al.  Long Text Generation via Adversarial Training with Leaked Information , 2017, AAAI.

[250]  Ales Procházka,et al.  Microsoft Kinect Visual and Depth Sensors for Breathing and Heart Rate Analysis , 2016, Sensors.

[251]  Zhong Huang,et al.  Automatic Facial Expression Learning Method Based on Humanoid Robot XIN-REN , 2016, IEEE Transactions on Human-Machine Systems.

[252]  Rosalind W. Picard Affective computing: (526112012-054) , 1997 .

[253]  Pichao Wang,et al.  Large-Scale Multimodal Gesture Recognition Using Heterogeneous Networks , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[254]  Po-Sen Huang,et al.  Two-Stage Synthesis Networks for Transfer Learning in Machine Comprehension , 2017, EMNLP.

[255]  Sergio Escalera,et al.  Survey on Emotional Body Gesture Recognition , 2018, IEEE Transactions on Affective Computing.

[256]  Xin Zhang,et al.  Learning effective binary descriptors for micro-expression recognition transferred by macro-information , 2017, Pattern Recognit. Lett..

[257]  Fuji Ren,et al.  Semi-Automatic Creation of Youth Slang Corpus and Its Application to Affective Computing , 2016, IEEE Transactions on Affective Computing.

[258]  Jaime G. Carbonell,et al.  Phonologically Aware Neural Model for Named Entity Recognition in Low Resource Transfer Settings , 2016, EMNLP.

[259]  Joelle Pineau,et al.  A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues , 2016, AAAI.

[260]  Tom Carey,et al.  ACM SIGCHI Curricula for Human-Computer Interaction , 1992 .

[261]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[262]  Xin Jin,et al.  Face alignment in-the-wild: A Survey , 2016, Comput. Vis. Image Underst..

[263]  Lina Yao,et al.  Cascade and Parallel Convolutional Recurrent Neural Networks on EEG-based Intention Recognition for Brain Computer Interface , 2017, AAAI.

[264]  Sercan Ömer Arik,et al.  Deep Voice 2: Multi-Speaker Neural Text-to-Speech , 2017, NIPS.

[265]  Diemo Schwarz,et al.  Current Research in concatenative sound synthesis , 2005, ICMC.

[266]  Qiang Li,et al.  A Hand Gesture Recognition Framework and Wearable Gesture-Based Interaction Prototype for Mobile Devices , 2014, IEEE Transactions on Human-Machine Systems.

[267]  Yu Gu,et al.  PAWS: Passive Human Activity Recognition Based on WiFi Ambient Signals , 2016, IEEE Internet of Things Journal.

[268]  Yelong Shen,et al.  Dynamic Fusion Networks for Machine Reading Comprehension , 2017 .

[269]  Byron M. Yu,et al.  A high-performance brain–computer interface , 2006, Nature.

[270]  Soo-Young Lee,et al.  Emotional End-to-End Neural Speech Synthesizer , 2017, NIPS 2017.

[271]  Sander Dieleman,et al.  Beyond Temporal Pooling: Recurrence and Temporal Convolutions for Gesture Recognition in Video , 2015, International Journal of Computer Vision.

[272]  Lei Wang,et al.  Sentiment analysis of text based on three-way decisions , 2017, J. Intell. Fuzzy Syst..

[273]  Daniel E. Koditschek,et al.  RHex: A Simple and Highly Mobile Hexapod Robot , 2001, Int. J. Robotics Res..

[274]  Fuji Ren,et al.  Emotion classification using a CNN_LSTM-based model for smooth emotional synchronization of the humanoid robot REN-XIN , 2019, PloS one.

[275]  Alexandre Bernardino,et al.  Low-cost 3-axis soft tactile sensors for the human-friendly robot Vizzy , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[276]  Jianfeng Gao,et al.  A Human Generated MAchine Reading COmprehension Dataset , 2018 .

[277]  Zhong-Qiu Wang,et al.  Learning utterance-level representations for speech emotion and age/gender recognition using deep neural networks , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[278]  Rabia Jafri,et al.  A Survey of Face Recognition Techniques , 2009, J. Inf. Process. Syst..

[279]  Scott Kuindersma,et al.  Optimization-based locomotion planning, estimation, and control design for the atlas humanoid robot , 2015, Autonomous Robots.

[280]  Eduard H. Hovy,et al.  End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.

[281]  Yoshua Bengio,et al.  A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[282]  Fuji Ren,et al.  Correction to: Emotion recognition based on physiological signals using brain asymmetry index and echo state network , 2018, Neural Computing and Applications.

[283]  Giorgio Metta,et al.  Methods and Technologies for the Implementation of Large-Scale Robot Tactile Sensors , 2011, IEEE Transactions on Robotics.

[284]  Ling Shao,et al.  Multimedia Interaction and Intelligent User Interfaces , 2010 .

[285]  B. Everitt,et al.  Emotion and motivation: the role of the amygdala, ventral striatum, and prefrontal cortex , 2002, Neuroscience & Biobehavioral Reviews.

[286]  Jianfeng Gao,et al.  A Neural Network Approach to Context-Sensitive Generation of Conversational Responses , 2015, NAACL.

[287]  Seyyed Ali Seyyedsalehi,et al.  A Brain-Inspired Method of Facial Expression Generation Using Chaotic Feature Extracting Bidirectional Associative Memory , 2017, Neural Processing Letters.

[288]  Fuji Ren,et al.  Emotion recognition based on physiological signals using brain asymmetry index and echo state network , 2018, Neural Computing and Applications.

[289]  Geoffrey E. Hinton,et al.  Application of Deep Belief Networks for Natural Language Understanding , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[290]  Sebastian Ruder,et al.  Universal Language Model Fine-tuning for Text Classification , 2018, ACL.

[291]  Chong Wang,et al.  Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.

[292]  Dario Farina,et al.  Decoding Motor Unit Activity From Forearm Muscles: Perspectives for Myoelectric Control , 2018, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[293]  Jason Weston,et al.  The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations , 2015, ICLR.

[294]  Yang Liu,et al.  Using Supervised Bigram-based ILP for Extractive Summarization , 2013, ACL.

[295]  Mike Thelwall,et al.  Sentiment Analysis Is a Big Suitcase , 2017, IEEE Intelligent Systems.

[296]  Huaizu Jiang,et al.  Face Detection with the Faster R-CNN , 2016, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[297]  Hui Lin,et al.  A Class of Submodular Functions for Document Summarization , 2011, ACL.

[298]  Karl F. MacDorman,et al.  Review of constraints on vision-based gesture recognition for human-computer interaction , 2018, IET Comput. Vis..

[299]  Quoc V. Le,et al.  QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension , 2018, ICLR.

[300]  Houfeng Wang,et al.  Interactive Attention Networks for Aspect-Level Sentiment Classification , 2017, IJCAI.

[301]  Ahmed Hussen Abdelaziz Comparing Fusion Models for DNN-Based Audiovisual Continuous Speech Recognition , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[302]  Lin Zhao,et al.  Improving Multi-documents Summarization by Sentence Compression based on Expanded Constituent Parse Trees , 2014, EMNLP.

[303]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[305]  R. Sternberg,et al.  Intelligence: Knowns and unknowns. , 1996 .

[306]  Lei Zhang,et al.  Bottom-Up and Top-Down Attention for Image Captioning and VQA , 2017, ArXiv.

[307]  Adam Coates,et al.  Deep Voice: Real-time Neural Text-to-Speech , 2017, ICML.

[308]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[309]  David Vandyke,et al.  Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems , 2015, EMNLP.

[310]  Zhongfei Zhang,et al.  Semisupervised Autoencoder for Sentiment Analysis , 2015, AAAI.

[311]  Gökhan Tür,et al.  Optimizing SVMs for complex call classification , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[312]  Cynthia Breazeal,et al.  Function meets style: insights from emotion theory applied to HRI , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[313]  Wei Chen,et al.  Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets , 2017, NAACL.

[314]  Rama Chellappa,et al.  FaceNet2ExpNet: Regularizing a Deep Face Recognition Net for Expression Recognition , 2016, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).