Smart Multimedia

Social interactions mediate our communication with others, enable development and maintenance of personal and professional relationships, and contribute greatly to our health. While both verbal cues (i.e., speech) and nonverbal cues (e.g., facial expressions, hand gestures, and body language) are exchanged during social interactions, the latter encompasses more information (*65%). Given their inherent visual nature, non-verbal cues are largely inaccessible to individuals who are blind, putting this population at a social disadvantage compared to their sighted peers. For individuals who are blind, embarrassing social situations are not uncommon due to miscommunication, which can lead to social avoidance and isolation. In this paper, we propose a mapping between visual facial expressions, represented as facial action units, which may be extracted using computer vision algorithms, to haptic (vibrotactile) representations, toward discreet and real-time perception of facial expressions during social interactions by individuals who are blind.

[1]  Yoram Bresler,et al.  Structured Overcomplete Sparsifying Transform Learning with Convergence Guarantees and Applications , 2015, International Journal of Computer Vision.

[2]  Yonina C. Eldar,et al.  Blind Compressed Sensing , 2010, IEEE Transactions on Information Theory.

[3]  Alessandro Foi,et al.  Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering , 2007, IEEE Transactions on Image Processing.

[4]  Guillermo Sapiro,et al.  Non-local sparse models for image restoration , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[5]  Vadlamani Ravi,et al.  A survey on opinion mining and sentiment analysis: Tasks, approaches and applications , 2015, Knowl. Based Syst..

[6]  Guan-Ming Su,et al.  Content aware quantization: Requantization of high dynamic range baseband signals based on visual masking by noise and texture , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[7]  Benjamin Guthier,et al.  A Dataset for Psychological Human Needs Detection From Social Networks , 2017, IEEE Access.

[8]  Guan-Ming Su,et al.  Transim: Transfer Image Local Statistics Across EOTFS for HDR Image Applications , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[9]  Ariel Shamir,et al.  Seam Carving for Content-Aware Image Resizing , 2007, ACM Trans. Graph..

[10]  Lei Zhang,et al.  Weighted Nuclear Norm Minimization and Its Applications to Low Level Vision , 2016, International Journal of Computer Vision.

[11]  Nick Bassiliades,et al.  Ontology-based sentiment analysis of twitter posts , 2013, Expert Syst. Appl..

[12]  D. Donoho For most large underdetermined systems of linear equations the minimal 𝓁1‐norm solution is also the sparsest solution , 2006 .

[13]  Marc Teboulle,et al.  Fast Gradient-Based Algorithms for Constrained Total Variation Image Denoising and Deblurring Problems , 2009, IEEE Transactions on Image Processing.

[14]  Petra Kralj Novak,et al.  Sentiment of Emojis , 2015, PloS one.

[15]  Taoran Lu,et al.  ITP Colour Space and Its Compression Performance for High Dynamic Range and Wide Colour Gamut Video Distribution , 2016 .

[16]  Yanjun Li,et al.  When sparsity meets low-rankness: Transform learning with non-local low-rank constraint for image restoration , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17]  Saif Mohammad,et al.  Sentiment Analysis of Short Informal Texts , 2014, J. Artif. Intell. Res..

[18]  Yoram Bresler,et al.  MR Image Reconstruction From Highly Undersampled k-Space Data by Dictionary Learning , 2011, IEEE Transactions on Medical Imaging.

[19]  Stefan Harmeling,et al.  Image denoising: Can plain neural networks compete with BM3D? , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  David L Donoho,et al.  Compressed sensing , 2006, IEEE Transactions on Information Theory.

[21]  Andrew Cotton,et al.  A Display-Independent High Dynamic Range Television System , 2016 .

[22]  Jian Yang,et al.  MemNet: A Persistent Memory Network for Image Restoration , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[23]  Abdulmotaleb El-Saddik,et al.  Survey on ontologies for affective states and their influences , 2018, Semantic Web.

[24]  Uzay Kaymak,et al.  Exploiting Emoticons in Polarity Classification of Text , 2015, J. Web Eng..

[25]  Scott Miller,et al.  Perceptual Signal Coding for More Efficient Usage of Bit Codes , 2012 .

[26]  Jian Sun,et al.  Deep ADMM-Net for Compressive Sensing MRI , 2016, NIPS.

[27]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[28]  E. Candès,et al.  Stable signal recovery from incomplete and inaccurate measurements , 2005, math/0503066.

[29]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[30]  Yoram Bresler,et al.  FRIST—flipping and rotation invariant sparsifying transform learning and applications , 2015, ArXiv.

[31]  L. Rudin,et al.  Nonlinear total variation based noise removal algorithms , 1992 .

[32]  C. Andrew Segall,et al.  High quality HDR video compression using HEVC main 10 profile , 2016, 2016 Picture Coding Symposium (PCS).

[33]  Xianming Liu,et al.  When Image Denoising Meets High-Level Vision Tasks: A Deep Learning Approach , 2017, IJCAI.

[34]  Yanjun Li,et al.  Joint Adaptive Sparsity and Low-Rankness on the Fly: An Online Tensor Reconstruction Scheme for Video Denoising , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[35]  Roddy Cowie,et al.  What a neural net needs to know about emotion words , 1999 .

[36]  Guan-Ming Su,et al.  Impact Analysis of Baseband Quantizer on Coding Efficiency for HDR Video , 2016, IEEE Signal Processing Letters.

[37]  Michael Elad,et al.  Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[38]  E. Diener,et al.  Gender differences in negative affect and well-being: the case for emotional intensity. , 1991, Journal of personality and social psychology.

[39]  Guan-Ming Su,et al.  Single Layer Progressive Coding for High Dynamic Range Videos , 2018, 2018 Picture Coding Symposium (PCS).

[40]  Scott Miller,et al.  Color signal encoding for high dynamic range and wide color gamut based on human perception , 2014, Electronic Imaging.

[41]  Vaibhavi N Patodkar,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .

[42]  D. Donoho,et al.  Sparse MRI: The application of compressed sensing for rapid MR imaging , 2007, Magnetic resonance in medicine.

[43]  Martin Vetterli,et al.  Adaptive wavelet thresholding for image denoising and compression , 2000, IEEE Trans. Image Process..