Paper Information - Through-Screen Visible Light Sensing Empowered by Embedded Deep Learning

Motivated by the trend toward full-screen devices such as smartphones, this work proposes through-screen sensing with visible light for fingertip air-writing. The system recognizes handwritten digits using under-screen photodiodes as the receiver. The key idea is to detect the weak light that the finger reflects as it writes digits on top of the screen. Because the screen serves as a fixed light source, the proposed air-writing system is immune to scene changes. However, the screen is a double-edged sword: it is both the signal source and a noise source. We propose a data preprocessing method to reduce the interference from the screen as a noise source. We design a customized embedded deep learning model, ConvRNN, to capture the spatial and temporal patterns in the dynamic, weak reflected signal for air-written digit recognition. Evaluation results show that the through-screen fingertip air-writing system achieves an accuracy of up to 91%. Results further show that the size of the customized ConvRNN model can be reduced by 94% with less than a 10% drop in performance.
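To make the idea of combining convolutional and recurrent stages concrete, the following is a minimal sketch of a forward pass through a convolution-then-recurrence classifier over a multi-channel photodiode time series. It is not the authors' ConvRNN: the abstract does not give the layer layout, so the number of photodiode channels, the kernel length, the hidden size, the plain tanh recurrent cell, and every function name below are assumptions chosen for illustration, and the weights are random rather than trained.

```python
import math
import random


def conv1d_relu(seq, kernels, bias):
    """Valid 1-D convolution over time followed by ReLU.

    seq: list of T frames, each a list of C photodiode readings.
    kernels: list of F filters, each a K x C nested list.
    Returns T - K + 1 feature vectors of length F.
    """
    k = len(kernels[0])
    out = []
    for t in range(len(seq) - k + 1):
        frame = []
        for f, kern in enumerate(kernels):
            s = bias[f]
            for dt in range(k):
                for c, w in enumerate(kern[dt]):
                    s += w * seq[t + dt][c]
            frame.append(max(0.0, s))
        out.append(frame)
    return out


def rnn_last_state(feats, w_in, w_rec):
    """Plain tanh recurrent cell (an assumption); returns the final hidden state."""
    h = [0.0] * len(w_rec)
    for x in feats:
        h = [
            math.tanh(
                sum(wi * xi for wi, xi in zip(w_in[j], x))
                + sum(wr * hi for wr, hi in zip(w_rec[j], h))
            )
            for j in range(len(h))
        ]
    return h


def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    total = sum(e)
    return [v / total for v in e]


def rand_matrix(rows, cols, rng, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def classify(seq, params):
    """Convolution extracts local patterns; the recurrent cell summarizes them over time."""
    feats = conv1d_relu(seq, params["kernels"], params["bias"])
    h = rnn_last_state(feats, params["w_in"], params["w_rec"])
    logits = [sum(w * hi for w, hi in zip(row, h)) for row in params["w_out"]]
    return softmax(logits)


# Hypothetical sizes: 2 photodiodes, kernel length 5, 4 filters, 8 hidden units, 50 samples.
rng = random.Random(0)
C, K, F, H, T = 2, 5, 4, 8, 50
params = {
    "kernels": [rand_matrix(K, C, rng) for _ in range(F)],
    "bias": [0.0] * F,
    "w_in": rand_matrix(H, F, rng),
    "w_rec": rand_matrix(H, H, rng),
    "w_out": rand_matrix(10, H, rng),  # one output per digit 0-9
}
seq = [[rng.random() for _ in range(C)] for _ in range(T)]  # stand-in for a recorded stroke
probs = classify(seq, params)
```

With untrained weights the ten output probabilities are close to uniform; the sketch only illustrates how data flows from the raw reflected-light samples through a temporal convolution and a recurrent summary to a digit distribution.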
