Image-Based Electric Consumption Recognition via Multi-Task Learning

This work presents an approach to detect and recognize digits in meter displays by applying the multi-task learning technique. Two convolutional networks are used and divided as tasks to reach different goals. One to detect digits and another to recognize them from different sources, and at the end to return display prediction. In order to validate this methodology, a displays images dataset was labelled. In order to use multi-task learning to reduce the number of failures associated with the reading process. For this, we propose a two-task approach, the first one to detect digits using the networks Faster R-CNN and RetinaNet, and the second stage for recognizing them using Resnet152 network. The initial tests yielded promising results, with 1360 displays, divided into 70% for training and 30% for the test, obtained the following values of Mean Average Precision (mAP), 0.91 (Faster R-CNN) and 0.90 (RetinaNet) on detection and 98.2% accuracy in the classification.

[1]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  W. Pora,et al.  Development of an automatic meter reading system based on ZigBee PRO Smart Energy Profile IEEE 802.15.4 standard , 2012, 2012 IEEE International Conference on Electron Devices and Solid State Circuit (EDSSC).

[3]  Yuan Li,et al.  Meter character recognition method based on gray template matching , 2010, Proceedings of the 29th Chinese Control Conference.

[4]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[5]  P. S. Diniz,et al.  Automatic Power Consumption Reading on Digital Meters based on Adaptive Thresholds and Multiresolution Templates , 2018 .

[6]  Hanqing Lu,et al.  Automatic Watermeter Digit Recognition on Mobile Devices , 2017, ICIMCS.

[7]  Chirag I. Patel,et al.  Optical Character Recognition by Open source OCR Tool Tesseract: A Case Study , 2012 .

[8]  Yann LeCun,et al.  Convolutional networks and applications in vision , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[9]  Chris Eliasmith,et al.  Hyperopt: a Python library for model selection and hyperparameter optimization , 2015 .

[10]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[11]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[12]  M. Popa Gateway design and implementation in an Automatic Meter Reading system based on Power Line Communications , 2011, The 7th International Conference on Networked Computing and Advanced Information Management.

[13]  Kaiming He,et al.  Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[14]  Chunguo Jing,et al.  Study of the Automatic Reading of Watt Meter Based on Image Processing Technology , 2007, 2007 2nd IEEE Conference on Industrial Electronics and Applications.

[15]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[16]  Martin Cerman,et al.  A Mobile Recognition System for Analog Energy Meter Scanning , 2016, ISVC.

[17]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).