Multi-Task Joint Learning Model for Segmenting and Classifying Tongue Images Using a Deep Neural Network

Automatic tongue image segmentation and tongue image classification are two crucial tongue characterization tasks in traditional Chinese medicine (TCM). Due to the complexity of tongue segmentation and fine-grained traits of tongue image classification, both tasks are challenging. Fortunately, from the perspective of computer vision, these two tasks are highly interrelated, making them compatible with the idea of Multi-Task Joint learning (MTL). By sharing the underlying parameters and adding two different task loss functions, an MTL method for segmenting and classifying tongue images is proposed in this paper. Moreover, two state-of-the-art deep neural network variants (UNET and Discriminative Filter Learning (DFL)) are fused into the MTL to perform these two tasks. To the best of our knowledge, our method is the first attempt to manage both tasks simultaneously with MTL. We conducted extensive experiments with the proposed method. The experimental results show that our joint method outperforms the existing tongue characterization methods. Besides, visualizations and ablation studies are provided to aid in understanding our approach, which suggest that our method is highly consistent with human perception.

[1]  Dan Wang,et al.  Automatic tongue image segmentation based on histogram projection and matting , 2014, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[2]  Qiang Yang,et al.  An Overview of Multi-task Learning , 2018 .

[3]  Xinfeng Zhang,et al.  Constitution Identification of Tongue Image Based on CNN , 2018, 2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI).

[4]  Bo Yan,et al.  Classification of tongue color based on CNN , 2017, 2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA)(.

[5]  Fei Su,et al.  ENS-Unet: End-to-End Noise Suppression U-Net for Brain Tumor Segmentation , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[6]  Yi Zhang,et al.  Tooth-Marked Tongue Recognition Using Multiple Instance Learning and CNN Features , 2019, IEEE Transactions on Cybernetics.

[7]  Lina Yao,et al.  A Survey on Deep Learning based Brain Computer Interface: Recent Advances and New Frontiers , 2019, ArXiv.

[8]  Changsheng Xu,et al.  Joint Pose and Expression Modeling for Facial Expression Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9]  David Zhang,et al.  Tongue shape classification by geometric features , 2010, Inf. Sci..

[10]  Dinggang Shen,et al.  Sparse Multiview Task-Centralized Ensemble Learning for ASD Diagnosis Based on Age- and Sex-Related Functional Connectivity Patterns , 2019, IEEE Transactions on Cybernetics.

[11]  Jionglong Su,et al.  Adaptive active contour model based automatic tongue image segmentation , 2016, 2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI).

[12]  Chi-Wing Fu,et al.  H-DenseUNet: Hybrid Densely Connected UNet for Liver and Tumor Segmentation From CT Volumes , 2018, IEEE Transactions on Medical Imaging.

[13]  Dinggang Shen,et al.  BIRNet: Brain image registration using dual‐supervised fully convolutional networks , 2018, Medical Image Anal..

[14]  Bob Zhang,et al.  Significant Geometry Features in Tongue Image Analysis , 2015, Evidence-based complementary and alternative medicine : eCAM.

[15]  Xiaoou Tang,et al.  Facial Landmark Detection by Deep Multi-task Learning , 2014, ECCV.

[16]  Yanyun Qu,et al.  Deeptongue: Tongue Segmentation Via Resnet , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17]  Bolei Zhou,et al.  Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  David Zhang,et al.  The bi-elliptical deformable contour and its application to automated tongue segmentation in Chinese medicine , 2005, IEEE Transactions on Medical Imaging.

[19]  Leixin Zhou,et al.  Multiple surface segmentation using convolution neural nets: application to retinal layer segmentation in OCT images , 2018, Biomedical optics express.

[20]  Jianwei Wang,et al.  Joint learning for pulmonary nodule segmentation, attributes and malignancy prediction , 2018, 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018).

[21]  Nicolas Thome,et al.  Multitask Classification and Segmentation for Cancer Diagnosis in Mammography , 2019, 1909.05397.

[22]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Lizhong Zhang,et al.  Application of Image Segmentation Technique in Tongue Diagnosis , 2009, 2009 International Forum on Information Technology and Applications.

[24]  David Zhang,et al.  Statistical Analysis of Tongue Images for Feature Extraction and Diagnostics , 2013, IEEE Transactions on Image Processing.

[25]  Xiangyang Xue,et al.  Multi-task Deep Neural Network for Joint Face Recognition and Facial Attribute Prediction , 2017, ICMR.

[26]  David Zhang,et al.  A snake‐based approach to automated segmentation of tongue image using polar edge detector , 2006, Int. J. Imaging Syst. Technol..

[27]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[28]  Larry S. Davis,et al.  Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition , 2016, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[29]  David Zhang,et al.  Computerized tongue diagnosis based on Bayesian networks , 2004, IEEE Transactions on Biomedical Engineering.

[30]  Tao Zhou,et al.  Tongue Shape Detection Based on B-Spline , 2006, 2006 International Conference on Machine Learning and Cybernetics.

[31]  Lu Wang,et al.  Automated Tongue Segmentation in Chinese Medicine Based on Deep Learning , 2018, ICONIP.

[32]  Dinggang Shen,et al.  Effective feature learning and fusion of multimodality data using stage‐wise deep neural network for dementia diagnosis , 2018, Human brain mapping.

[33]  Huiliang Shang,et al.  A novel automatic tongue image segmentation algorithm: Color enhancement method based on L*a*b* color space , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[34]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[35]  Zuoyong Li,et al.  Tongue Image Segmentation via Color Decomposition and Thresholding , 2017, 2017 4th International Conference on Information Science and Control Engineering (ICISCE).

[36]  Bo Yan,et al.  Computerized tongue coating nature diagnosis using convolutional neural network , 2017, 2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA)(.

[37]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Ratchadaporn Kanawong,et al.  Features for automated tongue image shape classification , 2012, 2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops.

[39]  Rama Chellappa,et al.  HyperFace: A Deep Multi-Task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Jinhong Guo,et al.  Intelligent Syndrome Differentiation of Traditional Chinese Medicine by ANN: A Case Study of Chronic Obstructive Pulmonary Disease , 2019, IEEE Access.