Weakly Supervised Spatial Deep Learning based on Imperfect Vector Labels with Registration Errors

This paper studies weakly supervised learning on spatial raster data based on imperfect vector training labels. Given raster feature imagery and imperfect (weak) vector labels with location registration errors, our goal is to learn a deep learning model for pixel classification and refine vector labels simultaneously. The problem is important in many geoscience applications such as streamline delineation and road mapping from earth imagery, where annotating imperfect coarse vector labels is far more efficient than drawing precise labels. But the problem is challenging due to the misalignment of vector labels with raster feature pixels and the need to infer true vector label location while learning neural network parameters. Existing works on weakly supervised learning often focus on noise and errors in label semantics, assuming label locations to be either correct or irrelevant (e.g., identical and independently distributed). A few works exist on label registration errors, but these methods often focus on label misalignment on object segment boundaries at the pixel level without guaranteeing vector continuity. To fill the gap, this paper proposes a spatial learning framework based on Expectation-Maximization that iteratively updates deep neural network parameters while inferring true vector label locations. Specifically, inference of true vector locations is based on both the current pixel class predictions and the geometric properties of vectors. Evaluations on real-world high-resolution remote sensing datasets in National Hydrography Dataset (NHD) refinement show that the proposed framework outperforms baseline methods in classification accuracy and refined vector quality.

[1]  Tom M. Mitchell,et al.  Weakly Supervised Extraction of Computer Security Events from Twitter , 2015, WWW.

[2]  E. Lynn Usery,et al.  Extensibility of U-Net Neural Network Model for Hydrographic Feature Extraction and Implications for Hydrologic Modeling , 2021, Remote. Sens..

[3]  Blake Anderson,et al.  Machine Learning for Encrypted Malware Traffic Classification: Accounting for Noisy Labels and Non-Stationarity , 2017, KDD.

[4]  Zhe Jiang,et al.  Monitoring Land-Cover Changes: A Machine-Learning Perspective , 2016, IEEE Geoscience and Remote Sensing Magazine.

[5]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Alexander Zipf,et al.  A taxonomy of quality assessment methods for volunteered and crowdsourced geographic information , 2018, Trans. GIS.

[7]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[8]  Zhe Jiang,et al.  A Survey on Spatial Prediction Methods , 2019, IEEE Transactions on Knowledge and Data Engineering.

[9]  Yang Zou,et al.  Simultaneous Edge Alignment and Learning , 2018, ECCV.

[10]  Zhi-Hua Zhou,et al.  Learning From Incomplete and Inaccurate Supervision , 2019, IEEE Transactions on Knowledge and Data Engineering.

[11]  Shashi Shekhar,et al.  Spatiotemporal Data Mining: A Computational Perspective , 2015, ISPRS Int. J. Geo Inf..

[12]  Shashi Shekhar,et al.  Spatial Big Data Science , 2017, Springer International Publishing.

[13]  Nan Yang,et al.  A review of road extraction from remote sensing images , 2016 .

[14]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Xu Miao,et al.  Active Deep Learning to Tune Down the Noise in Labels , 2018, KDD.

[16]  Sanja Fidler,et al.  Devil Is in the Edges: Learning Semantic Boundaries From Noisy Annotations , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Alexander Zipf,et al.  Mapping Human Settlements with Higher Accuracy and Less Volunteer Efforts by Combining Crowdsourcing and Deep Learning , 2019, Remote. Sens..

[18]  Zhe Jiang,et al.  An attention U-Net model for detection of fine-scale hydrologic streamlines , 2021, Environ. Model. Softw..

[19]  Heng Ji,et al.  Label Noise Reduction in Entity Typing by Heterogeneous Partial-Label Embedding , 2016, KDD.

[20]  Andrew Zisserman,et al.  AutoCorrect: Deep Inductive Alignment of Noisy Geometric Annotations , 2019, BMVC.

[21]  Zhiwu Lu,et al.  Learning from Weak and Noisy Labels for Semantic Segmentation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Xiaogang Wang,et al.  Learning from massive noisy labeled data for image classification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Michael F. Goodchild,et al.  Assuring the quality of volunteered geographic information , 2012 .

[24]  M. Verleysen,et al.  Classification in the Presence of Label Noise: A Survey , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Yiqun Liu,et al.  MSURU: Large Scale E-commerce Image Classification with Weakly Supervised Search Data , 2019, KDD.

[26]  Xiangjian He,et al.  Deep Learning Techniques for Medical Image Segmentation: Achievements and Challenges , 2019, Journal of Digital Imaging.

[27]  Geoffrey E. Hinton,et al.  Learning to Label Aerial Images from Noisy Data , 2012, ICML.

[28]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[29]  Richard Nock,et al.  Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).