Semi-Supervised Learning With the EM Algorithm: A Comparative Study Between Unstructured and Structured Prediction

Semi-supervised learning aims to learn prediction models from both labeled and unlabeled samples. There has been extensive research in this area. Among existing work, generative mixture models with Expectation-Maximization (EM) is a popular method due to clear statistical properties. However, existing literature on EM-based semi-supervised learning largely focuses on unstructured prediction, assuming that samples are independent and identically distributed. Studies on EM-based semi-supervised approach in structured prediction is limited. This paper aims to fill the gap through a comparative study between unstructured and structured methods in EM-based semi-supervised learning. Specifically, we compare their theoretical properties and find that both methods can be considered as a generalization of self-training with soft class assignment of unlabeled samples, but the structured method additionally considers structural constraint in soft class assignment. We conducted a case study on real-world flood mapping datasets to compare the two methods. Results show that structured EM is more robust to class confusion caused by noise and obstacles in features in the context of the flood mapping application.

[1]  Nir Friedman,et al.  The Bayesian Structural EM Algorithm , 1998, UAI.

[2]  Gustavo Camps-Valls,et al.  Semi-Supervised Graph-Based Hyperspectral Image Classification , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Weiping Zhu,et al.  Spatial co-training for semi-supervised image classification , 2015, Pattern Recognit. Lett..

[4]  R. Frey,et al.  EM algorithm for Markov chains observed via Gaussian noise and point process information: Theory and case studies , 2017 .

[5]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[6]  Zhe Jiang,et al.  A Survey on Spatial Prediction Methods , 2019, IEEE Transactions on Knowledge and Data Engineering.

[7]  Shashi Shekhar,et al.  Spatial Big Data Science , 2017, Springer International Publishing.

[8]  Zhe Jiang,et al.  Geographical Hidden Markov Tree , 2021, IEEE Transactions on Knowledge and Data Engineering.

[9]  Shashi Shekhar,et al.  Spatiotemporal Data Mining: A Computational Perspective , 2015, ISPRS Int. J. Geo Inf..

[10]  Zhe Jiang,et al.  Hidden Markov Contour Tree: A Spatial Structured Model for Hydrological Applications , 2019, KDD.

[11]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[12]  Zhe Jiang,et al.  A Hidden Markov Contour Tree Model for Spatial Structured Prediction , 2020 .

[13]  Jon Atli Benediktsson,et al.  Semisupervised Self-Learning for Hyperspectral Image Classification , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Ulf Brefeld,et al.  Semi-supervised learning for structured output variables , 2006, ICML.

[15]  L. Anselin Local Indicators of Spatial Association—LISA , 2010 .

[16]  Zhe Jiang,et al.  Geographical Hidden Markov Tree for Flood Extent Mapping , 2018, KDD.

[17]  Zoubin Ghahramani,et al.  Learning from labeled and unlabeled data with label propagation , 2002 .