Spatial Classification with Limited Observations Based on Physics-Aware Structural Constraint

Spatial classification with limited feature observations has been a challenging problem in machine learning. The problem exists in applications where only a subset of sensors are deployed at certain regions or partial responses are collected in field surveys. Existing research mostly focuses on addressing incomplete or missing data, e.g., data cleaning and imputation, classification models that allow for missing feature values, or modeling missing features as hidden variables and applying the EM algorithm. These methods, however, assume that incomplete feature observations only happen on a small subset of samples, and thus cannot solve problems where the vast majority of samples have missing feature observations. To address this issue, we propose a new approach that incorporates physics-aware structural constraints into the model representation. Our approach assumes that a spatial contextual feature is observed for all sample locations and establishes spatial structural constraint from the spatial contextual feature map. We design efficient algorithms for model parameter learning and class inference. Evaluations on real-world hydrological applications show that our approach significantly outperforms several baseline methods in classification accuracy, and the proposed solution is computationally efficient on a large data volume.

[1]  Brendan J. Frey,et al.  Factor graphs and the sum-product algorithm , 2001, IEEE Trans. Inf. Theory.

[2]  Mark A. Miller,et al.  Archetypal energy landscapes , 1998, Nature.

[3]  Geoffrey I. Webb The Problem of Missing Values in Decision Tree Grafting , 1998, Australian Joint Conference on Artificial Intelligence.

[4]  Soo-Young Lee,et al.  Training Algorithm with Incomplete Data for Feed-Forward Neural Networks , 1999, Neural Processing Letters.

[5]  Herbert Edelsbrunner,et al.  Computational Topology - an Introduction , 2009 .

[6]  Zhe Jiang,et al.  Spatial Structured Prediction Models: Applications, Challenges, and Techniques , 2020, IEEE Access.

[7]  Gustavo E. A. P. A. Batista,et al.  A Study of K-Nearest Neighbour as an Imputation Method , 2002, HIS.

[8]  Zhe Jiang,et al.  Hidden Markov Contour Tree: A Spatial Structured Model for Hydrological Applications , 2019, KDD.

[9]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[10]  Jack Snoeyink,et al.  Computing contour trees in all dimensions , 2000, SODA '00.

[11]  J. Ross Quinlan,et al.  Unknown Attribute Values in Induction , 1989, ML.

[12]  Johan A. K. Suykens,et al.  Handling missing values in support vector machine classifiers , 2005, Neural Networks.

[13]  Herbert Edelsbrunner,et al.  The geometry of biomolecular solvation , 2005 .

[14]  Yoshua Bengio,et al.  Recurrent Neural Networks for Missing or Asynchronous Data , 1995, NIPS.

[15]  Lawrence Carin,et al.  On Classification with Incomplete Data , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Venkatesh Merwade,et al.  Uncertainty in Flood Inundation Mapping: Current Issues and Future Directions , 2008 .

[17]  Pieter Abbeel,et al.  Max-margin classification of incomplete data , 2006, NIPS.

[18]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[19]  Shashi Shekhar,et al.  Spatial Big Data Science , 2017, Springer International Publishing.

[20]  Shashi Shekhar,et al.  Spatiotemporal Data Mining: A Computational Perspective , 2015, ISPRS Int. J. Geo Inf..

[21]  Michael I. Jordan,et al.  Supervised learning from incomplete data via an EM approach , 1993, NIPS.

[22]  Thomas Hofmann,et al.  Kernel Methods for Missing Variables , 2005, AISTATS.

[23]  Pengyang Wang,et al.  Beyond Geo-First Law: Learning Spatial Representations via Integrated Autocorrelations and Complementarity , 2019, 2019 IEEE International Conference on Data Mining (ICDM).

[24]  Zhe Jiang,et al.  A Survey on Spatial Prediction Methods , 2019, IEEE Transactions on Knowledge and Data Engineering.

[25]  Lena Osterhagen,et al.  Multiple Imputation For Nonresponse In Surveys , 2016 .

[26]  Zhe Jiang,et al.  Geographical Hidden Markov Tree for Flood Extent Mapping , 2018, KDD.

[27]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[28]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[29]  David E. Booth,et al.  Analysis of Incomplete Multivariate Data , 2000, Technometrics.

[30]  J. R. Rohlicek,et al.  Parameter estimation of dependence tree models using the EM algorithm , 1995, IEEE Signal Processing Letters.

[31]  Jean-Philip Piquemal,et al.  Characterizing Molecular Interactions in Chemical Systems , 2014, IEEE Transactions on Visualization and Computer Graphics.

[32]  Julius Jellinek,et al.  Energy Landscapes: With Applications to Clusters, Biomolecules and Glasses , 2005 .

[33]  Zhe Jiang,et al.  Geographical Hidden Markov Tree , 2021, IEEE Transactions on Knowledge and Data Engineering.

[34]  Yanjie Fu,et al.  Unifying Inter-region Autocorrelation and Intra-region Structures for Spatial Embedding via Collective Adversarial Learning , 2019, KDD.

[35]  Michael I. Jordan,et al.  Learning from Incomplete Data , 1994 .

[36]  Aníbal R. Figueiras-Vidal,et al.  Pattern classification with missing data: a review , 2010, Neural Computing and Applications.

[37]  Xiaoying Yang,et al.  GIS-based spatial regression and prediction of water quality in river networks: a case study in Iowa. , 2010, Journal of environmental management.

[38]  Fei Wang,et al.  Label Propagation through Linear Neighborhoods , 2006, IEEE Transactions on Knowledge and Data Engineering.