A Deterministic Self-Organizing Map Approach and its Application on Satellite Data based Cloud Type Classification

A self-organizing map (SOM) is a type of competitive artificial neural network, which projects the high-dimensional input space of the training samples into a low-dimensional space with the topology relations preserved. This makes SOMs supportive of organizing and visualizing complex data sets and have been pervasively used among numerous disciplines with different applications. Notwithstanding its wide applications, the self-organizing map is perplexed by its inherent randomness, which produces dissimilar SOM patterns even when being trained on identical training samples with the same parameters every time, and thus causes usability concerns for other domain practitioners and precludes more potential users from exploring SOM based applications in a broader spectrum. Motivated by this practical concern, we propose a deterministic approach as a supplement to the standard self-organizing map. In accordance with the theoretical design, the experimental results with satellite cloud data demonstrate the effective and efficient organization as well as simplification capabilities of the proposed approach.

[1]  W. Paul Menzel,et al.  The MODIS cloud products: algorithms and examples from Terra , 2003, IEEE Trans. Geosci. Remote. Sens..

[2]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.

[3]  Ting Su,et al.  A deterministic method for initializing K-means clustering , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[4]  Robert H. Weisberg,et al.  A Review of Self-Organizing Map Applications in Meteorology and Oceanography , 2011 .

[5]  Jian Tang,et al.  Using the machine learning approach to predict patient survival from high-dimensional survival data , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[6]  Tak-Chung Fu,et al.  Pattern discovery from stock time series using self-organizing maps , 2016 .

[7]  Erkki Oja,et al.  PicSOM: self-organizing maps for content-based image retrieval , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[8]  Hujun Yin,et al.  The Self-Organizing Maps: Background, Theories, Extensions and Applications , 2008, Computational Intelligence: A Compendium.

[9]  Vladimir Filkov,et al.  Consensus Clustering Algorithms: Comparison and Refinement , 2008, ALENEX.

[10]  Dongmin Lee,et al.  Radiative effects of global MODIS cloud regimes , 2016, Journal of geophysical research. Atmospheres : JGR.

[11]  Christopher N. K. Mooers,et al.  Performance evaluation of the self‐organizing map for feature extraction , 2006 .

[12]  Yukio Masumoto,et al.  Impact of Indian Ocean Dipole on intraseasonal zonal currents at 90°E on the equator as revealed by self‐organizing map , 2008 .

[13]  Steven Platnick,et al.  The MODIS Cloud Optical and Microphysical Products: Collection 6 Updates and Examples From Terra and Aqua , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[14]  W. Rossow,et al.  Advances in understanding clouds from ISCCP , 1999 .

[15]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[16]  Teuvo Kohonen,et al.  Essentials of the self-organizing map , 2013, Neural Networks.

[17]  Daeho Jin,et al.  Regime-based evaluation of cloudiness in CMIP5 models , 2016, Climate Dynamics.

[18]  B. John Oommen,et al.  Topology-oriented self-organizing maps: a survey , 2014, Pattern Analysis and Applications.

[19]  Björn A. Malmgren,et al.  Climate Zonation in Puerto Rico Based on Principal Components Analysis and an Artificial Neural Network , 1999 .

[20]  Arthur Flexer,et al.  On the use of self-organizing maps for clustering and visualization , 1999, Intell. Data Anal..

[21]  George J. Huffman,et al.  An examination of the nature of global MODIS cloud regimes , 2014 .

[22]  Jianwu Wang,et al.  A Hybrid Learning Framework for Imbalanced Stream Classification , 2017, 2017 IEEE International Congress on Big Data (BigData Congress).

[23]  Xin Jin,et al.  K-Means Clustering , 2010, Encyclopedia of Machine Learning.

[24]  Bianca Zadrozny,et al.  Learning and evaluating classifiers under sample selection bias , 2004, ICML.

[25]  W. Paul Menzel,et al.  High-Spatial-Resolution Surface and Cloud-Type Classification from MODIS Multispectral Band Measurements , 2003 .

[26]  Samuel Kaski,et al.  Clustering of Human Endogenous Retrovirus Sequences with Median Self-Organizing Map , 2003 .

[27]  W. Paul Menzel,et al.  Comparison between current and future environmental satellite imagers on cloud classification using MODIS , 2007 .

[28]  Teuvo Kohonen,et al.  The self-organizing map , 1990, Neurocomputing.

[29]  W. Paul Menzel,et al.  Cloud and aerosol properties, precipitable water, and profiles of temperature and water vapor from MODIS , 2003, IEEE Trans. Geosci. Remote. Sens..

[30]  Georg Pölzlbauer Survey and Comparison of Quality Measures for Self-Organizing Maps , 2004 .

[31]  Ben Jolly,et al.  An automated satellite cloud classification scheme using self‐organizing maps: Alternative ISCCP weather states , 2016 .

[32]  Alexander N. Gorban,et al.  SOM: Stochastic initialization versus principal components , 2016, Inf. Sci..