Image Parsing with a Three-State Series Neural Network Classifier

We propose a three-state series neural network for effective propagation of context and uncertainty information for image parsing. The activation functions used in the proposed model have three states instead of the normal two states. This makes the neural network more flexible than the two-state neural network, and allows for uncertainty to be propagated through the stages. In other words, decisions about difficult pixels can be left for later stages which have access to more contextual information than earlier stages. We applied the proposed method to three different datasets and experimental results demonstrate higher performance of the three-state series neural network.

[1]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[2]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[3]  Antonio Torralba,et al.  Contextual Models for Object Detection Using Boosted Random Fields , 2004, NIPS.

[4]  Y. LeCun,et al.  Learning methods for generic object recognition with invariance to pose and lighting , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[5]  Ross T. Whitaker,et al.  Serial Neural Network Classifier for Membrane Detection using a Filter Bank , 2009 .

[6]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[7]  Zhuowen Tu,et al.  Auto-Context and Its Application to High-Level Vision Tasks and 3D Brain Image Segmentation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Shimon Ullman,et al.  Combining Top-Down and Bottom-Up Segmentation , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[9]  Joseph F. Murray,et al.  Supervised Learning of Image Restoration with Convolutional Networks , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[10]  D. Hansel,et al.  Information processing in three-state neural networks , 1989 .