Preserving feature layout information for object recognition

In this paper, we have proposed a method to preserve layout information of feature maps, which is vanished in fully connected layer, for object classification tasks. In bag-of-features framework, codebook encodes an image to produce a group of response maps by convolution operation. And the maps hold location information of features in regular grids. To obtain mid-level representation, it is common to concatenate all the features into a long feature vector. Based on the representation, a linear classifier or full-connected layer is implemented to predict labels. However, because of the disappearance of regular girds, the spatial information of features vanish before being fed to the higher layer, even though it is preserved in the feature extraction process. In this paper, this problem is addressed by applying spatial description feature (SDF) to preserve more useful information with modified spatial pyramids. In addition, to enhance the performance of SDFs, we have designed a forward–backward strategy to select receptive fields. In the experiment, it is shown that the knowledge of feature spatial layout can promote classification and the forward–backward learning scheme can generate a compact and high-performance pipeline. © 2016 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.