Bottom-up Estimation of Geometric Layout for Indoor Images

In this paper, we propose a bottom-up approach that uses latent variables to estimate the geometric layout of indoor images. Modeling subregions with latent variables implicitly improves the accuracy of the scene layout estimate. The proposed method consists of three sub-tasks: feature extraction, subregion classification, and geometric layout classification. First, location features are extracted to roughly estimate the basic indoor structure, which removes the influence of illumination, rich color, and foreground occlusion. Second, an n-slack structural SVM (SSVM) is applied to efficiently classify the location features extracted in the previous step. Finally, the bag-of-words model is combined with cosine similarity and information divergence filtering to improve the fault tolerance of the geometric layout classification task. The classification accuracy reaches up to 0.982, which demonstrates the effectiveness of the proposed approach.
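The final step can be illustrated with a minimal sketch. The abstract does not say how cosine similarity and information divergence filtering are combined, so the following is only one plausible reading: the information divergence is assumed to be the Kullback-Leibler divergence, candidates whose divergence exceeds a threshold are filtered out, and the remaining candidate with the highest cosine similarity is chosen. The function names, the `prototypes` dictionary, and the `max_divergence` threshold are all hypothetical and do not come from the paper.

```python
import math


def cosine_similarity(p, q):
    """Cosine of the angle between two bag-of-words histograms."""
    dot = sum(a * b for a, b in zip(p, q))
    norm_p = math.sqrt(sum(a * a for a in p))
    norm_q = math.sqrt(sum(b * b for b in q))
    if norm_p == 0 or norm_q == 0:
        return 0.0
    return dot / (norm_p * norm_q)


def kl_divergence(p, q, eps=1e-9):
    """KL divergence D(p || q) after smoothing and normalizing both histograms.

    Assumption: "information divergence" in the abstract is read as KL divergence.
    """
    ps = [a + eps for a in p]
    qs = [b + eps for b in q]
    sum_p, sum_q = sum(ps), sum(qs)
    return sum(
        (a / sum_p) * math.log((a / sum_p) / (b / sum_q))
        for a, b in zip(ps, qs)
    )


def classify_layout(hist, prototypes, max_divergence=1.0):
    """Pick the layout label whose prototype histogram best matches `hist`.

    prototypes: dict mapping a layout label to a reference histogram
                (hypothetical; the paper's actual model is not given here).
    Candidates with divergence above `max_divergence` are filtered out first;
    the survivor with the highest cosine similarity wins.
    Returns (label, similarity), or (None, -1.0) if every candidate is filtered.
    """
    best_label, best_sim = None, -1.0
    for label, proto in prototypes.items():
        if kl_divergence(hist, proto) > max_divergence:
            continue  # divergence filter rejects this candidate
        sim = cosine_similarity(hist, proto)
        if sim > best_sim:
            best_label, best_sim = label, sim
    return best_label, best_sim
```

In this sketch, a call such as `classify_layout([7, 2, 1], {"A": [8, 1, 1], "B": [1, 8, 1]})` selects layout `"A"`. The divergence filter acts as a coarse rejection stage, so a histogram that resembles no prototype yields no label rather than a forced match.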
