Semantic-aware label placement for augmented reality in street view

In an augmented reality (AR) application, placing labels in a manner that is clear and readable without occluding the critical information from the real world can be a challenging problem. This paper introduces a label placement technique for AR used in street view scenarios. We propose a semantic-aware task-specific label placement method by identifying potentially important image regions through a novel feature map, which we refer to as guidance map. Given an input image, its saliency information, semantic information and the task-specific importance prior are integrated in the guidance map for our labeling task. To learn the task prior, we created a label placement dataset with the users’ labeling preferences, as well as use it for evaluation. Our solution encodes the constraints for placing labels in an optimization problem to obtain the final label layout, and the labels will be placed in appropriate positions to reduce the chances of overlaying important real-world objects in street view AR scenarios. The experimental validation shows clearly the benefits of our method over previous solutions in the AR street view navigation and similar applications.

[1]  Timo Götzelmann,et al.  Metrics for Functional and Aesthetic Label Layouts , 2005, Smart Graphics.

[2]  Steven K. Feiner,et al.  View management for virtual and augmented reality , 2001, UIST '01.

[3]  Ronald Azuma,et al.  A Survey of Augmented Reality , 1997, Presence: Teleoperators & Virtual Environments.

[4]  Sebastian Ramos,et al.  The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Yongtian Wang,et al.  Evaluation of labelling layout method for image-driven view management in augmented reality , 2017, OZCHI.

[6]  Mihran Tuceryan,et al.  Automatic determination of text readability over textured backgrounds for augmented reality systems , 2004, Third IEEE and ACM International Symposium on Mixed and Augmented Reality.

[7]  George Papandreou,et al.  Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.

[8]  Tom Drummond,et al.  Real-Time Video Annotations for Augmented Reality , 2005, ISVC.

[9]  Naokazu Yokoya,et al.  View management of annotations for wearable augmented reality , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[10]  Wei Guo,et al.  Image-Based Label Placement for Augmented Reality Browsers , 2018, 2018 IEEE 4th International Conference on Computer and Communications (ICCC).

[11]  Daniel Weiskopf,et al.  Density-based label placement , 2019, The Visual Computer.

[12]  Patricia Morreale,et al.  Applications of Augmented Reality Systems in Education , 2010 .

[13]  Xavier Décoret,et al.  Dynamic label placement for improved interactive exploration , 2008, NPAR.

[14]  Yongtian Wang,et al.  An empirical evaluation of labelling method in augmented reality , 2018, VRCAI.

[15]  Hongyang Chao,et al.  Annotating and navigating tourist videos , 2010, GIS '10.

[16]  Makoto Sato,et al.  Nonoverlapped view management for augmented reality by tabletop projection , 2014, J. Vis. Lang. Comput..

[17]  Tsutomu Terada,et al.  An information layout method for an optical see-through head mounted display focusing on the viewability , 2008, 2008 7th IEEE/ACM International Symposium on Mixed and Augmented Reality.

[18]  Jirí Bittner,et al.  Layout-aware optimization for interactive labeling of 3D models , 2010, Comput. Graph..

[19]  Dieter Schmalstieg,et al.  Hedgehog labeling: View management techniques for external labels in 3D space , 2014, 2014 IEEE Virtual Reality (VR).

[20]  Mihran Tuceryan,et al.  Determining text readability over textured backgrounds in augmented reality systems , 2004, VRCAI '04.

[21]  Andreas Butz,et al.  View management for driver assistance in an HMD , 2013, ISMAR.

[22]  Ernesto Damiani,et al.  Augmented reality technologies, systems and applications , 2010, Multimedia Tools and Applications.

[23]  Kaori Fujinami,et al.  Estimating Visibility of Annotations for View Management in Spatial Augmented Reality Based on Machine-Learning Techniques , 2019, Sensors.

[24]  Naokazu Yokoya,et al.  Annotating user-viewed objects for wearable AR systems , 2005, Fourth IEEE and ACM International Symposium on Mixed and Augmented Reality (ISMAR'05).

[25]  Kosuke Sato,et al.  View Management of Projected Labels on Nonplanar and Textured Surfaces , 2013, IEEE Transactions on Visualization and Computer Graphics.

[26]  Zhuowen Tu,et al.  Deeply Supervised Salient Object Detection with Short Connections , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Stephen R. Ellis,et al.  Evaluation of Alternative Label Placement Techniques in Dynamic Virtual Environments , 2009, Smart Graphics.

[28]  Daniel Vogel,et al.  Specifying label layout style by example , 2007, UIST.

[29]  Jingdong Wang,et al.  Salient Object Detection: A Discriminative Regional Feature Integration Approach , 2013, International Journal of Computer Vision.

[30]  Kiyoshi Kiyokawa,et al.  Towards intelligent view management: A study of manual text placement tendencies in mobile environments using video see-through displays , 2013, 2013 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).

[31]  Dieter Schmalstieg,et al.  Dynamic compact visualizations for augmented reality , 2013, 2013 IEEE Virtual Reality (VR).

[32]  Haibin Ling,et al.  Salient Object Detection in the Deep Learning Era: An In-Depth Survey , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Ronald Azuma,et al.  Evaluating label placement for augmented reality view management , 2003, The Second IEEE and ACM International Symposium on Mixed and Augmented Reality, 2003. Proceedings..

[34]  Dieter Schmalstieg,et al.  Image-driven view management for augmented reality browsers , 2012, 2012 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).

[35]  Ronald Azuma,et al.  A survey of augmented reality" Presence: Teleoperators and virtual environments , 1997 .

[36]  S. Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, CVPR 2009.

[37]  Srinidhi Hegde,et al.  Where to Place: A Real-Time Visual Saliency Based Label Placement for Augmented Reality Applications , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).