Lazy dragging: effortless bounding-box drawing for touch-screen devices

Dragging a rectangular box is arguably one of the most intuitive and popular user interfaces to select a whole object. However, the conventional dragging method cannot be easily applied on touch-screen devices. This paper proposes a more convenient and lazier way of bounding-box (BB) drawing for touch-screen devices. The proposed method of BB drawing, termed Lazy Dragging, is designed to yield accurate BB results at interactive speeds. To this end, Lazy Dragging proceeds in two stages: (i) it first filters out explicit non-borders via graph-based segmentation; (ii) it then singles out the best four box sides among the remaining candidates using edge feature-based random forest. In this second stage, global and local edge-based border detection and a combined approach of these global and local schemes are utilized. This paper also proposes an optional stage to simplify and ameliorate further user refinement by providing invisible magnetic guidelines. This technique guides a user's touch to the nearest superpixel boundaries. Extensive experiments on a real-world dataset demonstrate that Lazy Dragging convincingly enhances the quality of input BBs in quasi real-time, enabling effortless object selection on small touch-screen devices.

[1]  Kwanghoon Sohn,et al.  Lazy Dragging: Effortless bounding-box drawing for touch-screen devices , 2017, 2017 IEEE International Conference on Consumer Electronics (ICCE).

[2]  Kwanghoon Sohn,et al.  Personness estimation for real-time human detection on mobile devices , 2017, Expert Syst. Appl..

[3]  Bernt Schiele,et al.  What Makes for Effective Detection Proposals? , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  C. Lawrence Zitnick,et al.  Structured Forests for Fast Edge Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[5]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[6]  C. Lawrence Zitnick,et al.  Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.

[7]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[8]  Kwanghoon Sohn,et al.  Non-parametric human segmentation using support vector machine , 2016, 2016 IEEE International Conference on Consumer Electronics (ICCE).

[9]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[10]  Minh N. Do,et al.  Fast Global Image Smoothing Based on Weighted Least Squares , 2014, IEEE Transactions on Image Processing.

[11]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Morten Hertzum,et al.  How Age Affects Pointing With Mouse and Touchpad: A Comparison of Young, Adult, and Elderly Users , 2010, Int. J. Hum. Comput. Interact..

[13]  Vladimir Kolmogorov,et al.  "GrabCut": interactive foreground extraction using iterated graph cuts , 2004, ACM Trans. Graph..

[14]  Mario Cifrek,et al.  A brief introduction to OpenCV , 2012, 2012 Proceedings of the 35th International Convention MIPRO.