Intermodal image-based recognition of planar kinematic mechanisms

We present a data-driven exploratory study to investigate whether trained object detectors generalize well to test images from a different modality. We focus on the domain of planar kinematic mechanisms, which can be viewed as a set of rigid bodies connected by joints, and use textbook graphics and images of hand-drawn sketches as input modalities. The goal of our algorithm is to automatically recognize the underlying mechanical structure shown in an input image by leveraging well-known computer vision methods for object recognition with the optimizing power of multiobjective evolutionary algorithms. Taking a raw image as input, we detect pin joints using local feature descriptors in a support vector machine framework. Improving upon previous work, detection confidence depends on multiple context-based classifiers of varying image patch size and greedy foreground extraction. The likelihood of rigid body connections is approximated using normalized geodesic time, and NSGA-II is used to evolve optimal mechanisms using this data. The present work is motivated by the observation that textbook diagrams and hand-drawn sketches of mechanisms exhibit similar object structure, yet have different visual characteristics. We apply our method using various combinations of images for training and testing, and the results demonstrate a trade-off between solvability and accuracy. We describe a system that automatically recognizes mechanical structures from images.Contextual features and foreground extraction enhance previous detection results.We extend the recognition pipeline to include hand-drawn sketches as input.Trained object detectors can perform well on images from a different modality.

[1]  Carlos Artemio Coello-Coello,et al.  Theoretical and numerical constraint-handling techniques used with evolutionary algorithms: a survey of the state of the art , 2002 .

[2]  Xiaogang Wang,et al.  Face Photo-Sketch Synthesis and Recognition , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Javier García de Jalón,et al.  Kinematic and Dynamic Simulation of Multibody Systems , 1994 .

[4]  M. Hegarty Mechanical reasoning by mental simulation , 2004, Trends in Cognitive Sciences.

[5]  Levent Burak Kara,et al.  An image-based, trainable symbol recognizer for hand-drawn sketches , 2005, Comput. Graph..

[6]  Koos Eissen,et al.  Sketching: Drawing Techniques for Product Designers , 2009 .

[7]  Matthew Eicholtz,et al.  Recognizing planar kinematic mechanisms from a single image using evolutionary computation , 2014, GECCO.

[8]  Daniel J. Inman,et al.  Engineering Mechanics: Dynamics , 1966 .

[9]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[10]  Randall Davis,et al.  Magic Paper: Sketch-Understanding Research , 2007, Computer.

[11]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[12]  A. Yuille Deformable Templates for Face Recognition , 1991, Journal of Cognitive Neuroscience.

[13]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[14]  Thomas F. Stahovich,et al.  Grouping Strokes into Shapes in Hand-Drawn Diagrams , 2010, AAAI.

[15]  Levent Burak Kara,et al.  Recognizing Network-Like Hand-Drawn Sketches: A Convolutional Neural Network Approach , 2009, DAC 2009.

[16]  Dahua Lin,et al.  Inter-modality Face Recognition , 2006, ECCV.

[17]  Lothar Thiele,et al.  Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach , 1999, IEEE Trans. Evol. Comput..

[18]  M. Hegarty,et al.  Individual differences in mental animation during mechanical reasoning , 1994, Memory & cognition.

[19]  M. Hegarty Mental animation: inferring motion from static displays of mechanical systems. , 1992, Journal of experimental psychology. Learning, memory, and cognition.

[20]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Levent Burak Kara,et al.  Neural network-based symbol recognition using a few labeled samples , 2011, Comput. Graph..

[22]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[23]  Gennaro Costagliola,et al.  Recognition and autocompletion of partially drawn symbols by using polar histograms as spatial relation descriptors , 2014, Comput. Graph..

[24]  Levent Burak Kara,et al.  A Sketch-Based Tool for Analyzing Vibratory Mechanical Systems , 2008 .

[25]  Randall Davis,et al.  Tahuti: a geometrical sketch recognition system for UML class diagrams , 2006, SIGGRAPH Courses.

[26]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[27]  Thomas F. Stahovich,et al.  The effect of task on classification accuracy: using gesture recognition techniques in free-sketch recognition , 2009, SBIM '09.

[28]  Levent Burak Kara,et al.  From engineering diagrams to engineering models: Visual recognition and applications , 2011, Comput. Aided Des..

[29]  Kalyanmoy Deb,et al.  Muiltiobjective Optimization Using Nondominated Sorting in Genetic Algorithms , 1994, Evolutionary Computation.

[30]  Thomas F. Stahovich,et al.  The effect of task on classification accuracy: Using gesture recognition techniques in free-sketch recognition , 2010, Comput. Graph..

[31]  Javier García de Jalón,et al.  Kinematic and Dynamic Simulation of Multibody Systems: The Real Time Challenge , 1994 .

[32]  TangXiaoou,et al.  Face Photo-Sketch Synthesis and Recognition , 2009 .

[33]  Levent Burak Kara,et al.  An efficient graph-based recognizer for hand-drawn symbols , 2007, Comput. Graph..

[34]  Tracy Anne Hammond,et al.  Recognizing sketched multistroke primitives , 2011, ACM Trans. Interact. Intell. Syst..

[35]  Yi Yang,et al.  Articulated pose estimation with flexible mixtures-of-parts , 2011, CVPR 2011.

[36]  Thomas F. Stahovich,et al.  Newtons Pen II: an intelligent, sketch-based tutoring system and its sketch processing techniques , 2012, SBIM '12.

[37]  Martial Hebert,et al.  Learning message-passing inference machines for structured prediction , 2011, CVPR 2011.

[38]  Randall Davis,et al.  A Visual Approach to Sketched Symbol Recognition , 2009, IJCAI.

[39]  R. Hibbeler Engineering Mechanics: Dynamics , 1986 .

[40]  Gary G. Yen,et al.  Constraint Handling in Multiobjective Evolutionary Optimization , 2009, IEEE Transactions on Evolutionary Computation.