论文信息 - Balancing Appearance and Context in Sketch Interpretation

Balancing Appearance and Context in Sketch Interpretation

We describe a sketch interpretation system that detects and classifies clock numerals created by subjects taking the Clock Drawing Test, a clinical tool widely used to screen for cognitive impairments (e.g., dementia). We describe how it balances appearance and context, and document its performance on some 2,000 drawings (about 24K clock numerals) produced by a wide spectrum of patients. We calibrate the utility of different forms of context, describing experiments with Conditional Random Fields trained and tested using a variety of features. We identify context that contributes to interpreting otherwise ambiguous or incomprehensible strokes. We describe ST-slices, a novel representation that enables "unpeeling" the layers of ink that result when people overwrite, which often produces ink impossible to analyze if only the final drawing is examined. We characterize when ST-slices work, calibrate their impact on performance, and consider their breadth of applicability.

Yale Song | Randall Davis | Kaichen Ma | Dana L. Penney

[1] Randall Davis,et al. ChemInk: a natural real-time recognition system for chemical drawings , 2011, IUI '11.

[2] Lorenzo Torresani,et al. Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[3] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[4] Vladlen Koltun,et al. Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials , 2011, NIPS.

[5] Randall Davis,et al. Learning from Neighboring Strokes: Combining Appearance and Context for Multi-Domain Sketch Recognition , 2009, NIPS.

[6] Randall Davis,et al. Handling Overtraced Strokes in Hand-Drawn Sketches , 2004, AAAI Technical Report.

[7] Yann LeCun,et al. Regularization of Neural Networks using DropConnect , 2013, ICML.

[8] Sergei Vassilvitskii,et al. k-means++: the advantages of careful seeding , 2007, SODA '07.

[9] Hyungsin Kim,et al. The ClockMe system: computer-assisted screening tool for dementia , 2013 .

[10] J. Friedman. Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[11] Judea Pearl,et al. Reverend Bayes on Inference Engines: A Distributed Hierarchical Approach , 1982, AAAI.

[12] Randall Davis,et al. THink: Inferring Cognitive Status from Subtle Behaviors , 2014, AI Mag..

[13] Tracy Anne Hammond,et al. Using a Geometric-Based Sketch Recognition Approach to Sketch Chinese Radicals , 2008, AAAI.

[14] Thomas F. Stahovich,et al. Making pen-based interaction intelligent and natural : papers from the 2004 AAAI Symposium, October 21-24, Arlington, Virginia , 2004 .

[15] Randall Davis,et al. A Visual Approach to Sketched Symbol Recognition , 2009, IJCAI.

[16] Sargur N. Srihari,et al. On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[17] Bridget Kelly,et al. A 7 minute neurocognitive screening battery highly sensitive to Alzheimer's disease. , 1998, Archives of neurology.

[18] Cordelia Schmid,et al. Dense Trajectories and Motion Boundary Descriptors for Action Recognition , 2013, International Journal of Computer Vision.

[19] Jie Yang,et al. Error repair in human handwriting: an intelligent user interface for automatic online handwriting recognition , 1998, Proceedings. IEEE International Joint Symposia on Intelligence and Systems (Cat. No.98EX174).

[20] DavisRandall,et al. Learning classification models of cognitive conditions from subtle behaviors in the digital Clock Drawing Test , 2016 .

[21] Randall Davis,et al. Sketch recognition in interspersed drawings using time-based graphical models , 2008, Comput. Graph..

[22] Yann LeCun,et al. The mnist database of handwritten digits , 2005 .

[23] Rainer Malaka,et al. Sketch-Based Interfaces: Exploiting Spatio-temporal Context for Automatic Stroke Grouping , 2010, Smart Graphics.

[24] Jürgen Schmidhuber,et al. Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[25] Derek Hoiem,et al. Learning CRFs Using Graph Cuts , 2008, ECCV.

[26] Cynthia Rudin,et al. Learning classification models of cognitive conditions from subtle behaviors in the digital Clock Drawing Test , 2015, Machine Learning.

[27] Victor R. Lesser,et al. The Hearsay-II Speech-Understanding System: Integrating Knowledge to Resolve Uncertainty , 1980, CSUR.

[28] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.