Practical computer vision: Example techniques and challenges

Humans, like many other living organisms, are gifted with the power of "seeing" and "understanding" the environment around them using their eyes. The ease with which humans process and understand the visual world is deceptive and often prompts us to underestimate the effort and methods needed to build practical, effective, and inexpensive computer vision systems. In essence, humans have a 500-million-year head start due to evolution; it is extremely difficult at this point to build a computer vision system that has the abilities of a three-year-old child. However, by confining ourselves to particular domains, we can often find shortcuts to solve particular problems. This paper illustrates a number of such solutions in various areas developed by our group at IBM. These include object finding for video surveillance, person identification via biometrics, inspection of manufactured items along railways, and scene understanding for driver assistance, as well as object recognition and motion interpretation for retail stores. We discuss the real-world constraints for each system and describe how we overcame the irksome variability inherent in each task. By further analyzing such successful systems and comparing them to each other, we can come to understand the common underlying problems and thus start to extend our initially limited areas of competence into a more general-purpose vision toolkit. This paper concludes with a set of challenging unresolved problems that, if solved, could spur great progress in practical computer vision.
