Machine learning of human responses to images

The human user is an often ignored component of the imaging chain. In medical diagnostic tasks, the human observer plays the role of the decision-maker, forming opinions based on visual assessment of images. In content-based image retrieval, the human user is the ultimate judge of the relevance of images recalled from a database. We argue that data collected from human observers should be used in conjunction with machine-learning algorithms to model and optimize performance in tasks that involve humans. In essence, we treat the human observer as a nonlinear system to be identified. In this paper, we review our work in two applications of this general idea. In the first, a learning machine is trained to predict the accuracy of human observers in a lesion detection task for purposes of assessing image quality. In the second, a learning machine is trained to predict human users' perception of the similarity of two images for purposes of content-based image retrieval from a database. In both examples, it is shown that a nonlinear learning machine can accurately identify the nonlinear human system that maps images into numerical values, such as detection performance or image similarity.

[1]  B.M.W. Tsui,et al.  Comparison of radially-symmetric versus oriented channel. Models using channelized hotelling observers for myocardial defect detection in parallel-hole SPECT , 1998, 1998 IEEE Nuclear Science Symposium Conference Record. 1998 IEEE Nuclear Science Symposium and Medical Imaging Conference (Cat. No.98CH36255).

[2]  Jeffrey A. Fessler,et al.  Analysis of observer performance in known-location tasks for tomographic image reconstruction , 2006, IEEE Transactions on Medical Imaging.

[3]  Miles N. Wernick,et al.  Optimization of iterative reconstructions of /sup 99m/Tc cardiac SPECT studies using numerical observers , 2001 .

[4]  M.N. Wernick,et al.  Learning a nonlinear channelized observer for image quality assessment , 2003, 2003 IEEE Nuclear Science Symposium. Conference Record (IEEE Cat. No.03CH37515).

[5]  Alberto Del Bimbo,et al.  Visual information retrieval , 1999 .

[6]  Jie Yao,et al.  Predicting human performance by a channelized Hotelling observer model , 1992, Optics & Photonics.

[7]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[8]  Richard M. Leahy,et al.  Covariance approximation for fast and accurate computation of channelized Hotelling observer statistics , 1999, 1999 IEEE Nuclear Science Symposium. Conference Record. 1999 Nuclear Science Symposium and Medical Imaging Conference (Cat. No.99CH37019).

[9]  H H Barrett,et al.  Human- and model-observer performance in ramp-spectrum noise: effects of regularization and object variability. , 2001, Journal of the Optical Society of America. A, Optics, image science, and vision.

[10]  Harrison H. Barrett,et al.  Foundations of Image Science , 2003, J. Electronic Imaging.

[11]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[12]  Benjamin Van Roy,et al.  Solving Data Mining Problems Through Pattern Recognition , 1997 .

[13]  P. Khurd,et al.  Channelized hotelling and human observer study of optimal smoothing in SPECT MAP reconstruction , 2004, IEEE Transactions on Nuclear Science.

[14]  Miles N. Wernick Pattern classification by convex analysis , 1991 .

[15]  C. Metz,et al.  Maximum likelihood estimation of receiver operating characteristic (ROC) curves from continuously-distributed data. , 1998, Statistics in medicine.

[16]  Martin Brown,et al.  Network Performance Assessment for Neurofuzzy Data Modelling , 1997, IDA.