Multi-source Information Gain for Random Forest: An Application to CT Image Prediction from MRI Data

Random forest has been widely recognized as one of the most powerful learning-based predictors in literature, with a broad range of applications in medical imaging. Notable efforts have been focused on enhancing the algorithm in multiple facets. In this paper, we present an original concept of multi-source information gain that escapes from the conventional notion inherent to random forest. We propose the idea of characterizing information gain in the training process by utilizing multiple beneficial sources of information, instead of the sole governing of prediction targets as conventionally known. We suggest the use of location and input image patches as the secondary sources of information for guiding the splitting process in random forest, and experiment on the challenging task of predicting CT images from MRI data. The experimentation is thoroughly analyzed in two datasets, i.e., human brain and prostate, with its performance further validated with the integration of auto-context model. Results prove that the multi-source information gain concept effectively helps better guide the training process with consistent improvement in prediction accuracy.

[1]  Marko Robnik,et al.  Improving Random Forests , 2004 .

[2]  Ullrich Köthe,et al.  On Oblique Random Forests , 2011, ECML/PKDD.

[3]  David Vázquez,et al.  Random Forests of Local Experts for Pedestrian Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[4]  Marko Robnik-Sikonja,et al.  Improving Random Forests , 2004, ECML.

[5]  Max A. Viergever,et al.  elastix: A Toolbox for Intensity-Based Medical Image Registration , 2010, IEEE Transactions on Medical Imaging.

[6]  Stephen M. Smith,et al.  A global optimisation method for robust affine registration of brain images , 2001, Medical Image Anal..

[7]  C. Lawrence Zitnick,et al.  Structured Forests for Fast Edge Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  Zhuowen Tu,et al.  Auto-context and its application to high-level vision tasks , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Peter Kontschieder,et al.  Structured class-labels in random forests for semantic image labelling , 2011, 2011 International Conference on Computer Vision.