Automated data preparation for in vivo tumor characterization with machine learning