DIAGNOSE: Avoiding Out-of-Distribution Data Using Submodular Information Measures