Class imbalance in out-of-distribution datasets: Improving the robustness of the TextCNN for the classification of rare cancer types