CELL-E: A Text-To-Image Transformer for Protein Localization Prediction