Sky-image-based solar forecasting using deep learning with multi-location data: training models locally, globally or via transfer learning?