Follow-up question regarding external data: Are external, publicly available data sets not even allowed for pretraining a model? Asking as from my point of view adding this kind of data wouldn't change the nature of the challenge (i.e. the used input data for final prediction remains the same) but may allow to benefit from transfer learning