My understanding from conversations in Zindi and Discord is we are free to use any teacher model, synthetic data, etc that we wish with the main caveats being:
- Results must be reproducible
- Only one model (not separate models for each task)
- InkubaLM must be used as the model your work is based on. So you would have to use InkubaLM or a pruned version of InkubaLM as your student model.
My understanding from conversations in Zindi and Discord is we are free to use any teacher model, synthetic data, etc that we wish with the main caveats being:
- Results must be reproducible
- Only one model (not separate models for each task)
- InkubaLM must be used as the model your work is based on. So you would have to use InkubaLM or a pruned version of InkubaLM as your student model.