Primary competition visual

AI4D Malawi News Classification Challenge

Helping Malawi
$2 000 USD
Completed (almost 5 years ago)
Classification
830 joined
322 active
Starti
Jan 22, 21
Closei
May 09, 21
Reveali
May 09, 21
How to use JW3000 parallel dataset?
Data · 2 Mar 2021, 08:12 · 8

Hi - Could anyone please suggest how should one use JW3000 parallel dataset for augmentation?

Discussion 8 answers
User avatar
Muhamed_Tuo
Inveniam

Hi, you could use it to build a pretrained model.

2 Mar 2021, 08:14
Upvotes 0

Thanks for the response! Let's say if I build a pre-trained model using JW3000 dataset. I think that should be made public? Below is the text from rules section.

"You may use pretrained models as long as they are openly available to everyone."

User avatar
Muhamed_Tuo
Inveniam

That's not the case in this particular competition. Since it's stated in the rules that: "You may also use the JW300 parallel dataset to augment the data."

User avatar
Muhamed_Tuo
Inveniam

So everyone is free to use that dataset in any way that suit him well.

Yes, makes sense. I hope @Zindi will comment if they think otherwise. Thanks Muhamed!

User avatar
Muhamed_Tuo
Inveniam

My pleasure !

I don't think you can - better check with @zindi. You can use the parallel dataset to augment your data but pretraining models are banned (most probably due to the GPU resource requirement that not all of us have). Plus you would need to release the pre-trained model before some certain date if you would want to use it

Yes, Zindi just updated the rules saying we can't use JW3000 dataset.