Primary competition visual

MPEG-G Microbiome Classification Challenge

$5 000 USD
Completed (6 months ago)
Classification
Federated Learning
Python
Deep Learning
794 joined
83 active
Starti
Jun 20, 25
Closei
Sep 15, 25
Reveali
Sep 15, 25
Neural Networks vs Boosting Methods
Platform · 9 Sep 2025, 08:09 · 15

Those who are having high public score, including us, are using any of the boosting classifiers, not any neural network.

How are we supposed to make Federated Learning model using that?? Otherwise no one will be in top 15. Neural Network cannot match the accuracy of the State-of-the-art models.

any solutions??

Discussion 15 answers

Even our team is also getting a high accuracy of 3.16e-7 but that is considered as an invalid score, though it was not hardcoded or post-processed. The 'valid' score according to the Zindi can only be possible if everyone uses a NN model from scratch. There are high chances of getting rejected during the time of code review as it states to use pytorch model

9 Sep 2025, 08:13
Upvotes 0
User avatar
CodeJoe

Yes from what I'm seeing you definitely need to use a neural net.

User avatar
51pegasi

Where is that indicated? Because I suspect most of the top LB (if not all) are using boosting models

Yes I also am concerned that to be in the top LB, we have to use boosting models which is not mentioned in the info of this challenge.

User avatar
CodeJoe
Your solution must include:
A centralised model, trained using the combined dataset.
A federated learning approach, trained using the split datasets.
The aim is to prove that MPEG-G files are more efficient due to less data movement and better compression, and that federated learning preserves performance across sites better than a centralised model. You must:
Define Clients: Treat each folder (or grouping by participants/site) as a "location".
Create a Federated Dataset Loader: Each client should have its own dataset split (e.g., location1_data, location2_data, ...) and convert each into a PyTorch Dataset.
Define the Same PyTorch Model: Same architecture as centralized model.
Implement FL with Flower or PySyft
Train the FL Model
Make a submission to Zindi
In your final solution submission , you will need to output three submission files, one from your centralised model, one from your federated learning model and a technical explanation of the innovation and approach used. You will also need to indicate training time, inference time, resources used and any observations on .mgb vs uncompressed formats.
User avatar
51pegasi

But we can also apply Federated learning to xgboost for example (although not sure about the performance). So why the restriction

User avatar
CodeJoe

Can't tell, let's confirm from the organizers😅.

User avatar
Koleshjr
Multimedia university of kenya

If you are getting this score: 3.16e-7 without post processing you should definitely submit it as it is valid

But the thing is, it's a DT based model. Federated learning using that is complex.

Also the competition doesnot want us to use any other model. It mentioned to use pytorch. That's why so much confusion. :)

User avatar
Koleshjr
Multimedia university of kenya

federated learning is a must! but I have seen an xgboost federated learning implementation. You can try that. Also for that score what is your local cv?

User avatar
CodeJoe

You can retrain a boosting model.

9 Sep 2025, 08:52
Upvotes 0

Boosting models are ready, as well as the NN. But the performance between these two are quite large. No one is yet sure how the final evaluation will be done during code checking.

Will those codes having boosting models be accepted as a valid one?

User avatar
CodeJoe

I think they will focus on both federated and centralized approaches and then carbon emission, basically how long it takes to train.

For now, we need to hear from the organizers on whether they prefer a pytorch model or any. I would however urge you to follow the rules.

@Amy_Bray please clarify this for us.

Unless clarified, we are unable to understand whether to move forward with neural networks or other models.

User avatar
CodeJoe

Exactly!