GBM models like XGB, LGBM these are the best for tabular kind of data prediction. Do we need to implement a neural network or will it be acceptable by Zindi if we submit the code using Federated Learning models? what are the other top participants are doing??
is it mandatory for us to submit the data using pytorch based neural network model?
The real question is whether the DNA sequence can help in seggregating the data into the body site where it came from. The normal bioinformatics pipeline goes through many steps in order to extract features (such as aligning these sequences to reference sequences for each microorganism. If you can find a method that is capable of this, and it is less time and resources, then this should be useful.