Thanks to Zindi for the dataset.
I used the data to make a model and deployed it to raspberry pi for real-time image inference from a live camera video feed. Had to augment the data with more localized images for better accuracy performance.
Summary of the deployment and results is as below.
If anybody needs support in the replication of such a project, let me know.