
To Vaccinate or Not to Vaccinate: It’s not a Question by #ZindiWeekendz

Helping Africa
$300 USD
Completed (almost 6 years ago)
Natural Language Processing
Classification
Sentiment Analysis
212 joined
126 active
Start
Apr 24, 20
Close
Apr 26, 20
Reveal
Apr 26, 20
Always the same problem
Data · 25 Apr 2020, 15:26 · 4

This is a classification problem. I don't know why Zindi is using RMSE, a metric meant for regression problems, for it. It's always the wrong choice of metric.

PLEASE FIX THIS ISSUE.

Discussion 4 answers

This might not exactly be a classification problem. Some of the comments are partially negative, and some could be partially positive, so treating each one as a hard class wouldn't be good. To think of it differently, consider predicting the rating a person would give a movie, on a scale of 0 to 10, based on their review. RMSE would be better suited as a metric than a classification metric in that case. Also, these tweets are labelled with some disagreement: 2 out of 3 people could think a tweet is positive, while one could say it's neutral. So the distance between my model's prediction and the annotators' agreement could be what I am looking for.
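To make the disagreement point concrete, here is a minimal sketch in Python. It assumes a label coding of 1 = positive, 0 = neutral, -1 = negative and uses made-up annotator votes; neither is taken from the competition data. The target for each tweet is the mean of its three votes, and RMSE then rewards predictions that land close to that mean.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical votes from three annotators per tweet.
# Assumed coding: 1 = positive, 0 = neutral, -1 = negative.
votes = [
    [1, 1, 0],    # two say positive, one says neutral
    [0, 0, 0],    # unanimous neutral
    [-1, -1, 0],  # two say negative, one says neutral
]

# Soft target: the mean vote for each tweet.
targets = [sum(v) / len(v) for v in votes]

hard_preds = [1, 0, -1]        # majority class for each tweet
soft_preds = [0.7, 0.0, -0.6]  # made-up continuous predictions

rmse_hard = rmse(targets, hard_preds)
rmse_soft = rmse(targets, soft_preds)
print(f"hard: {rmse_hard:.4f}  soft: {rmse_soft:.4f}")
```

In this toy setup the hard majority-class predictions are penalised on the two tweets where annotators disagreed, while the continuous predictions score a lower RMSE because they sit closer to the mean vote.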

25 Apr 2020, 15:33
Upvotes 0

Yeah! But what are we looking for? We have a tweet and we want to predict its sentiment. In the end, we want the sentiment. Values like 0.6, 0.4126 or 0.5 mean nothing; we want the right predictions. That's why treating it as a classification problem is 100% better than this approach.
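The two views can be combined, and the sketch below shows one possible way (it is not something the competition prescribes). A classifier's class probabilities are turned into an expected label, which is the single number that minimises expected squared error under those probabilities. The label coding (-1, 0, 1) and the probabilities are assumptions made up for illustration.

```python
LABELS = (-1, 0, 1)  # assumed coding: negative, neutral, positive


def expected_label(probs):
    """Probability-weighted average of the class labels."""
    return sum(p * k for p, k in zip(probs, LABELS))


def argmax_label(probs):
    """Hard class: the label with the highest probability."""
    best = max(range(len(probs)), key=probs.__getitem__)
    return LABELS[best]


def expected_squared_error(probs, prediction):
    """Expected (label - prediction)^2 if the label follows probs."""
    return sum(p * (k - prediction) ** 2 for p, k in zip(probs, LABELS))


# Hypothetical classifier output for one tweet.
probs = [0.1, 0.3, 0.6]

soft = expected_label(probs)  # continuous submission value
hard = argmax_label(probs)    # hard class submission value

err_soft = expected_squared_error(probs, soft)
err_hard = expected_squared_error(probs, hard)
print(f"soft={soft:.2f} (err {err_soft:.2f}), hard={hard} (err {err_hard:.2f})")
```

So a model trained as a classifier can still be submitted under RMSE: if its probabilities are reasonable, submitting the expected label should score at least as well as submitting the hard class.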

25 Apr 2020, 15:43
Upvotes 0
Mugisha_

I agree with you. In the real world, "neutral" is a nearly useless (for lack of a better word) category and in most cases has no effect on the outcome of any real-world event. So I think the problem should have been framed as a two-class classification problem: pro-vaccination (1) or anti-vaccination (0). But I respect the fact that they have their reasons for keeping it this way.

Yeah, I asked myself the same question. At first I thought it was classification.

25 Apr 2020, 18:09
Upvotes 0