
Unifi Value Frameworks PDF Lifting Competition

Helping South Africa
$5 000 USD
Challenge completed over 1 year ago
Generative AI
450 joined
73 active
Start
Dec 21, 21
Close
Mar 17, 24
Reveal
Mar 17, 24
Juliuss
Freelance
Malfunctional Reference File?
Platform · 29 Dec 2023, 07:41 · 7

@Zindi, kindly check that the reference file and the evaluation metric are correct. The sample submission file with all zeroes already scores above 90% accuracy, yet any slight deviation from it drops the accuracy to around 0%. (I could be mistaken.)

Discussion 7 answers
Amy_Bray
Zindi

Hi Julius, We will look into this and get back to you by 3 January.

29 Dec 2023, 16:16
Upvotes 1
Wajdi_Hajji
ESPRIT

Hi Julius, I initially shared your confusion regarding why accuracy is the metric and why a simple submission can achieve such high accuracy. Here's my understanding, though I could be wrong:

This is a multiclass classification challenge where the classes represent the values extracted from the PDFs. The simple submission reveals that about 90% of the values are 0. This occurs because many AMKEYs are not mentioned in the PDF, so their values are set to 0. Similarly, in the training data these values are set to null, and nulls make up about 91% of the data. Thus, the training and evaluation datasets show a similar imbalance.

The task involves extracting 511 AMKEY values from 12 companies for the year 2022. If an AMKEY is not found in a document, a 0 is assigned to its value.
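To illustrate why an all-zero file can score so high, here is a minimal sketch. It assumes the metric is plain exact-match accuracy; the `accuracy` helper and the ten-row reference are hypothetical and are not taken from the competition files.

```python
def accuracy(y_true, y_pred):
    """Fraction of rows where the prediction equals the reference exactly."""
    if len(y_true) != len(y_pred):
        raise ValueError("reference and submission must have the same length")
    matches = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return matches / len(y_true)


# Hypothetical reference: 9 of 10 rows are 0.0 because the AMKEY
# does not appear in the PDF; only one row carries a real value.
reference = [0.0] * 9 + [1250.5]

# A submission that predicts 0.0 everywhere, like the sample file.
all_zero_submission = [0.0] * 10

baseline = accuracy(reference, all_zero_submission)
print(baseline)  # 0.9
```

With roughly 90% of the evaluation values equal to 0, the all-zero baseline matches on every one of those rows, so accuracy only measures real progress on the remaining rows.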

30 Dec 2023, 00:22
Upvotes 1
Juliuss
Freelance

Certainly, @Wajdi_Hajji... Nevertheless, when I manually modified just five values, for five distinct companies under a specific AMKEY, that I was confident were accurate, the accuracy plummeted to 0%.

Nelly43
Zindi

Hi @JuliusFx, please ensure the "2022_Value" column in your submission is of type float (the same as the original target variable type), since for a task like this challenge even "0" and "0.0" are considered different "classes".
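A minimal sketch of the effect, assuming the scorer compares values as class labels by their text form. The thread does not confirm how the scoring is implemented, and the `accuracy_as_labels` helper and the sample values are hypothetical.

```python
def accuracy_as_labels(y_true, y_pred):
    """Exact-match accuracy after turning every value into a string label."""
    matches = sum(1 for t, p in zip(y_true, y_pred) if str(t) == str(p))
    return matches / len(y_true)


# Hypothetical float target, as in the original "2022_Value" column.
reference = [0.0, 0.0, 0.0, 1250.5]

# Zeros written as integers become the label "0", not "0.0".
int_submission = [0, 0, 0, 1250.5]

# Casting every value to float restores the label "0.0".
float_submission = [float(v) for v in int_submission]

score_int = accuracy_as_labels(reference, int_submission)
score_float = accuracy_as_labels(reference, float_submission)
print(score_int, score_float)  # 0.25 1.0
```

If the submission is built with pandas, the equivalent cast would be `df["2022_Value"] = df["2022_Value"].astype(float)` before writing the CSV.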

2 Jan 2024, 12:08
Upvotes 0
Juliuss
Freelance

Yeah, it now scores, thanks @Nelly43. Maybe you could now check that, when two submissions have similar scores, only the earliest is considered. I noticed that my latest submission, which scores similarly to an earlier one, is the one being considered, and I moved down the leaderboard.

I'd recommend reaching out to Zindi directly to double-check and clarify the accuracy concerns. They should be able to assist you and ensure everything aligns properly.

4 Jan 2024, 15:48
Upvotes 0
Juliuss
Freelance

Yeah, I think it was clarified.