With just 6 days left until the challenge closes and an incredible €35,000 prize pool on the line, this is a great moment to jump in if you haven’t submitted yet - or to improve your existing submission.
Thanks for picking up the issue with how some questions were scored. Based on this we’ve updated the Pass @ K scoring function.
What this means in practice:
1. Answers must follow the required format exactly – see the sample submission file Your submission text should look like:
“Based on the provided data, the most likely root cause for …: \boxed{C1}”
The scorer now checks the value inside \boxed{}. If the boxed answer is missing, malformed, or the format isn’t followed, that prediction will score 0.
This means:
2. Rescore happening today We’ll be re-scoring the leaderboard today using the updated logic to ensure results are consistent, fair, and transparent for everyone.
📌 What you should do now
Thanks for bearing with us - and as always, shout in the discussions if anything is unclear. Happy coding! 🚀
Thanks!!
Quick clarification Question:
On example test set ID - ID_A34VXCUAX9, it says: From the following 9 potential root causes, select the most likely one and enclose its number in \boxed{{}} in the final answer.
A: RF or power parameters cause severe overlap coverage
B: Network capacity insufficient or load imbalance between cells
... etc
Than will the expected answer be one of \boxed{{A~I}} or \boxed{{1~9}} (so encoded version) as the question is asking to enclose its number.
So my question is more like do I need to either
1) Encode all the answers to get 1~9 (or encode whatever range of choices that each question has in order starting from 1) or
2) just keep the answer format as it is (so it can be Alphabet or Alphanumeric or whatever format the question has) and add \boxed{{}} format on those.
Thanks!
@meganomaly
Should the answers based on the option IDs ? beacuse some questions have the options are 1-9, Z1-Z9, C1-C8 etc what should be the answer in those cases?
has the rescoring happened yet?
Yes it has - results are being published
qwen3-32b on hf is a paid model for many providers , use those paid model qwen3-32b is legal or not?