Hi everyone,
I'm excited to share my ARC-AGI Starter Notebook for the ARC-AGI competition, which implements a zero-shot learning approach using the Qwen2.5-1.5B-Instruct. This notebook serves as a baseline solution, processing input grids, generating predictions, and creating submission files in both JSON and CSV formats. It includes utility functions for grid-to-text conversion, visualization with Matplotlib, and robust error handling with fallback strategies.
Key features:
I’d love to hear your thoughts, suggestions, or any improvements you might recommend! Has anyone else experimented with zero-shot approaches for ARC-AGI? Let’s discuss how we can enhance this baseline or tackle specific challenges in the dataset.
Looking forward to your feedback!
And don't forget to star the repo if you find it helpful.
You can find the notebook link👇👇.
Hey @ML_Wizzard,
Nice work and thanks for sharing
Hey @Muhamed_Tuo
Thanks a lot! I appreciate it. Let's see how we can improve it further together.
I went through the notebook and I realized not even a single entry in your submission was made by the Qwen model instead they were made by your 'fallback' process with was essentially returning the inputs as outputs. Have you been able to make any legit submissions with an LLM yet?
Yeah that's true. I'm sure it was a petty bug out there because I used the ARChitect's model but still didn't make a difference. I guess I started quite late to even correct that. However I learnt a lot from that solution. Thank you @ML_Wizzard.
Thank you @CodeJoe. After going through the code again, I also realized that the LLM model doesn’t actually make much sense. But anyway, the competition has already ended. Thanks again @CodeJoe and @offei_lad!
Oh it's fine. Next time, we will also have to start early😂. Last minute sub doesn't help😂😂