
The AI Telco Troubleshooting Challenge

€35 000 EUR
Completed (~1 month ago)
Root Cause Analysis
Fault Detection
Edge AI
Anomaly Detection
Large Language Models
1254 joined
253 active
Start: Nov 28, 25
Close: Feb 01, 26
Reveal: Feb 02, 26
SeokhyunJeong
Data augmentation for phase 2
Help · 21 Jan 2026, 15:02 · 3

Thank you for organizing the challenge.

I have a question regarding the training setup.

Would it be permissible to create an additional dataset for fine-tuning that is similar to the Phase 2 test dataset?

Specifically, I am wondering whether it would be allowed to construct a dataset that covers the 9 potential causes that appear in the Phase 2 test dataset but not in the original Telelogs training dataset, or that includes background knowledge that may help with the general questions in the Phase 2 test data.

Thank you.

Discussion 3 answers

Yes. You can create synthetic data in the format you prefer for fine-tuning your model.

21 Jan 2026, 15:04
SeokhyunJeong

Thank you for your kind reply!

Are we constrained to use the same Qwen models to create this synthetic data?

I asked this in another thread and, as far as I recall, you said to use the same Qwen model that the task requires. Is this still true?