The dataset shared with participants consists of multiple-choice questions (MCQs) related to 3GPP standards, drawn from TeleQnA. TeleQnA is a comprehensive dataset designed to assess the knowledge of LLMs in the field of telecommunications.
It encompasses 1827 multiple-choice questions distributed across two distinct categories:
The 1827 MCQs are also split into 1461 and 366 questions, which make up the train and test1 sets, respectively.
For more in-depth information about the dataset and the generation process, please refer to [2].
Each question is represented in JSON format, comprising five distinct fields:
The test1 MCQs include neither the correct answer nor the related explanation.
To request computational resources for running Falcon-7B, please fill out the following form: https://forms.office.com/r/Dx2jN5SWG8
Here is one example from the dataset:
question 2045: {
"question": "What is the maximum number of eigenmodes that the MIMO channel can support? (nt is the number of transmit antennas, nr is the number of receive antennas)",
"option 1": "nt",
"option 2": "nr",
"option 3": "min(nt, nr)",
"option 4": "max(nt, nr)",
"answer": "option 3: min(nt, nr)",
"explanation": "The maximum number of eigenmodes that the MIMO channel can support is min(nt, nr).",
"category": "Research publications"}
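As a rough illustration of how a record like the one above might be handled, the following Python sketch parses the example, collects the options in numeric order, and recovers the answer index when an answer is present. It assumes the outer key is a quoted string ("question 2045") so that the text is valid JSON; the helper names parse_record and build_prompt, and the prompt wording, are hypothetical and not part of the dataset or the challenge tooling.

```python
import json
import re

# Example record from the dataset description. Quoting the outer key
# ("question 2045") is an assumption made so the text parses as JSON.
SAMPLE = r"""
{
  "question 2045": {
    "question": "What is the maximum number of eigenmodes that the MIMO channel can support? (nt is the number of transmit antennas, nr is the number of receive antennas)",
    "option 1": "nt",
    "option 2": "nr",
    "option 3": "min(nt, nr)",
    "option 4": "max(nt, nr)",
    "answer": "option 3: min(nt, nr)",
    "explanation": "The maximum number of eigenmodes that the MIMO channel can support is min(nt, nr).",
    "category": "Research publications"
  }
}
"""


def parse_record(record):
    """Return (question text, options keyed by number, answer number or None).

    Hypothetical helper: test1 records carry no "answer" field, so the
    answer number is None for them.
    """
    options = {}
    for key, value in record.items():
        match = re.fullmatch(r"option (\d+)", key)
        if match:
            options[int(match.group(1))] = value
    options = dict(sorted(options.items()))

    answer_number = None
    answer = record.get("answer")
    if answer is not None:
        match = re.match(r"option (\d+)", answer)
        if match:
            answer_number = int(match.group(1))
    return record["question"], options, answer_number


def build_prompt(question, options):
    """Format a question and its options as plain text (illustrative wording)."""
    lines = [question]
    lines += [f"{number}. {text}" for number, text in options.items()]
    lines.append("Answer with the option number.")
    return "\n".join(lines)


data = json.loads(SAMPLE)
record = data["question 2045"]
question, options, answer_number = parse_record(record)
prompt = build_prompt(question, options)

# Simulate a test1 record by dropping the two withheld fields.
test1_like = {k: v for k, v in record.items() if k not in ("answer", "explanation")}
_, _, missing_answer = parse_record(test1_like)
```

Matching option keys with a pattern rather than hard-coding four options is a deliberate choice here, since the description does not state that every question has exactly four options.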