Data description
In this challenge, you are given a dataset that links skills to occupations (jobs). Your task is to learn these relationships and predict the most relevant jobs for a new set of skills.
Train Data
The training dataset contains examples of skill sets and their corresponding occupations.
-
ID: A unique identifier for each sample.
-
skill_1 to skill_5: Five skills associated with an individual or job profile. Each skill is represented by a unique code.
-
occ_1 to occ_5: Five occupations linked to the given skills. These are the target variables you are trying to predict, also represented by standardized occupation codes.
Each row shows a relationship between a set of skills and multiple relevant occupations.
Test Data
The test dataset has the same structure as the training data, but without the occupation columns.
-
ID: A unique identifier for each sample.
-
skill_1 to skill_5: Five skills for which you need to predict relevant occupations.
Your task is to predict the top 5 most relevant occupations for each row.