The dataset is divided into:
-
Training Data: Contains sensor readings and corresponding CO2 levels measured by the reference meter. This is used to train the machine learning model.
-
Test Data: Contains sensor readings without CO2 values. Participants must use their models to predict CO2 levels for this dataset, which will be evaluated against unseen ground truth values. This has a 70 - 30 split for Public and Private dataset respectively.