Primary competition visual

Title Extraction in Lecture Slides Challenge by ITU AI/ML in 5G Challenge

3 000 Zindi Points
Completed (over 2 years ago)
Computer Vision
160 joined
26 active
Starti
Apr 26, 23
Closei
Jun 11, 23
Reveali
Jun 11, 23
About

We are providing still frame images extracted from webinar videos containing presentation slides with one or more titles. The objective of the challenge is to identify the title(s) in each slide that would be used to annotate the presentations.

Be aware that some training slide images may contain up to 4 titles at once, which is reflected in Train.csv ("Title1", "Title2", .., "Title4" columns). This is done to enrich your training with as much title samples as possible. Note that all our test images can only have a single title and you are expected to predict it.

The Train.csv and SampleSubmission.csv comprise the ID and Title columns where:

  • The labels in the ID columns correspond to the image labels without the leading zeros, i.e. image 000537.jpg in the train set corresponds to ID 537 in the Train.csv file.
  • The Title variable(s) represents the ground truth, i.e. title, of each slide/image in both the train and test set
Files
Description
Files
This is a starter notebook to help you make your first submission. If the file opens weirdly you can ctrl-S and it will save to your download folder.
Contains test set image files.
Contains the train set image files and corresponding title masks.
Contains the image labels along with corresponding slide titles for the train set.
Is an example of what your submission file should look like. The order of the rows does not matter, but the names of the "ID" must be correct.