There are 19,093 audio files in the train set and 3,167 in the test set. You will use these files to train your model and submit your sentences.
The goal of this competition is to build an ASR model that will help illiterate people use existing apps to find which bus they can take to reach their destination, without having to know how to read or write.