Does anyone have an idea of how I can load the dataset? I can't seem to get the actual audio of the dataset to use, here is my code
```
import os
import pandas as pd
from datasets import Dataset, DatasetDict, load_dataset
# Step 1: Read the train.csv file
csv_path = "/content/drive/MyDrive/data/train.csv"
train_df = pd.read_csv(csv_path)
dataset = load_dataset('csv', data_files=csv_path)
```
the audio data is in the test_audios folder
i would rather have my `path="/content/drive/MyDrive/train_audios/` because i tried that as well
could you give a code example please, I would really appreciate it!
You can try this! It will load the training audios and the associated sentence in one dataset.
'''
from dataset import load_dataset
import pandas as pd
train_df = pd.read_csv('train.csv', usecols = ["path","sentence"]).rename(columns={'path': 'file_name', 'sentence': 'transcription'})
from datasets import Dataset
print(train_df.head(5))
#save df with labels in the same folder as training audios
train_df.to_csv("train_audios/metadata.csv", index=False)
dataset = load_dataset("audiofolder", data_dir="train_audios")
'''
for more ref: https://huggingface.co/docs/datasets/audio_load