Primary competition visual

Hulkshare Recommendation Algorithm Challenge

$7 500 USD
Challenge completed over 3 years ago
Prediction
Collaborative Filtering
510 joined
70 active
Starti
Feb 03, 22
Closei
May 01, 22
Reveali
May 01, 22
Data clarification
Help Ā· 13 Mar 2022, 12:06 Ā· edited ~6 hours later Ā· 7

Dear @ZINDI!

Could you please clarify some moments about data? Here is a screenshot from StarterNotebook: https://disk.yandex.ru/i/rJ_OYRd1Nr67Kg

My questions are written in magenta color:

1. How could straight intervals in certain video and session have different slopes? Is there a kind of playback speed change?

2. How can the slope be negative (for ~850-950 frames)? It seems like reverse playback.

UPD:

3. What is vidTime? It has correlation = 1 with the 'frame' column, so it could not be the duration of the current session. Please fix the data description https://zindi.africa/competitions/hulkshare-recommendation-algorithm-challenge/data

4. There are some pairs (video_id, session_id) having more than 1k frames and (e.g (video_id='ID_00FMRXW972', session_id='459deba29f86115427c7ae2f36a0f72747f984e5' has 2k frames and vidTime points). What does it mean? Whether the data is corrupted (duplicated)?

Discussion 7 answers

Hi

Can't clarify everything, but yes, the playback speed may change. And also the user can skip part of the video, go back and forward.

14 Mar 2022, 09:38
Upvotes 0
User avatar
flamethrower

It will be good if we can have more clarity on the data to understand best way to approach the problem. For the same vid id, two sessions showing the same linear pattern but with different slopes have different session duration and VidTime. For instance, one shows 3s and another 1200s.

Both show no pauses, forwards, rewinds. I'm wondering did the session of 3s actually listen to a song in 3s for a video duration, that should be around 1400s. Can we say both listened to the same song and completed the session without pauses, rewinds, forwards?

My thinking was for a linear listening pattern, the session durations should be similar. Is the video duration given determined for a specific playback speed?

Thank you for the clarification @Zindi @DmitriyM

session duration is the time that the user spent watching the video. The video may be 2000s long, but one user watched it for 3s and another one for 1400s.

User avatar
flamethrower

Thank you very much for the clarification.

I think I understand better now, so either way a listening session is split into 1000 frames. I see the one with 3 seconds didn't finish the song.

Thank you.

Yes. Each session is split into 1000 frames. Each frame represents session_duration/1000 (s). So if the session_duration changes, the frame size also changes, but the amount of frames don't.

User avatar
flamethrower

This helps a lot. Thank you for the clarity

1. How could straight intervals in certain video and session have different slopes? Is there a kind of playback speed change?

PLAY, FAST FORWARD, REWIND, PAUSE. The slope of the line represents a viewing pattern. The spot is a visual function representing mode per time.

your welcome to contunue the conversation on our discourd https://discord.gg/TwsnzK8k

12 Apr 2022, 08:21
Upvotes 0