Dear Zindi,
I would like to have an official statement that the leadboard is constructed based on 100% of the data set.
"Note that there is Public and Private Leaderboards. The Public Leaderboard excludes approximately 50% of the test dataset. While the competition is open, the Public Leaderboard will rank the submitted solutions by the accuracy score they achieve. Upon close of the competition, the Private Leaderboard, which covers 100% of the test dataset, will be made public and will constitute the final ranking for the competition."
The unveiling of the private leaderboard is popularly known as "the shakeup". Models built to generalise well result in similar or even better performance on the private leaderboard. I have been a victim of overfitting to the public leaderboard where I was 1st position then significantly dropped to 21st position because my model didnt generalise well enough. I hope I was able to provide some clarity.