🧬 Must-Read: Low test accuracy

InstaDeep Enzyme Classification Challenge



serg132003

Low test accuracy

Data · 15 Jan 2021, 19:57 · 2

I don't understand: train accuracy is higher than 90, validation accuracy is close to 90, and the curve is converging nicely, BUT test accuracy is lower than 80! Why is that? Does the test data differ that much from train? How to fight that? Thanks for any ideas.

Discussion 2 answers

Kamenialexnea

Ecole nationale superieure polytechnique yaounde

train creatures: array(['creature9', 'creature3', 'creature8', 'creature4', 'creature0', 'creature2', 'creature5', 'creature1'], dtype=object)
test creatures: array(['creature7', 'creature6'], dtype=object)

I think this can explain the difference: the test set contains only creatures that never appear in train.

13 Feb 2021, 02:59
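
The check in the answer above, and one way to make validation behave more like the test set, can be sketched as follows. This is only a minimal illustration under assumptions: the creature labels below are toy values rather than the challenge files, and `creature_overlap` and `leave_creatures_out` are hypothetical helper names. The idea is to hold out every sequence of one or more creatures for validation, so the validation score is measured on creatures the model has never seen, as happens with the test set.

```python
def creature_overlap(train_creatures, test_creatures):
    """Return the set of creatures that appear in both splits."""
    return set(train_creatures) & set(test_creatures)


def leave_creatures_out(creatures, holdout):
    """Split row indices so every row of a held-out creature goes to validation."""
    holdout = set(holdout)
    train_idx, valid_idx = [], []
    for i, creature in enumerate(creatures):
        if creature in holdout:
            valid_idx.append(i)
        else:
            train_idx.append(i)
    return train_idx, valid_idx


# Toy labels, one per sequence (hypothetical data, not the challenge files).
train_creatures = ["creature9", "creature3", "creature8",
                   "creature3", "creature0", "creature9"]
test_creatures = ["creature7", "creature6", "creature7"]

# An empty set means no creature is shared between train and test.
shared = creature_overlap(train_creatures, test_creatures)

# Validate on a creature the model never trains on.
train_idx, valid_idx = leave_creatures_out(train_creatures, holdout=["creature9"])

print(shared)     # set()
print(train_idx)  # [1, 2, 3, 4]
print(valid_idx)  # [0, 5]
```

For full cross-validation, scikit-learn's `GroupKFold` and `LeaveOneGroupOut` in `sklearn.model_selection` do the same kind of split when the creature column is passed as `groups`.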


serg132003

Yes, I understand that. But usually we evaluate our model on a validation set taken from train, assuming the distribution of features is the same in train and test. Such a big difference shows that the structure of the test creatures actually differs from train. Humza showed that, for example, the test set doesn't include some amino acids (X, U) that are present in train. But I believe it is not the only reason.

replied to Kamenialexnea · 13 Feb 2021, 15:11
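
The amino-acid difference mentioned in the reply above can be checked with a few lines. This is a minimal sketch under assumptions: the sequences are toy strings, not the challenge data, and `residue_counts` is a hypothetical helper name. It counts every residue letter in each split and lists the letters that occur in only one of them.

```python
from collections import Counter


def residue_counts(sequences):
    """Count how often each residue letter occurs across all sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq)
    return counts


# Toy sequences (hypothetical, not the challenge data).
train_seqs = ["MKVLAX", "MUTAGL", "MKKLV"]
test_seqs = ["MKVLA", "MTAGL"]

train_counts = residue_counts(train_seqs)
test_counts = residue_counts(test_seqs)

# Letters seen in one split but never in the other.
only_in_train = sorted(set(train_counts) - set(test_counts))
only_in_test = sorted(set(test_counts) - set(train_counts))

print(only_in_train)  # ['U', 'X']
print(only_in_test)   # []
```

Comparing the full counts, not just the letter sets, would also show whether residue frequencies shift between train and test, which is one more thing that could differ besides the missing letters.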

