Hello everyone,
I'm quite new to transformers. I started this competition to learn more about them. Thanks to Hugging Face's transformers library, things are pretty simple to set up.
I've tried BERT (base-uncased), RoBERTa (base and large), and DistilBERT, and so far the best score I got (with RoBERTa) is around 0.37...
The classifier I put on top of them is a simple linear layer with 4 outputs (multi-label classification).
I also noticed that these models easily overfit on the dataset, within a few epochs (like 4 or 5).
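For reference, a head like the one I described might look like this minimal PyTorch sketch (the hidden size, dropout rate, and use of the pooled output are assumptions about a typical setup, not my exact code):

```python
import torch
import torch.nn as nn

class MultiLabelHead(nn.Module):
    """Hypothetical multi-label head: one linear layer over the encoder's
    pooled output (e.g. BERT's [CLS] vector, hidden size 768 assumed)."""

    def __init__(self, hidden_size=768, num_labels=4, dropout=0.1):
        super().__init__()
        self.dropout = nn.Dropout(dropout)
        self.classifier = nn.Linear(hidden_size, num_labels)

    def forward(self, pooled_output):
        # Raw logits; for multi-label, each label gets its own sigmoid.
        return self.classifier(self.dropout(pooled_output))

head = MultiLabelHead()
logits = head(torch.randn(8, 768))  # batch of 8 pooled vectors
# BCEWithLogitsLoss = independent sigmoid per label, not a softmax
loss = nn.BCEWithLogitsLoss()(logits, torch.zeros(8, 4))
```

The key point for multi-label is the loss: `BCEWithLogitsLoss` treats the 4 outputs as independent, whereas a softmax + cross-entropy would force them to compete.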
What is your experience with transformers?
Do you have any tips/best practices to share, in particular on small datasets with short phrases?
Let's learn together :)
A possible idea could be to leverage a transformer to generate synthetic data and then train on it.
Have you tried this approach? I'm doubtful about the quality of the generated results given the small size of the dataset.
No, I have not tried it. Also, are you using simpletransformers? You could try language-modelling fine-tuning first.
I did not try LM fine-tuning. I'll look into it, thank you!
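For anyone else looking into it: the core of masked-LM fine-tuning is the BERT-style dynamic masking applied to the inputs before computing the LM loss. A rough sketch of just that masking step in plain PyTorch (the mask token id and vocab size are BERT-ish assumptions, not from this thread):

```python
import torch

def mask_tokens(input_ids, mask_token_id=103, vocab_size=30522,
                mlm_prob=0.15):
    """BERT-style masking sketch: select ~15% of tokens; of those,
    80% become [MASK], 10% a random token, 10% stay unchanged."""
    labels = input_ids.clone()
    masked = torch.bernoulli(torch.full(input_ids.shape, mlm_prob)).bool()
    labels[~masked] = -100  # -100 is ignored by the LM loss

    input_ids = input_ids.clone()
    # 80% of the selected tokens -> [MASK]
    replace = torch.bernoulli(torch.full(input_ids.shape, 0.8)).bool() & masked
    input_ids[replace] = mask_token_id
    # half of the remainder (10% overall) -> random token
    rand = (torch.bernoulli(torch.full(input_ids.shape, 0.5)).bool()
            & masked & ~replace)
    input_ids[rand] = torch.randint(vocab_size, input_ids.shape)[rand]
    return input_ids, labels
```

In practice the transformers library handles this for you (its data collator for language modelling does exactly this kind of masking), so you wouldn't write it by hand; this is just to show what the fine-tuning objective sees.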
Are you using the simpletransformers package?
No, transformers only. Is simpletransformers better?
It provides a nice wrapper around transformers, which is good for a noob like me.
Any tips on how I should choose my batch size for this small dataset?