
Has anyone tried using their fine-tuned LLM for this task in their RAG system? How are you handling the limited context window? I'm aware you can use LangChain to limit the chunks plus the query that get passed in at query time, but that may reduce the accuracy of the RAG system. How can one increase an LLM's context window? I saw that the Gemma large language model has a 20,000-token context window. If anyone has successfully fine-tuned Gemma for this task, please jump into this discussion.
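For what it's worth, here is roughly how I'm thinking about budgeting the retrieved chunks against the context window myself, instead of leaving it entirely to LangChain. This is just a sketch; the Gemma model ID, the 2048-token window, and retrieved_chunks are placeholders for whatever your own setup uses:

from transformers import AutoTokenizer

# Placeholder model ID; swap in whatever model/tokenizer you fine-tuned.
tokenizer = AutoTokenizer.from_pretrained("google/gemma-2b-it")

CONTEXT_WINDOW = 2048      # whatever the model actually supports
RESERVED_FOR_ANSWER = 256  # leave room for the generated answer

def pack_chunks(query: str, retrieved_chunks: list[str]) -> str:
    """Keep adding retrieved chunks until the prompt would overflow the window."""
    budget = CONTEXT_WINDOW - RESERVED_FOR_ANSWER - len(tokenizer.encode(query))
    kept = []
    for chunk in retrieved_chunks:
        cost = len(tokenizer.encode(chunk))
        if cost > budget:
            break  # dropping the remaining chunks is exactly where the accuracy hit comes from
        kept.append(chunk)
        budget -= cost
    return "\n\n".join(kept) + "\n\nQuestion: " + query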
Even after setting max_seq_length to 2048 during supervised fine-tuning, I'm still unsure. Here is my setup:

from trl import SFTTrainer

trainer = SFTTrainer(
    model = model,
    train_dataset = dataset['train'],
    dataset_text_field = "text",
    max_seq_length = 2048,  # the setting in question
    args = training_args,
)
I'd appreciate it if someone could clarify whether this max_seq_length translates to a model's context window.
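For reference, this is how I've been checking what the base model itself reports, in case it helps frame the question. I'm assuming max_position_embeddings is the right attribute to look at for Gemma; other architectures may name it differently:

from transformers import AutoConfig, AutoTokenizer

# Placeholder model ID; use the checkpoint you are actually fine-tuning.
config = AutoConfig.from_pretrained("google/gemma-2b-it")
tokenizer = AutoTokenizer.from_pretrained("google/gemma-2b-it")

print(config.max_position_embeddings)  # what the architecture was configured for
print(tokenizer.model_max_length)      # what the tokenizer will truncate to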