Fine tune Transformers for text generation

mwitiderrick · April 4, 2022, 8:44am

Hello

Is there an example like this one (Fine-tune a pretrained model) for fine tuning HF transformers for text generation?

merve · April 4, 2022, 10:10am

@mwitiderrick Hello
You can check out this link for all example notebooks.

mwitiderrick · April 4, 2022, 3:01pm

Hello @merve, thanks for the response, indeed I found the notebooks very useful. One follow-up question.

When I run predictions like this for a binary problem

import tensorflow as tf
predicted_class_id = int(tf.math.argmax(logits, axis=-1)[0])
bert.config.id2label[predicted_class_id]

I get the result as LABEL_1, How do I know if this is the prediction for class 0 or 1.

Thanks.

merve · April 4, 2022, 3:46pm

Hello Derrick,

Can you send me the model repo so that I can see the config file?

mwitiderrick · April 4, 2022, 5:01pm

Hello @merve

Not sure about the repo but the model is

from transformers import TFAutoModelForSequenceClassification
model = TFAutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

merve · April 5, 2022, 8:13am

Hello Derrick,

Sorry it’s my fault, label is already a label I meant, which dataset is the model fine-tuned on?

mwitiderrick · April 5, 2022, 8:46am

Hello @merve
It’s imdb

dataset = load_dataset("imdb")

mwitiderrick · April 6, 2022, 5:10am

Any update on this @merve ?

merve · April 6, 2022, 10:16am

@mwitiderrick In the page of the dataset you can see the label for 1 is positive.

mwitiderrick · April 6, 2022, 12:01pm

Wanted to clarify that LABEL_1 means label 1 and not 0

tbomez · May 22, 2023, 5:11pm

hi @mwitiderrick, which HF notebook did you use for fine-tuning a model for text generation? thank you, Tom

Pranavagrl · July 27, 2023, 6:08am

Hi @mwitiderrick I am a beginner I was trying to fine-tune the model to generate text I will be a great help if you will be able to provide your notebook to me.
I am ready to use any model of LlamaForCasualLM architecture any dataset will be fine for me I just want to understand the Concept and practical Implementation Behind that.

Topic		Replies	Views
Training GPT2 Text generation model with classification labels 🤗Transformers	0	640	December 7, 2022
Text classification and generation from the same model Beginners	1	827	July 27, 2023
Fine Tune text generation Model using different type of data 🤗Transformers	0	352	August 1, 2023
Tutorial: Fine-tuning with custom datasets – sentiment, NER, and question answering 🤗Transformers	19	12857	February 12, 2024
Fine tune text generation model Beginners	0	263	January 16, 2024

Fine tune Transformers for text generation

Related topics