Hello, I want to fine-tune the pszemraj/led-base-book-summary model on my custom data of bank regulatory documents (15-20 pages each), but the documents are well above the input token limit. I can truncate them, but I believe that would cause a lot of information loss. Can anyone suggest the right way to fine-…

Finetuning summarization model using long text data

itacdonev March 20, 2024, 6:23am 7

Two main sources that I consulted first:

HF documentation
HF Causal language modeling - preparing datasets (YT video)

Then I tried the code on a very simple example, just a couple of sentences with max_length=4, to see it in action and check that it behaves as expected.
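For anyone who wants to try the same kind of sanity check, here is a rough sketch of the idea, assuming the goal is to split a long document into fixed-size chunks instead of truncating it. It is plain Python and not the tokenizer API itself: `chunk_tokens` is a hypothetical helper, and the whitespace split only stands in for a real tokenizer. In Transformers, the fast tokenizers offer comparable behavior through `truncation=True`, `max_length`, `stride` and `return_overflowing_tokens=True`, but check the exact output on your own model.

```python
def chunk_tokens(tokens, max_length, stride=0):
    """Split tokens into windows of at most max_length items.

    Consecutive windows share `stride` tokens. Hypothetical helper,
    written only to illustrate the chunking idea.
    """
    if max_length <= 0:
        raise ValueError("max_length must be positive")
    if not 0 <= stride < max_length:
        raise ValueError("stride must satisfy 0 <= stride < max_length")

    step = max_length - stride
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + max_length])
        # Stop once the current window reaches the end of the sequence.
        if start + max_length >= len(tokens):
            break
    return chunks


# Toy input: a whitespace split stands in for a real tokenizer.
text = "Banks must hold enough capital to absorb unexpected losses."
tokens = text.split()

chunks = chunk_tokens(tokens, max_length=4)
for chunk in chunks:
    print(chunk)

# Same input with one token of overlap between neighbouring chunks.
overlapped = chunk_tokens(tokens, max_length=4, stride=1)
for chunk in overlapped:
    print(chunk)
```

With such a small max_length it is easy to see by eye that no token is dropped and that the overlap lands where you expect, before moving to real document lengths.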

Hope this helps.

