Finetuning summarization model using long text data

Two main sources that I consulted first:

  1. HF documenatation
  2. HF Causal lanugage modeling - preparing datasets (YT video)

Then I tried the code on a very simple example like couple of sentences with max_length=4 to see it in action and whether it is behaving as expected.

Hope this help.

1 Like