Mistral trouble when fine-tuning : Don't set pad_token_id = eos_token_id
|
8
|
5951
|
August 28, 2024
|
GPT2 finetuned with eos token will never yield eos token during generation
|
6
|
3384
|
April 12, 2024
|
Transformers v3.0.0 is out!
|
0
|
1941
|
July 7, 2020
|
Labels in language modeling: which tokens to set to -100?
|
1
|
3481
|
November 30, 2020
|
Issue with finetuning a seq-to-seq model
|
30
|
3963
|
August 11, 2022
|