Hello, Iām trying to summarize my own dataset using longt5 model, so I used official sample code for summarization released to huggingface notebooks here, and there is problem.
I checked that using my own dataset is not a problem so first I want to run this code with its offered example dataset āxsumā.
When I run this code with xsum dataset using the original āt5-smallā model itās working well, so I only changed the model from t5 to longt5.
The result is that training takes the proper time as if itās training well, but the result is all 0 or nan value like this.
Can anyone give me some help to fix this problem? Is there a problem to compute_metrics?
I fixed the problem by myself. In huggingface github there was someone claiming same issue with me. He said the problem occurs when fp16 is turned on. So I turned off fp16 and now itās working.