Does task specific prefix matters for T5 fine-tuning?

Great part is that T5 performs really well with and without prefix :smile: Here’s what I observed in my experiments.,

  1. It converges slightly faster when using a task prefix and when the task was similar, say summarization
  2. Performed equally well even without prefix, took slightly longer to converge
4 Likes