Training from scratch

What do you mean prompt accuracy? Something like CLIP score?