What is the intended difference between the transformers.LogitsProcessor and transformers.LogitsWarper classes?
Hey @dpaleka, .generate() maintainer here. The two classes are very similar; the biggest distinction is that transformers.LogitsWarper subclasses are only used with sampling strategies (i.e. when you pass do_sample=True).
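
To make the distinction concrete, here is a minimal pure-Python sketch of the idea, not the library's actual code. It assumes both kinds of object are callables of the form (input_ids, scores) -> scores; the names block_token_processor, temperature_warper and next_token_scores are hypothetical and exist only for this illustration.

```python
import math
from typing import Callable, List

Scores = List[float]
# Assumed shared call shape for both kinds of object: (input_ids, scores) -> scores.
ScoreFn = Callable[[List[int], Scores], Scores]


def block_token_processor(banned_id: int) -> ScoreFn:
    """Hypothetical processor: forbids one token under any decoding strategy."""
    def process(input_ids: List[int], scores: Scores) -> Scores:
        return [-math.inf if i == banned_id else s for i, s in enumerate(scores)]
    return process


def temperature_warper(temperature: float) -> ScoreFn:
    """Hypothetical warper: rescales scores; this does not change a greedy argmax."""
    def warp(input_ids: List[int], scores: Scores) -> Scores:
        return [s / temperature for s in scores]
    return warp


def next_token_scores(input_ids, scores, processors, warpers, do_sample):
    # Processors run for every decoding strategy.
    for process in processors:
        scores = process(input_ids, scores)
    # Warpers run only when sampling is enabled.
    if do_sample:
        for warp in warpers:
            scores = warp(input_ids, scores)
    return scores


processors = [block_token_processor(banned_id=0)]
warpers = [temperature_warper(temperature=2.0)]
raw = [4.0, 2.0, 1.0]

greedy = next_token_scores([7], raw, processors, warpers, do_sample=False)
sampled = next_token_scores([7], raw, processors, warpers, do_sample=True)
print(greedy)   # token 0 blocked, remaining scores untouched
print(sampled)  # token 0 blocked, remaining scores also divided by the temperature
```

In this sketch the processor's constraint holds in both calls, while the warper's rescaling shows up only in the do_sample=True call.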