What does model.generate do I'm not?

chaaaaaaaa · July 29, 2024, 4:45pm

I never solved this, but it is most likely a combination of the following:

Not using the advanced decoding methods inference does (top-k,top-p,beam-search)
The format does not match the original training data.
Hugging face quirks

While I still feel the outputs should at least be in the same ballpark, it’s understandable that the training output is poor.

My suggestion is to use hugging face sparingly. It’s easy to use, but that means it is opaque to what’s going on. I’ve spent a lot of time trying to figure out what the hugging face code is doing when I get unusual results. Unless you are doing “standard” work, it’s best to avoid it. It takes more time, but building the system from lower-level code normally pays off. It may be more complex, but it’s easier to debug and understand.

You can see another issue I have that shows the potential weirdness when using hugging face methods.

Topic		Replies	Views
Llama model outputs strange words Beginners	0	131	December 1, 2024
Logits from generate and model call different 🤗Transformers	2	929	January 26, 2025
Provide examples to model before inferencing and how to cache the examples Beginners	0	20	March 5, 2025
Generating Once for 16 Tokens is Not Same Generating Single Token 16 Times? 🤗Transformers	4	279	April 17, 2024
Inconsistency in logit values between generation and direct model prediction #31127 🤗Transformers	0	210	May 30, 2024

What does model.generate do I'm not?

Related topics