Completely different results for model in pipeline and by itself

Ah, actually, I think I see the issue right after posting it.
Could it be about how the tokenizer is set up?
In my preprocessing I make sure to use overlapping windows, padding, and max length = model.max_length.
I'm guessing that by default, when I just pass the tokenizer to the pipeline, none of these get set.
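For reference, this is roughly the kind of preprocessing I mean, a sketch assuming a SQuAD-style checkpoint (the model name and the 384/128 values are just example choices):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-cased-distilled-squad")

# Split a long context into overlapping windows, each padded to max_length.
encodings = tokenizer(
    "What is the capital of France?",
    "Paris is the capital of France. " * 50,  # deliberately long context
    truncation="only_second",        # only truncate the context, never the question
    max_length=384,
    stride=128,                      # overlap between consecutive windows
    return_overflowing_tokens=True,  # emit one encoding per window
    padding="max_length",
)

# One entry in input_ids per overlapping window.
print(len(encodings["input_ids"]))
```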

Can I pass a pre-configured tokenizer?
Figured it out: there are two parameters, max_seq_len and max_answer_len, that can be set during the call but not at initialization.
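In case anyone hits the same thing, a minimal sketch of passing them at call time (the checkpoint and the specific values are just examples; `doc_stride` controls the window overlap):

```python
from transformers import pipeline

# Any extractive-QA checkpoint works here; this one is just an example.
qa = pipeline("question-answering", model="distilbert-base-cased-distilled-squad")

result = qa(
    question="What is the capital of France?",
    context="Paris is the capital of France. " * 50,
    max_seq_len=384,     # window size, set per call rather than at init
    doc_stride=128,      # overlap between consecutive windows
    max_answer_len=30,   # cap on the length of the extracted answer span
)

print(result["answer"])
```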

thanks a lot!