How can I fuse the logits from different models and then convert the result to tokens?
Suppose I have two logits tensors x, y from GPT2-base and GPT2-large, both with the same shape: (8, 50, 32000).
8 is the batch size, 50 is the length of the input tokens, and 32000 is the vocab size.
Then I fuse x and y using: Z = x + y
So, how can I convert Z to tokens with a generation function?
I have tried [argmax(z) for z in Z], but the results are bad. It seems this is because I didn't consider the stopping criteria, but how should I handle that?
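One likely issue with the argmax-over-all-of-Z approach: Z has shape (batch, seq_len, vocab), so taking argmax everywhere re-predicts a token for every input position. For generation you only want the argmax of the last position's fused logits, then append that token and run the models again, stopping when an end-of-sequence token appears or a max length is reached. Here is a minimal greedy-decoding sketch of that loop; the two model functions are random stand-ins (not real GPT-2 calls), and `EOS_ID` is a hypothetical end-of-sequence id, so adapt them to your actual models and tokenizer (both models must share the same vocabulary for x + y to be meaningful):

```python
import numpy as np

VOCAB_SIZE = 32000   # vocab size from the question (GPT-2's actual vocab is 50257)
EOS_ID = 2           # hypothetical end-of-sequence token id; use your tokenizer's
MAX_NEW_TOKENS = 20  # fallback stopping criterion

rng = np.random.default_rng(0)

def model_a_logits(input_ids):
    """Stand-in for GPT2-base: returns (batch, seq_len, vocab) logits."""
    b, t = input_ids.shape
    return rng.standard_normal((b, t, VOCAB_SIZE))

def model_b_logits(input_ids):
    """Stand-in for GPT2-large: same shape as model_a_logits."""
    b, t = input_ids.shape
    return rng.standard_normal((b, t, VOCAB_SIZE))

def fused_greedy_generate(input_ids):
    finished = np.zeros(input_ids.shape[0], dtype=bool)
    for _ in range(MAX_NEW_TOKENS):
        x = model_a_logits(input_ids)
        y = model_b_logits(input_ids)
        z = x + y                               # fuse the two models' logits
        # Only the LAST position's logits predict the next token.
        next_ids = z[:, -1, :].argmax(axis=-1)
        # Once a sequence has emitted EOS, keep padding it with EOS.
        next_ids = np.where(finished, EOS_ID, next_ids)
        input_ids = np.concatenate([input_ids, next_ids[:, None]], axis=1)
        finished |= next_ids == EOS_ID
        if finished.all():                      # all sequences hit EOS: stop early
            break
    return input_ids

prompt = rng.integers(0, VOCAB_SIZE, size=(8, 50))
out = fused_greedy_generate(prompt)
```

With real models you would decode `out` back to text with the shared tokenizer. Note that greedy argmax is only one decoding strategy; sampling from softmax(Z) at the last position, or beam search, may give better text.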