I have a TF GPT-2 LMHead model running on TF Serving, and I want to do beam search (multiple-token output) with the model's output logits.
payload = {"inputs": input_padded}
requests.post("http://localhost:8501/v1/models/gpt2-farmacos5:predict", data=json.dumps(payload))
That request returns a tensor of logits for the next token. But how do I convert these logits into a multi-token output, like Hugging Face's gpt2model.generate() method (but against TF Serving)?
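Not an answer from the thread, but here is a minimal sketch of the simplest case (greedy decoding): call the model once per step, take the argmax of the last position's logits, append it, and repeat. The `tf_serving_predict` wrapper, the payload shape, and the assumption that the response is a `[batch, seq_len, vocab]` logits tensor are all guesses about the exported signature and would need adjusting to your model.

```python
import json

import requests


def greedy_generate(predict_fn, input_ids, max_new_tokens, eos_token_id=None):
    """Greedy decoding loop.

    predict_fn(token_ids) must return the next-token logits as a flat
    list of vocab_size floats (e.g. by POSTing to TF Serving).
    """
    token_ids = list(input_ids)
    for _ in range(max_new_tokens):
        logits = predict_fn(token_ids)
        # Greedy decoding = argmax over the vocabulary at each step.
        next_id = max(range(len(logits)), key=lambda i: logits[i])
        token_ids.append(next_id)
        if eos_token_id is not None and next_id == eos_token_id:
            break
    return token_ids


def tf_serving_predict(token_ids):
    """Hypothetical wrapper around the TF Serving REST endpoint from the
    question; the exact request/response shape depends on how the model
    was exported."""
    payload = {"inputs": [token_ids]}
    resp = requests.post(
        "http://localhost:8501/v1/models/gpt2-farmacos5:predict",
        data=json.dumps(payload),
    )
    outputs = resp.json()["outputs"]
    # Assuming logits of shape [batch, seq_len, vocab]: keep only the
    # logits for the last position of the first batch element.
    return outputs[0][-1]
```

Usage would then be something like `greedy_generate(tf_serving_predict, input_ids, max_new_tokens=20)`. This re-sends the whole prefix every step (no KV cache), so it is slow but simple.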
PS: I know there is transformers/generation_tf_utils.py at master · huggingface/transformers · GitHub, but I need a simpler implementation.
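For reference, a bare-bones beam search over the same kind of `predict_fn` can be much shorter than generation_tf_utils.py. This is a sketch, not Hugging Face's implementation: it assumes `predict_fn(token_ids)` returns next-token log-probabilities (so you would apply a log-softmax to the TF Serving logits first), and it omits length penalty, EOS handling, and batching.

```python
import math


def log_softmax(logits):
    """Convert raw logits to log-probabilities (numerically stable)."""
    m = max(logits)
    z = math.log(sum(math.exp(x - m) for x in logits))
    return [x - m - z for x in logits]


def beam_search(predict_fn, input_ids, max_new_tokens, num_beams=3):
    """Minimal beam search: keep the num_beams highest-scoring
    hypotheses, where a hypothesis score is its cumulative log-prob."""
    beams = [(list(input_ids), 0.0)]  # (token_ids, cumulative log-prob)
    for _ in range(max_new_tokens):
        candidates = []
        for tokens, score in beams:
            log_probs = predict_fn(tokens)
            # Expand each beam with only its top num_beams continuations.
            top = sorted(range(len(log_probs)),
                         key=lambda i: log_probs[i], reverse=True)[:num_beams]
            for tok in top:
                candidates.append((tokens + [tok], score + log_probs[tok]))
        # Prune back down to the best num_beams hypotheses overall.
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:num_beams]
    return beams[0][0]  # tokens of the best hypothesis
```

With `num_beams=1` this degenerates to greedy decoding, which is a quick sanity check for the loop.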
PS 2: I know this post is similar to Patrick's post "Generation Probabilities: How to compute probabilities of output scores for GPT2", but he's working in PyTorch and I'm in TensorFlow. Maybe I should migrate to PyTorch?
Thanks