Custom Decoding Strategy

cm2001 · December 6, 2023, 9:55pm

Hi,

I am currently working on a project that involves controllable text generation.
To do this, I am making a ‘post-processing module’ which involves reranking the logits during the decoding process.

To start, I have forked transformers and added a new GenerationMode for the GenerationMixin
(GenerationMode.TEST_GREEDY_SEARCH, for this example).

Within greedy_search(), I believe next_tokens_scores holds the logit scores for the candidate next tokens, but I am not sure.

One of the problems is that I need to decode the token ids behind the logits into human-readable words so I can use them in my post-processing module. At the moment I am just importing the tokeniser I need into the function, but I do not know how to get the current tokens.

In summary, I need the token and its probability for each entry in the logits.
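
To make concrete what I am after, here is a minimal sketch in plain Python. Everything in it is a stand-in I made up for illustration: toy_vocab plays the role of the tokeniser's id-to-token mapping, toy_scores plays the role of one row of next_tokens_scores, and top_token_probs is a hypothetical helper. In the real fork I assume the equivalent would be something like torch.softmax(next_tokens_scores, dim=-1) followed by tokenizer.convert_ids_to_tokens(...), but I have not confirmed that this is the right place to do it.

```python
import math


def softmax(scores):
    """Convert raw scores (logits) into probabilities that sum to 1."""
    highest = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_token_probs(scores, id_to_token, k=3):
    """Return the k most likely (token, probability) pairs for one decoding step.

    Hypothetical helper: `scores` is one score per vocabulary id, and
    `id_to_token` maps a vocabulary id (the list index) to its token string.
    """
    probs = softmax(scores)
    # Rank vocabulary ids from most to least probable.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return [(id_to_token[i], probs[i]) for i in ranked[:k]]


# Toy stand-ins (assumptions, not real model output).
toy_vocab = ["the", "cat", "dog", "sat"]
toy_scores = [2.0, 1.0, 0.5, -1.0]

pairs = top_token_probs(toy_scores, toy_vocab, k=2)
for token, prob in pairs:
    print(f"{token}: {prob:.3f}")
```

The list of (token, probability) pairs is what I would like to hand to my post-processing module at every decoding step, before the next token is chosen.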

Any help would be greatly appreciated.
