Constrain the search of the decoder in seq2seq architecture

ugoren · May 31, 2021, 9:03pm

I am trying to train a seq2seq model with constraints on the form of the output sequence.

My input sequence is unconstrained (any sentence), and my output sequence is a formal language that resembles assembly.
The output sequence is a series of triplets of the form (operator, operand_1, operand_2).

An example triplet is (move, living_room, kitchen). Note that once the operator is "move", the type of both operands is determined: each must be a room.

I know all operators, operands and constraints in advance.
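
To make the constraint concrete, here is a minimal sketch of how the set of legal next symbols could be computed from what has been generated so far. Only "move" and the room names come from my example above; the "take" operator, the object names and the function name `allowed_next` are hypothetical and only there for illustration. It also assumes every symbol is a single token, which a real subword tokenizer may not guarantee.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical vocabulary, for illustration only.
ROOMS = ["living_room", "kitchen", "bedroom"]
OBJECTS = ["cup", "book"]

# operator -> (allowed values for operand_1, allowed values for operand_2)
SIGNATURES: Dict[str, Tuple[List[str], List[str]]] = {
    "move": (ROOMS, ROOMS),    # from the example: both operands must be rooms
    "take": (OBJECTS, ROOMS),  # hypothetical second operator
}


def allowed_next(prefix: Sequence[str]) -> List[str]:
    """Return the symbols that may follow `prefix`, a flat list of triplet symbols."""
    position = len(prefix) % 3  # 0: operator slot, 1: operand_1, 2: operand_2
    if position == 0:
        # A new triplet starts, so any known operator is legal.
        return sorted(SIGNATURES)
    # The operator of the triplet currently being filled in.
    operator = prefix[len(prefix) - position]
    return list(SIGNATURES[operator][position - 1])


print(allowed_next(["move"]))
print(allowed_next(["move", "living_room", "kitchen"]))
```

With real token ids instead of strings, the same lookup would decide which vocabulary entries stay available at each decoding step.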

I’ve been looking into overriding the greedy_search method, but adjust_logits_during_generation also looks like an interesting direction.
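
As far as I can tell, either direction boils down to the same operation: before picking the next token, set the score of every disallowed token to negative infinity so that it can never be selected. Below is a rough sketch of that single step in plain Python. It does not call the transformers library, and the names `mask_scores` and `greedy_pick` are my own, not library functions.

```python
import math
from typing import List, Sequence


def mask_scores(scores: Sequence[float], allowed_ids: Sequence[int]) -> List[float]:
    """Keep the scores of allowed token ids and set all others to -inf."""
    allowed = set(allowed_ids)
    return [s if i in allowed else -math.inf for i, s in enumerate(scores)]


def greedy_pick(scores: Sequence[float], allowed_ids: Sequence[int]) -> int:
    """Return the id of the highest-scoring token among the allowed ones."""
    if not allowed_ids:
        raise ValueError("at least one token id must be allowed")
    masked = mask_scores(scores, allowed_ids)
    return max(range(len(masked)), key=masked.__getitem__)


# Token 0 has the best raw score but is not allowed, so token 2 is chosen.
print(greedy_pick([2.0, 0.5, 1.5], allowed_ids=[1, 2]))
```

In a real model the scores would be the decoder logits for the current step, and the allowed ids would come from the known constraints.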

How would you tackle this problem?
