T5 user defined loss function

chrisdoyleIE · August 7, 2020, 8:21am

You need a differentiable model to do the sampling for you.

Let V be the set of words in the vocabulary. Some models frame this as a reinforcement learning problem with a state vector x of dimension |V|, such that each x_i can be any integer in V, and a discrete action space consisting of all integers in V.
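To make the "actions are vocabulary indices" framing concrete, here is a minimal sketch of a score-function (REINFORCE) update over a toy vocabulary, in plain Python with no autograd. Everything in it is an assumption for illustration: `VOCAB`, `TARGET`, `toy_reward`, and the single-step policy are hypothetical, and this is not the method from any particular paper or from T5 itself. With a real sequence model you would presumably apply the same idea to the log-probabilities of the sampled tokens rather than to a bare logit vector.

```python
import math
import random

# Hypothetical toy vocabulary; an action is an integer index into it.
VOCAB = ["<pad>", "hello", "world", "foo"]
TARGET = 2  # hypothetical "good" token used by the toy reward


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toy_reward(token_id):
    """Non-differentiable, user-defined reward (assumption: 1 for TARGET, else 0)."""
    return 1.0 if token_id == TARGET else 0.0


def reinforce_step(logits, reward_fn, lr=0.5, rng=random):
    """Sample one action and take one gradient-ascent step on reward * log p(action)."""
    probs = softmax(logits)
    action = rng.choices(range(len(logits)), weights=probs, k=1)[0]
    reward = reward_fn(action)
    # d/d(logits) of log p(action) for a softmax policy is onehot(action) - probs.
    grad_logp = [(1.0 if i == action else 0.0) - p for i, p in enumerate(probs)]
    new_logits = [z + lr * reward * g for z, g in zip(logits, grad_logp)]
    return new_logits, action, reward


def train(steps=500, seed=0):
    """Run the toy policy-gradient loop and return the final action probabilities."""
    rng = random.Random(seed)
    logits = [0.0] * len(VOCAB)
    for _ in range(steps):
        logits, _, _ = reinforce_step(logits, toy_reward, rng=rng)
    return softmax(logits)


if __name__ == "__main__":
    final_probs = train()
    for token, p in zip(VOCAB, final_probs):
        print(f"{token}: {p:.3f}")
```

The point of the sketch is that the reward itself never needs to be differentiable: the gradient only flows through the log-probability of the sampled action, which is why the sampling has to come from a differentiable model.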

Someone linked a paper from Salesforce that follows this general idea but adds a few useful bells and whistles.
