Custom Loss for Pretrained TF HF Models

jjdv · May 15, 2022, 9:07am

Maybe a bit of a dumb question, but I’m trying to fine-tune a simple POS tagger like so:

    model = transformers.TFAutoModelForTokenClassification.from_pretrained('distilbert-base-multilingual-cased', num_labels=len(LABELS))
    optimizer = tf.keras.optimizers.Adam(learning_rate=lr)
    loss = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
    model.compile(optimizer=optimizer,
                  loss={"loss": loss},
                  metrics=tf.keras.metrics.SparseCategoricalAccuracy())
    model.fit(train,
              epochs=args.epochs)

but it complains that the target for getting the gradient is None:

    TypeError: Target should be a list or nested structure of Tensors or Variables to be differentiated, but recieved None

I tried passing the loss both directly and as a dictionary, like in the snippet here, and neither way works. How do I pass a custom loss properly?

I’m at a loss since the example code pretty much does the same thing I’m doing.
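
One thing that may be worth checking, though it is not confirmed as the cause of the TypeError above: the line that creates `loss` in the snippet ends with a comma, and in Python that makes `loss` a one-element tuple rather than the loss object. The sketch below shows the effect in plain Python; `FakeLoss` is a hypothetical stand-in for `tf.keras.losses.SparseCategoricalCrossentropy`, used only so the example runs without TensorFlow.

```python
# Hypothetical placeholder for a Keras loss object, so this runs without TensorFlow.
class FakeLoss:
    def __call__(self, y_true, y_pred):
        return 0.0

# Trailing comma: the right-hand side is a 1-tuple containing the loss.
loss_with_comma = FakeLoss(),

# No trailing comma: the name is bound to the loss object itself.
loss_without_comma = FakeLoss()

print(type(loss_with_comma).__name__)   # tuple
print(callable(loss_with_comma))        # False
print(callable(loss_without_comma))     # True
```

If that turns out to matter here, dropping the trailing comma would at least ensure `model.compile` is handed a callable loss rather than a tuple.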
