Having troubel in understanding what loss is currently in use

aquorio15 · November 24, 2023, 10:23am

I was going through this hugging face code and I am having trouble understanding what loss the model is currently using. Although I know most seq2seq models uses CrossEntrophy loss but I don’t see the definition anywhere in the code

github.com

huggingface/transformers/blob/aca6288ff42cebded5421020f0ff088adeb446dd/examples/language-modeling/run_clm.py

#!/usr/bin/env python
# coding=utf-8
# Copyright 2020 The HuggingFace Inc. team. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""
Fine-tuning the library models for causal language modeling (GPT, GPT-2, CTRL, ...) on a text file or a dataset.

Here is the full list of checkpoints on the hub that can be fine-tuned by this script:
https://huggingface.co/models?filter=causal-lm

This file has been truncated. show original

Actually I want to train the model with a new custom loss. I have trained a baseline model and its working fine.

Thank You

panigrah · November 24, 2023, 12:21pm

Most of the pre trained model classes have a default loss function that gets called inside the train method. I usually lookup the model source code to find out the loss method if I need to.

If you are using the trainer class you can override the loss method by following the first example on this link.

Topic		Replies	Views
How was self.loss_function implemented 🤗Transformers	4	30	June 9, 2025
Finetuning GPT2 with user defined loss Beginners	56	16089	July 23, 2023
Alternative Language Modeling Loss Calculation 🤗Transformers	0	78	September 25, 2024
Finetuning sentence embedding model with SageMaker - how to compute loss? Amazon SageMaker	9	3951	December 21, 2022
"run_lm_finetuning.py" was replaced? Beginners	5	4645	June 1, 2021

Having troubel in understanding what loss is currently in use

Related topics