Hello everyone,
I’m currently working with the XGLM models, and I was wondering why the forward function returns `CausalLMOutputWithCrossAttentions` instead of `CausalLMOutputWithPast` (the class used by the causal LM heads of other decoder-only models). The name confused me, because decoder-only models don’t have cross-attention the way encoder-decoder models do.
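For concreteness, here is a small snippet (assuming a recent `transformers` install) that compares the fields of the two output dataclasses; as far as I can tell, the only extra field is `cross_attentions`:

```python
from dataclasses import fields

from transformers.modeling_outputs import (
    CausalLMOutputWithCrossAttentions,
    CausalLMOutputWithPast,
)

# Both output classes are dataclasses, so we can diff their field names
cross_fields = {f.name for f in fields(CausalLMOutputWithCrossAttentions)}
past_fields = {f.name for f in fields(CausalLMOutputWithPast)}

print(cross_fields - past_fields)  # → {'cross_attentions'}
```

So functionally the cross-attention class seems to be a superset of `CausalLMOutputWithPast`, which makes the choice even more puzzling for a decoder-only model.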
Could someone help me understand the differences and the design choice behind this? Thank you all!