Hey all, I’m trying to retrain the CodeGen model to generate code from prompts. It seems the model was originally trained for code completion as a causal LM, similar to how GitHub Copilot works. I want to use it to generate code from natural-language prompts, e.g. ‘function to perform this operation on a dataframe’ would return the code that performs it. I’m fairly sure that means it would need to be a Seq2Seq LM; the issue is that the Seq2Seq fine-tuning classes don’t support CodeGen, since it’s a causal LM (a minimal sketch of what I’m hitting is below). How can I fine-tune this model to perform a different task while retaining the model’s knowledge through transfer learning? I haven’t come across any literature on this.
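For reference, here’s a rough sketch of the problem (the checkpoint name is just the variant I’ve been experimenting with, not necessarily the one I’d end up using):

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# "Salesforce/codegen-350M-mono" is just a placeholder for whichever
# CodeGen checkpoint I end up fine-tuning.
tokenizer = AutoTokenizer.from_pretrained("Salesforce/codegen-350M-mono")

# This is where it falls over: CodeGen is a decoder-only (causal LM)
# architecture, so it isn't registered for the Seq2Seq auto class and
# the Seq2Seq fine-tuning setup won't accept it.
model = AutoModelForSeq2SeqLM.from_pretrained("Salesforce/codegen-350M-mono")
```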
Thanks!
John.