I’m working with DeepSpeed and transformers for distributed inference. The model is LLaMA-7B at FP16, and I’m loading it with `AutoModelForCausalLM.from_config`.
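
For context, this is roughly how I build the model (the model id and tensor-parallel degree below are placeholders, and I've omitted the actual checkpoint loading):

```python
import torch
import deepspeed
from transformers import AutoConfig, AutoModelForCausalLM

# Placeholder model id; actual weight loading (DeepSpeed checkpoint
# json, etc.) is omitted here.
config = AutoConfig.from_pretrained("huggyllama/llama-7b")
model = AutoModelForCausalLM.from_config(config).half()

# Wrap with the DeepSpeed inference engine; mp_size is the
# tensor-parallel degree (placeholder value).
engine = deepspeed.init_inference(model, mp_size=2, dtype=torch.half)
```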
I want to profile the decoding stage, so I need to add `torch.profiler` around decoding and omit prefilling. What I suppose could achieve this is modifying functions in `transformers.generation.utils`, such as `sample()` or `greedy_search()`, but I don't want to change the transformers source code. The monkey-patch sketch below is the kind of thing I have in mind.
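
A rough sketch, assuming a transformers version where `GenerationMixin.greedy_search` is still the method that `generate()` dispatches to:

```python
import torch
from torch.profiler import profile, ProfilerActivity
from transformers.generation.utils import GenerationMixin

# Keep a handle to the original method so the patch can delegate to it.
_orig_greedy_search = GenerationMixin.greedy_search

def profiled_greedy_search(self, *args, **kwargs):
    # Wraps the whole decode loop. Note this still includes the first
    # forward pass (the prefill), which is exactly what I want to omit.
    with profile(activities=[ProfilerActivity.CPU, ProfilerActivity.CUDA]) as prof:
        out = _orig_greedy_search(self, *args, **kwargs)
    prof.export_chrome_trace("greedy_search_trace.json")
    return out

GenerationMixin.greedy_search = profiled_greedy_search
```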
Any idea how I could add `torch.profiler` in `greedy_search()` along those lines, without touching the source? Or is there a better method for profiling the decoding stage?
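
I'm not sure whether this is sound, but one alternative I've been wondering about: since `generate()` evaluates the stopping criteria once per generated token inside the decode loop, maybe a no-op `StoppingCriteria` could serve as a per-step hook to drive a scheduled profiler that skips the first step (which contains the prefill). A sketch of that idea, with the step counts and trace directory as placeholders:

```python
import torch
from torch.profiler import profile, schedule, ProfilerActivity
from transformers import StoppingCriteria, StoppingCriteriaList

class ProfilerStepHook(StoppingCriteria):
    # Uses StoppingCriteria purely as a per-token hook: generate()
    # evaluates it once per decode iteration, so prof.step() advances
    # the profiler schedule by one generated token each time.
    def __init__(self, prof):
        self.prof = prof

    def __call__(self, input_ids, scores, **kwargs):
        self.prof.step()
        return False  # never actually stop generation

prof = profile(
    activities=[ProfilerActivity.CPU, ProfilerActivity.CUDA],
    # wait=1 skips profiler step 0, which contains the prefill forward
    # pass; the warmup/active counts here are placeholders.
    schedule=schedule(wait=1, warmup=1, active=5),
    on_trace_ready=torch.profiler.tensorboard_trace_handler("./decode_trace"),
)
with prof:
    # `model` and `input_ids` come from the setup above.
    outputs = model.generate(
        input_ids,
        max_new_tokens=32,
        stopping_criteria=StoppingCriteriaList([ProfilerStepHook(prof)]),
    )
```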