I was looking into techniques to reduce memory usage, and came across quite a few mentions of packing
and padding_free
, for example here. Is there any reason why these techniques are limited to DPOConfig
and SFTConfig
? For example, would I be able to set these parameters in e.g. RewardTrainer
and RewardConfig
? Thanks!
1 Like
It seems that there was a response.