I want to implement RLHF in claude sonnet 3.7 model. How can we do this . please anyone can help me in this. Is it possible or not ?
1 Like
Hmm…
Isn’t Claude a proprietary model?
I don’t think it can be fine-tuned.