Chapter 1 questions

bird-of-paradise · May 13, 2025, 12:57am

I just published two blog posts about recent RL algorithms for reasoning tasks such as GRPO and Dr. GRPO.
You can find them on Medium:

I hope you find it helpful in some way.

Thank you,
Jen

Topic		Replies	Views
Chapter 7 questions Course	119	10405	July 10, 2025
Bert2bert translator? 🤗Transformers	6	44	August 28, 2025
Chapter 3 questions Course	149	10516	August 29, 2025
Encoder-Decoder model only generates bos_token's [<s><s><s>] Models	17	3175	December 6, 2022
EncoderDecoderModel for token classification 🤗Transformers	0	193	October 29, 2022