Chapter 1 questions

Hi @Pizofreude,

I just published two blog posts about recent RL algorithms for reasoning tasks, such as GRPO and Dr. GRPO.
You can find them on Medium:

I hope you find them helpful in some way.

Thank you,
Jen
