Parameter groups and GPT2 LayerNorm

Thanks!
That was very quick :zap: