How can I do few-shot learning with the transformers library and GPT-J-6B?
I found the contents of the following link helpful, but it only describes few-shot learning through the API.
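For the prompt side of this, few-shot learning with a causal model like GPT-J usually just means putting a handful of solved examples into the prompt text and leaving the last answer blank for the model to complete. Here is a minimal sketch of that formatting step only. The function name `build_few_shot_prompt`, the `Review`/`Sentiment` labels, the `###` separator, and the sample reviews are all made up for illustration; nothing here comes from the transformers library, and this part does not load the model.

```python
from typing import List, Tuple


def build_few_shot_prompt(
    examples: List[Tuple[str, str]],
    query: str,
    input_label: str = "Review",       # hypothetical label, pick what fits your task
    output_label: str = "Sentiment",   # hypothetical label
    separator: str = "###",            # hypothetical separator between examples
) -> str:
    """Format solved examples plus one unsolved query as a single prompt string."""
    # One block per solved example: the input followed by its known answer.
    blocks = [
        f"{input_label}: {text}\n{output_label}: {label}"
        for text, label in examples
    ]
    # The final block has the same shape, but the answer is left empty
    # so that the model continues the text from there.
    blocks.append(f"{input_label}: {query}\n{output_label}:")
    return f"\n{separator}\n".join(blocks)


if __name__ == "__main__":
    demo = build_few_shot_prompt(
        [("I loved it", "positive"), ("Terrible", "negative")],
        "Not bad at all",
    )
    print(demo)
```

The resulting string is what you would tokenize and pass to `model.generate`. Keeping the separator consistent also gives you something to cut the completion at, since the model tends to keep writing further examples after it has answered.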
Thanks for providing the code. When I run it, I get the following warnings and error:
The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results.
Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation.
**RuntimeError**: "LayerNormKernelImpl" not implemented for 'Half'
Can you please help me with these?
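The first two messages are warnings, not errors. They usually appear when only `input_ids` is passed to `generate`; passing the tokenizer's full output (which includes `attention_mask`) and setting `pad_token_id` explicitly should silence them. The `RuntimeError` typically means the model weights are in half precision (float16) while the model runs on the CPU, where that PyTorch build has no float16 LayerNorm kernel. The two usual ways out are to move the model to a GPU or to load it in float32. Below is a rough sketch under those assumptions, since I can't see how the model was loaded in your run. The helper names `pick_device_and_dtype` and `generate_completion` are invented for this example; the transformers and torch calls are the standard ones.

```python
from typing import Tuple


def pick_device_and_dtype(cuda_available: bool) -> Tuple[str, str]:
    """Use float16 only on a GPU; fall back to float32 on the CPU."""
    return ("cuda", "float16") if cuda_available else ("cpu", "float32")


def generate_completion(prompt: str, max_new_tokens: int = 20) -> str:
    # Heavy imports live inside the function so the helper above
    # can be used without torch or transformers installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    device, dtype_name = pick_device_and_dtype(torch.cuda.is_available())
    model_id = "EleutherAI/gpt-j-6B"

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=getattr(torch, dtype_name),  # float32 on CPU avoids the 'Half' error
    ).to(device)

    # The tokenizer returns both input_ids and attention_mask;
    # unpacking the whole dict passes the mask to generate().
    inputs = tokenizer(prompt, return_tensors="pt").to(device)

    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=False,
        pad_token_id=tokenizer.eos_token_id,  # set explicitly to avoid the pad token warning
    )

    # Drop the prompt tokens and decode only the newly generated part.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Keep in mind that GPT-J-6B in float32 needs roughly 24 GB of memory for the weights alone, so the CPU route only works on a machine with plenty of RAM, and generation will be slow.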