How to get "EleutherAI/gpt-j-6B" working?

Marcin · August 23, 2021, 11:52pm

I’m trying to run the EleutherAI/gpt-j-6B model, but with no luck. The code

model = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-j-6B")

returns the following error:

Traceback (most recent call last):
  File "gptjtest.py", line 18, in <module>
    model = AutoModelForCausalLM.from_pretrained("gpt-j-6B")
  File "/home/marcin/miniconda3/envs/py37/lib/python3.7/site-packages/transformers/models/auto/auto_factory.py", line 383, in from_pretrained
    pretrained_model_name_or_path, return_unused_kwargs=True, **kwargs
  File "/home/marcin/miniconda3/envs/py37/lib/python3.7/site-packages/transformers/models/auto/configuration_auto.py", line 514, in from_pretrained
    config_class = CONFIG_MAPPING[config_dict["model_type"]]
  File "/home/marcin/miniconda3/envs/py37/lib/python3.7/site-packages/transformers/models/auto/configuration_auto.py", line 263, in __getitem__
    raise KeyError(key)
KeyError: 'gptj'

I’ve tried transformers version 4.9.2 as well as the latest 4.10.0.dev0 from github trunk. Apparently there is no model_type of type gptj. Do I need to add it somehow?

nielsr · August 24, 2021, 7:01am

Hi, GPT-J-6B is not added yet to the library. It will be soon though: GPT-J-6B by StellaAthena · Pull Request #13022 · huggingface/transformers · GitHub

Marcin · August 24, 2021, 8:06pm

Thanks for your answer! Thanks to you, I found the right fork and got it working for the meantime.

Maybe it would be beneficial to include information about the version of the library the models run with? (possibly an extension of the huggingface web interface`).

Joy-Lunkad · August 31, 2021, 5:54pm

Hi, Thank you for linking the right fork! I am new to hugging face and I can’t figure out how to get it working as you did. Could you please point me in the right direction where I could learn how to do it?

Marcin · August 31, 2021, 6:45pm

this worked for me:

uninstall previous version:
pip uninstall transformers
install the fork:
pip install git+https://github.com/StellaAthena/transformers
use the model:
model = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-j-6B")

just remember, this model needs 24GB memory.

Topic		Replies	Views
Keyerror when trying to download GPT-J-6B checkpoint Beginners	2	1700	September 29, 2021
The model EleutherAI/gpt-j-6B is too large to be loaded automatically 🤗Hub	1	1474	August 22, 2024
Issues running GPT-J-6B Beginners	1	1120	January 31, 2023
Does GPT-J support api access? Beginners	1	543	October 26, 2021
Is it possible to get logits when using gpt-j in float16 precision Models	0	369	March 13, 2023

How to get "EleutherAI/gpt-j-6B" working?

Related topics