When I use the pipeline API, it crashes Colab with an out-of-memory error (it fills all 25.5 GB of RAM). I think it should be possible to do the inference on a TPU v2, but how do I tell the pipeline to use the TPUs from the start?
from transformers import pipeline

model_name = 'EleutherAI/gpt-j-6B'

# This loads the full fp32 checkpoint (~24 GB of weights for 6B params),
# which seems to be what exhausts Colab's RAM
generator = pipeline('text-generation', model=model_name)
out = generator("I am Harry Potter.", do_sample=True, min_length=50)
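From the GPT-J model card it looks like loading the float16 revision with low_cpu_mem_usage should roughly halve the load-time memory, so something like the sketch below would be my starting point before even getting to the TPU question. Is that the right direction?

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

model_name = 'EleutherAI/gpt-j-6B'

# Half-precision branch of the checkpoint (~12 GB instead of ~24 GB);
# low_cpu_mem_usage avoids materialising a second copy of the weights during loading
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    revision='float16',
    torch_dtype=torch.float16,
    low_cpu_mem_usage=True,
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

generator = pipeline('text-generation', model=model, tokenizer=tokenizer)
out = generator("I am Harry Potter.", do_sample=True, min_length=50)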
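For the TPU part, the only route I can find is PyTorch/XLA. I'm guessing at something like the sketch below, but I don't know whether the pipeline API accepts an XLA device at all, which is really the core of my question (xm.xla_device() is from torch_xla; the device= argument to pipeline is my assumption):

import torch
import torch_xla.core.xla_model as xm
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

device = xm.xla_device()  # a single TPU core

model_name = 'EleutherAI/gpt-j-6B'
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    revision='float16',
    torch_dtype=torch.float16,
    low_cpu_mem_usage=True,
).to(device)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Unverified: does pipeline() accept an XLA torch.device here,
# or does it only take CPU/CUDA devices?
generator = pipeline('text-generation', model=model, tokenizer=tokenizer, device=device)
out = generator("I am Harry Potter.", do_sample=True, min_length=50)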