Batch size for trainer.predict()


I pass a test dataset to trainer.predict, but it has many samples, so I get an out-of-memory error. Does the library support batched prediction in trainer.predict, or do I have to implement it myself?


You can pass eval_accumulation_steps=xxx to move the predictions to the CPU every xxx steps; this should help.
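To make the idea concrete, here is a minimal pure-Python sketch of the accumulate-then-flush pattern that eval_accumulation_steps stands for: keep recent batch predictions in a device-side buffer and move them to host memory every N steps, so GPU memory never holds all predictions at once. All names below are illustrative stand-ins, not the Trainer's internal API.

```python
def predict_with_accumulation(model, batches, eval_accumulation_steps):
    """Run `model` over `batches`, flushing results every N steps.

    `host_preds` stands in for predictions already moved to the CPU;
    `device_buffer` stands in for predictions still sitting on the GPU.
    """
    host_preds = []
    device_buffer = []
    for step, batch in enumerate(batches, start=1):
        device_buffer.append(model(batch))
        if step % eval_accumulation_steps == 0:
            host_preds.extend(device_buffer)  # "move to CPU"
            device_buffer.clear()
    host_preds.extend(device_buffer)          # flush any remainder
    return host_preds


# Toy usage: a "model" that doubles every value in a batch.
model = lambda batch: [2 * x for x in batch]
batches = [[1, 2], [3, 4], [5, 6]]
preds = predict_with_accumulation(model, batches, eval_accumulation_steps=2)
# → [[2, 4], [6, 8], [10, 12]]
```

The real Trainer does the same thing with tensors and `.cpu()` calls, trading a little host-transfer overhead for bounded GPU memory.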


You can set the batch size manually using trainer.prediction_loop()

Instead of using trainer.predict(test_dataset), you can use a torch DataLoader with trainer.prediction_loop(). Thus, you might change


raw_pred, _, _ = trainer.predict(test_dataset)


to


from torch.utils.data import DataLoader

test_loader = DataLoader(test_dataset, batch_size=64, shuffle=False)
raw_pred, _, _ = trainer.prediction_loop(test_loader, description="prediction")

In transformers 4.20.1, you can set the evaluation batch size through TrainingArguments:

args = TrainingArguments(output_dir='tmp_trainer', per_device_eval_batch_size=16)
trainer = Trainer(model=model, args=args)
predictions = trainer.predict(pred_dataset)

Hi, I tried this method, but the prediction process is killed at 99% without generating the predictions. There are no memory issues. It looks like when I use the trainer.prediction_loop() method, I cannot set the argument predict_with_generate=True; I think this might be causing the problem, but I am not sure. I am very new to working with pre-trained models. Could you please share your insights on what the possible reason for this issue could be?