Hanging on prediction

Aron · May 13, 2022, 9:56am

I have trained a RoBERTa-based model using the TFAutoModelForSequenceClassification class, and I now want to use it to make predictions. I am used to Keras, so I’m using the TensorFlow version above, and to predict I call model.predict on the tokenized input sentences.
The issue is that this works fine on a dataset of fewer than about 1,500 samples, but as soon as I go a little above that (I actually want to run this on ~5 million samples), it hangs on the model.predict line. I have tried various batch sizes from 32 to 512; this didn’t change anything.
Tokenization works fine for the full dataset, and the only output I get from model.predict is these two probably unrelated warnings, which I also get when I run it on a small dataset where it doesn’t hang:

tensorflow/core/platform/profile_utils/cpu_utils.cc:128] Failed to get CPU frequency: 0 Hz
tensorflow/core/grappler/optimizers/custom_graph_optimizer_registry.cc:113] Plugin optimizer for device_type GPU is enabled.

The full line is, for instance, model.predict(dataset, batch_size=64), where type(dataset) is tensorflow.python.data.ops.dataset_ops.PrefetchDataset.
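To narrow down where it stalls, one option is to run the prediction in chunks and print progress before each one. The sketch below is not a fix, only a way to see how far the run gets. The names predict_in_chunks, iter_chunks and predict_fn are hypothetical helpers for illustration and are not part of Keras or Transformers; predict_fn stands in for whatever callable tokenizes a chunk and calls model.predict on it.

```python
from typing import Callable, Iterator, List, Sequence


def iter_chunks(items: Sequence, chunk_size: int) -> Iterator[Sequence]:
    """Yield consecutive slices of `items` with at most `chunk_size` elements."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(items), chunk_size):
        yield items[start:start + chunk_size]


def predict_in_chunks(
    predict_fn: Callable[[Sequence], Sequence],
    items: Sequence,
    chunk_size: int = 1000,
) -> List:
    """Call `predict_fn` on one chunk at a time and collect the results.

    `predict_fn` is a hypothetical stand-in for the real prediction call.
    Progress is printed before each chunk, so if the process hangs, the
    last printed line shows which chunk it hung on.
    """
    results: List = []
    for index, chunk in enumerate(iter_chunks(items, chunk_size)):
        print(f"chunk {index}: predicting {len(chunk)} samples", flush=True)
        results.extend(predict_fn(chunk))
    return results


if __name__ == "__main__":
    # Dummy predict function so the sketch runs without a model.
    doubled = predict_in_chunks(lambda chunk: [x * 2 for x in chunk],
                                list(range(2500)), chunk_size=1000)
    print(len(doubled))
```

If every chunk below the ~1500 threshold completes but a single larger chunk does not, that would point at the size of one predict call rather than at the total amount of data.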

Any idea what I’m doing wrong?
