I am doing a masked language modelling task with a DistilBERT model.
This is the tokenizer that I trained:
```python
import transformers as t

tokenizer = t.PreTrainedTokenizerFast(
    tokenizer_file='path/to/tokenizer.json',
    unk_token="[UNK]",
    pad_token="[PAD]",
    cls_token="[CLS]",
    sep_token="[SEP]",
    mask_token="[MASK]",
)
```
I am also using the Hugging Face `datasets` library to preprocess my data in the standard way shown in this tutorial: Fine-tuning a masked language model - Hugging Face Course.
I’m not getting any errors and the model is training well.
But my complaint is that the training is not leveraging the GPU on my machine (I confirmed this in Task Manager, and training a single epoch takes about 3 hours). I even tried enabling GPU access from the NVIDIA Control Panel, but it did not help. I have an NVIDIA GTX 1650, with CUDA 12.0 reported by the `nvidia-smi` command in the CLI.
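In case it helps with diagnosis, here is a minimal check I can run (assuming a PyTorch backend; note that `nvidia-smi` reports the driver's CUDA version, which can differ from the CUDA version the installed PyTorch build was compiled against):

```python
import torch

# If this prints False, the Trainer will silently fall back to the CPU,
# which would explain the 3-hour epochs.
print("CUDA available:", torch.cuda.is_available())

# CUDA version this PyTorch build was compiled with (None for CPU-only builds,
# e.g. a plain `pip install torch` on Windows).
print("Torch CUDA build:", torch.version.cuda)

# Device that Trainer-style code would pick up automatically.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
print("Training device:", device)
```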
Please help me understand my mistake.