Accuracy changes dramatically

burakisikli · November 23, 2020, 9:26pm

Hi,
I tried to fine tune a bert model for text classification task using same parameters(learning rate, warmup step, batch size, number of epoch) in pytorch and tensorflow. If I use tensorflow, the validation accuracy changes dramatically. In pytorch accuracy is around %96, in tensorflow %76. One thing I noticed is the gpu memory usage difference (pytorch: ~12gb, tf ~8gb). Shouldn’t we expect it to be the similar accuracy?

transformers version: 3.5.1
Platform: Linux-4.19.112±x86_64-with-Ubuntu-18.04-bionic
Python version: 3.6.9
PyTorch version (GPU?): 1.7.0+cu101 (True)
Tensorflow version (GPU?): 2.3.0 (True)
Using GPU in script?: Yes
Using distributed or parallel set-up in script?: No

from transformers import TFBertForSequenceClassification

model = TFBertForSequenceClassification.from_pretrained('bert-base-uncased', num_labels = num_labels)

optimizer = tf.keras.optimizers.Adam(learning_rate=lr_schedule) 
model.compile(optimizer=optimizer, loss=model.compute_loss, metrics=['accuracy']) 
history = model.fit(train_dataset.shuffle(1000).batch(32), epochs=epochs, batch_size=32)

Topic		Replies	Views
Pytorch trainer giving worse results than tensorflow Beginners	0	643	January 5, 2023
Accuracy decreasing after saving/reloading my model 🤗Transformers	3	9	July 8, 2025
BERT model is slow in Pytorch 🤗Transformers	5	626	November 30, 2023
Advice to speed and performance 🤗Transformers	4	7220	December 7, 2020
Why training accuracy and test accuracy on train set is significantly different? Beginners	0	1396	February 28, 2022

Accuracy changes dramatically

Related topics