Trainer doesn't call compute_metrics during evaluation

Hello!

I have a custom model that I train and would also like to test within the HF environment. However, even though I pass a custom compute_metrics function to my Trainer, it never gets called.

This is my code:

import os

import matplotlib.pyplot as plt
import numpy as np
import seaborn as sns
from transformers import Trainer, TrainingArguments

def plot_covariance_matrix(model_output, config):
    print("Hello World!")
    # Calculate covariance matrices
    cov_matrix_og = np.corrcoef(model_output.target, rowvar=True)
    cov_matrix_reconst = np.corrcoef(model_output.output, rowvar=True)

    # Plot covariance matrix of the data
    fig, axes = plt.subplots(1, 2, figsize=(12, 6))
    sns.heatmap(cov_matrix_og, annot=True, cmap="vlag", ax=axes[0], xticklabels=features, yticklabels=features)
    axes[0].set_title('Covariance of the original data.')

    sns.heatmap(cov_matrix_reconst, annot=True, cmap="vlag", ax=axes[1], xticklabels=features, yticklabels=features)
    axes[1].set_title('Covariance of the reconstructed data.')

    fig.suptitle('Covariance matrices.')
    plt.tight_layout()
    print('Before saving figure path:', os.path.abspath('.'))
    fig.savefig(os.path.join(config['savepath'], config['result_filename']), format='png')

def compute_metrics_baseline(model_output):
    plot_covariance_matrix(model_output, config={
        'result_filename': 'baseline_result.png',
        'savepath': os.path.dirname(os.path.abspath(__file__))
    })
    return {}

testing_args_baseline = TrainingArguments(output_dir="embeddingmodel_test_checkpoints", logging_dir='./baseline_log',
                                          remove_unused_columns=False, evaluation_strategy='epoch',
                                          per_device_eval_batch_size=BATCH_SIZE)

baseline_tester = Trainer(
    model=embeddingModel,
    args=testing_args_baseline,
    eval_dataset=test_dataset,
    data_collator=baseline_collator,
    compute_metrics=compute_metrics_baseline
)

print('Testing baseline model.')
baseline_tester.evaluate()
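From reading around, my understanding is that the Trainer only invokes compute_metrics when its evaluation loop actually gathers labels alongside the predictions. Roughly (my own simplified sketch, not the real transformers source):

```python
def maybe_compute_metrics(compute_metrics, predictions, labels):
    # Simplified version of the gate the Trainer applies in its
    # evaluation loop: metrics are only computed when both a
    # compute_metrics function and gathered labels are present.
    if compute_metrics is not None and labels is not None:
        return compute_metrics((predictions, labels))
    return {}  # compute_metrics is silently skipped otherwise

# If the eval batches never yield a "labels" key, labels stay None
# and compute_metrics is never called:
print(maybe_compute_metrics(lambda p: {"acc": 1.0}, [0, 1], None))  # {}
```

So maybe my collator or model output isn't exposing labels in the way the Trainer expects? I'm not sure how to verify that.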

As you can see, there are already some print statements in there, because at first I thought the figure was simply being saved to a different location than the one I wanted…

But not even the first print statement is reached, since I don't see any output at all.
I checked this topic: Trainer never invokes compute_metrics, and many others, but I still can't find the reason for this.

At this point I'm considering ditching evaluate() altogether, because hand-coding the evaluation would have taken far less time than trying to debug this, so this post is my last resort :frowning: