wandb.watch in the Accelerate library

When I’m using wandb, their documentation for the PyTorch integration suggests a call to the watch method:

wandb.watch(my_model, log='all', log_freq=8)

With the following environment variable set:

export WANDB_WATCH="all"

I’m able to record the gradients and the parameter values throughout training. Is there a way to record the gradients and parameter values throughout training when using the Accelerate library?

Did you try calling wandb.watch() just after initialising the wandb run in Accelerate? It might still work as long as it’s called before you start logging in your script.


In code, what @morgan means would look something like this:

import wandb
from accelerate import Accelerator

accelerator = Accelerator(log_with="wandb")
accelerator.init_trackers("my_projectname")
wandb.watch(my_model, log="all", log_freq=8)

@morgan @muellerzr That works perfectly! I tested it out and you can call wandb.watch() even right before the training loop starts (i.e. before any metrics are logged to wandb).
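
For anyone who finds this later, here’s a minimal end-to-end sketch of that placement. The toy model and data are just placeholders to show where everything goes, and the is_main_process guard is my own addition, on the assumption that Accelerate only initialises the wandb run on the main process:

import torch
import torch.nn.functional as F
import wandb
from accelerate import Accelerator

accelerator = Accelerator(log_with="wandb")
accelerator.init_trackers("my_projectname")

# toy model and data, just to show where everything goes
model = torch.nn.Linear(10, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
dataset = torch.utils.data.TensorDataset(torch.randn(64, 10), torch.randn(64, 1))
dataloader = torch.utils.data.DataLoader(dataset, batch_size=8)

model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

# watch gradients and parameters; any point before the first log call works,
# but only the main process has an active wandb run
if accelerator.is_main_process:
    wandb.watch(model, log="all", log_freq=8)

for step, (x, y) in enumerate(dataloader):
    optimizer.zero_grad()
    loss = F.mse_loss(model(x), y)
    accelerator.backward(loss)
    optimizer.step()
    accelerator.log({"train_loss": loss.item()}, step=step)

accelerator.end_training()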


One last thing. I noticed that if I run Accelerate on multiple GPUs (say 3) and choose wandb as my tracker, it will output 3 files to sync to wandb. Is there a way to somehow aggregate all those files into one to get a single view of the entire training run?


@aclifton314 we’re aware of this and will be working on it. For now, you could pass a group argument to the wandb init kwargs, then group by this value in the UI: Group Runs - Documentation
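
For the concrete call: assuming your version of Accelerate supports the init_kwargs argument of init_trackers (a per-tracker dict of keyword arguments forwarded to wandb.init), the group value can be passed through like this ("my_group" is a placeholder):

from accelerate import Accelerator

accelerator = Accelerator(log_with="wandb")
accelerator.init_trackers(
    "my_projectname",
    init_kwargs={"wandb": {"group": "my_group"}},
)

All runs started with the same group value can then be grouped together in the wandb UI.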
