Is it possible to push to HuggingFace on every checkpoint while training?

@sgugger Thank you for the reply! Actually, what I’ve found is that push_to_hub=True launches a background git push. I think the problem is that sometimes the git push process hangs. Is there any way to debug that git process, to see why it does not finish pushing?

Also, just for context, I can sometimes go into the directory and use git push with success. But also, I’ve found that only the first checkpoint produces a git commit. While trying to debug the background git push, I noticed that sometimes there can be 3 or 4 checkpoints created, but no commits happening for them. So only the first checkpoint gets pushed.

Let me know if there’s any extra info I can provide. I might try and copy one of the run_*.py scripts and writing some custom Python code to solve this problem.