Tensor shape mismatch error when doing an allgather in distributed training with FSDP

Hi. I suspect this post won't get many replies, so if anyone knows a better place to ask this where I'm more likely to get help, please tell me. I would appreciate it. Thanks!
