| Topic | Replies | Views | Date |
| --- | --- | --- | --- |
| Issue with LoRA Adapter Loading on Multiple GPUs during Fine-Tuning with Accelerate and SFTTrainer | 3 | 526 | September 18, 2024 |
| What is the correct way to compute metrics while training using Accelerate? | 0 | 20 | October 29, 2024 |
| Evaluation Metrics are not matching with Shuffle = False | 0 | 19 | October 19, 2024 |
| How to specify FSDP config without launching via Accelerate | 3 | 83 | October 18, 2024 |
| Loading a HF Model in Multiple GPUs and Run Inferences in those GPUs | 10 | 8754 | October 16, 2024 |
| Distributed inference: how to store results in a global variable | 2 | 24 | October 16, 2024 |
| Asymmetric Loss Function has no effect in Accelerate | 0 | 18 | October 13, 2024 |
| Restoring the state of the DataLoader using skip_first_batches() after first epoch | 0 | 22 | October 11, 2024 |
| HuggingFacePipeline Llama2 load_in_4bit from_model_id the model has been loaded with `accelerate` and therefore cannot be moved to a specific device | 2 | 6863 | October 9, 2024 |
| Which (and how) Multi GPU strategy to use to train model with longer max_length (Phi-2 fits in Single GPU but qLoRa gives OOM with 512)? | 3 | 1190 | September 20, 2024 |
| Why does Transformer (LLaMa 3.1-8B) give different logits during inference for the same sample when used with single versus multi gpu prediction? | 0 | 63 | September 20, 2024 |
| Accelerate doesn't seem to use my GPU? | 7 | 4621 | September 18, 2024 |
| Accelerator load_state for LM head with tied weights | 0 | 37 | September 16, 2024 |
| Accelerate Distributed Randomly Hangs | 0 | 37 | September 11, 2024 |
| FSDP Auto Wrap does not work using `accelerate` in Multi-GPU Setup | 0 | 151 | September 6, 2024 |
| Learning Rate Scheduler Distributed Training | 6 | 1726 | September 5, 2024 |
| Key errors when trying to load an accelerate-FSDP model checkpoint | 1 | 466 | September 2, 2024 |
| Tensor parallelism for customized model | 0 | 116 | September 2, 2024 |
| FSDP FULL_SHARD: 3GPUs works, 2GPUs hangs at 1st step | 0 | 39 | August 26, 2024 |
| Accelerate + Gemma2 + FSDP | 1 | 104 | August 25, 2024 |
| Accelerate throws CUDA: OOM | 0 | 239 | August 22, 2024 |
| torch.distributed.elastic.multiprocessing.errors.ChildFailedError: | 1 | 411 | August 15, 2024 |
| Loading a model which is saved on multiple nodes using sharded_state_dict? | 0 | 36 | August 13, 2024 |
| Transformers Trainer + Accelerate FSDP: How do I load my model from a checkpoint? | 2 | 10804 | January 17, 2024 |
| Accelerate device error when running evaluation | 0 | 41 | August 12, 2024 |
| Weird behavior when saving checkpoint in DDP | 0 | 35 | August 11, 2024 |
| Multi-GPU Training sometimes working with 2GPU, but never more than 2 | 5 | 2723 | August 8, 2024 |
| GPTBigCode gives garbled output on Nvidia A10G | 1 | 28 | August 5, 2024 |
| Accelerate.save_model() Error all of the sudden | 1 | 89 | August 4, 2024 |
| HF Accelerate uses multiple GPUs even when setting `num_processes` to 1 | 0 | 32 | August 2, 2024 |