About the 🤗Accelerate category
|
|
1
|
2406
|
February 20, 2022
|
How to work with meta tensors?
|
|
1
|
1912
|
April 16, 2025
|
Multi-gpu inference llama-3.2 vision with QLoRA
|
|
4
|
36
|
April 16, 2025
|
BitsandBytes conflict with Accelerate
|
|
6
|
67
|
April 14, 2025
|
Issues with Dataset Loading and Checkpoint Saving using FSDP with HuggingFace Trainer on SLURM Multi-Node Setup
|
|
1
|
37
|
April 7, 2025
|
Meta device error while instantiating model
|
|
5
|
6671
|
April 1, 2025
|
Saving bf16 Model Weights When Using Accelerate+DeepSpeed
|
|
4
|
277
|
March 17, 2025
|
Cannot run multi GPU training on SLURM
|
|
1
|
52
|
March 16, 2025
|
Fp8 error in accelerate test
|
|
1
|
56
|
March 11, 2025
|
Accelerator .prepare() replaces custom DataLoader Sampler
|
|
5
|
1195
|
March 9, 2025
|
Using large dataset with accelerate
|
|
0
|
28
|
March 6, 2025
|
Accelerator.save_state errors out due to timeout. Unable to increase timeout through kwargs_handlers
|
|
5
|
1127
|
March 3, 2025
|
HF accelerate DeepSpeed plugin does not use custom optimizer or scheduler
|
|
2
|
15
|
March 1, 2025
|
Bug on multi-gpu trainer with accelerate
|
|
6
|
195
|
February 18, 2025
|
Accelerate remain stuck on using GPU 5 to perform barrier as devices used by this process are currently unknown. This can potentially cause a hang if this rank to GPU mapping is incorrect. Specify device_ids in barrier() to force use of a particular devic
|
|
1
|
474
|
February 17, 2025
|
Errors when using gradient accumulation with FSDP + PEFT LoRA + SFTTrainer
|
|
2
|
839
|
February 6, 2025
|
Save accelerate model
|
|
4
|
379
|
February 5, 2025
|
Calling other large models at runtime?
|
|
0
|
5
|
February 3, 2025
|
Training using FSDP, qLoRa on multinode
|
|
0
|
35
|
January 29, 2025
|
Are helper methods also in parallel?
|
|
0
|
8
|
January 27, 2025
|
Using device_map='auto' for training
|
|
5
|
33436
|
January 24, 2025
|
ValueError: The model has been loaded with `accelerate` and therefore cannot be moved to a specific device. Please discard the `device` argument when creating your pipeline object
|
|
5
|
162
|
January 20, 2025
|
Problems with hanging process at the end when using dataloaders on each process
|
|
5
|
4395
|
January 1, 2025
|
The used dataset had no length, returning gathered tensors. You should drop the remainder yourself
|
|
4
|
199
|
December 26, 2024
|
Grad Accumulation in FSDP
|
|
1
|
36
|
December 26, 2024
|
AttributeError: 'AcceleratorState' object has no attribute 'distributed_type', Llama 2 70B Fine-tuning, using 'accelerate' on a single GPU
|
|
1
|
994
|
December 25, 2024
|
Cuda Out of Memory with Multi-GPU Accelerate for gemma-2b
|
|
1
|
108
|
December 22, 2024
|
DeepSpeed Zero causes intermittent GPU usage
|
|
1
|
182
|
December 19, 2024
|
Inconsistent SpeechT5 Sinusoidal Positional Embedding weight tensor shape in fine-tuning run sessions
|
|
2
|
27
|
December 17, 2024
|
Problem launching train_dreambooth_flux.py (noob here)
|
|
2
|
83
|
December 16, 2024
|