Best way to use accelerate for large embeddings

ssharpe42 · December 9, 2022, 2:23pm

What is the best way to use accelerate to train huge embedding matrices?

How do we effectively split it into multiple devices and initialize non-empty weights? I want to be able to access a batch of embeddings and move it to the GPU for each step. Would it just be best to initialize on the CPU? Is there a way I can use the strategies that use disk, ram, and GPU?

Topic		Replies	Views
What is the correct way to compute metrics while training using Accelerate? 🤗Accelerate	0	22	October 29, 2024
Loading weights straight to GPU & Training support 🤗Accelerate	0	214	September 18, 2023
Calling other large models at runtime? 🤗Accelerate	0	7	February 3, 2025
Accelerate! I have a query, no actual problem to be solved! Beginners	2	284	August 8, 2023
Am I using Embeddings wrong or is it the wrong approach Beginners	2	122	April 26, 2024

Best way to use accelerate for large embeddings

Related topics