Using Accelerate with DeepSpeed for WNUT Example
|
|
1
|
847
|
July 19, 2023
|
Using Gradio on JupyterHub
|
|
3
|
1142
|
July 19, 2023
|
Why does the Hugging Face Falcon model use model.config.use_cache = False? Why wouldn't it want the decoder to re-use computations during fine-tuning?
|
|
7
|
2692
|
July 19, 2023
|
Is there any music vocals/voice-to-text model?
|
|
0
|
1005
|
July 19, 2023
|
Fine-tuning Stable Diffusion 2 with a custom Dreambooth approach
|
|
0
|
710
|
July 20, 2023
|
Blocks as output
|
|
0
|
288
|
July 20, 2023
|
Long audio input for training?
|
|
0
|
224
|
July 20, 2023
|
Change loss and dataset format with SFTTrainer (TRL & QLoRA )
|
|
0
|
1672
|
July 19, 2023
|
BatchSampler - with trainer
|
|
0
|
191
|
July 20, 2023
|
DataCollator for selecting a random subset and permutation
|
|
0
|
574
|
July 20, 2023
|
Duration of training time trainer api
|
|
1
|
311
|
July 20, 2023
|
How to jit.trace gpt-neo-125M
|
|
3
|
1257
|
July 20, 2023
|
Tracking usage and writing to a file in a Hugging Face Space
|
|
0
|
254
|
July 20, 2023
|
WMT testset score
|
|
0
|
183
|
September 20, 2021
|
Fine-tuning NLLB model
|
|
1
|
2627
|
July 20, 2023
|
Validation loss is none while training using pytorch training loop
|
|
0
|
387
|
July 20, 2023
|
TypeError: Values in `DatasetDict` should be of type `Dataset` but got type '<class 'dict'>' *Solved*
|
|
0
|
1146
|
July 20, 2023
|
Batch aesthetics score predictor with statistics on Gradio
|
|
1
|
863
|
July 20, 2023
|
AI model for Bitcoin blockchain data analysis
|
|
0
|
578
|
July 20, 2023
|
Inference Client Errors
|
|
0
|
182
|
July 20, 2023
|
Dataset blockchain bitcoin
|
|
0
|
506
|
July 20, 2023
|
Finetuning Stable Diffusion inpainting checkpoint
|
|
1
|
711
|
July 20, 2023
|
Pre-trained DeBERTa - Weak MLM performance any hints?
|
|
1
|
275
|
July 21, 2023
|
Sliding Window Approach for Multilabel Classification
|
|
0
|
545
|
July 21, 2023
|
Error of run_glue.py: RuntimeError: CUDA error: device-side assert triggered
|
|
0
|
727
|
July 21, 2023
|
Customization guideline for beginner
|
|
0
|
93
|
July 21, 2023
|
No module named 'optimum.neuron'; 'optimum' is not a package
|
|
2
|
2106
|
July 21, 2023
|
Correct way to load pretrained model
|
|
0
|
574
|
July 21, 2023
|
Using Roberta for Sentence2Vec
|
|
3
|
1255
|
April 11, 2021
|