Training DeiT with a new image size
|
|
0
|
13
|
July 25, 2024
|
PipelineIterator Issue
|
|
1
|
180
|
July 25, 2024
|
The role of the bf16 arguments in SFTConfig
|
|
0
|
473
|
July 25, 2024
|
Running model.generate() in DeepSpeed training
|
|
2
|
544
|
July 25, 2024
|
DoRA training taking 8x time? Why?
|
|
0
|
67
|
July 24, 2024
|
Maximum Recursion Depth Error
|
|
0
|
126
|
July 24, 2024
|
How to correctly freeze some of the Wav2Vec2-Bert's layers?
|
|
0
|
98
|
July 24, 2024
|
Wav2Vec2 Loss Function Question
|
|
1
|
209
|
July 24, 2024
|
RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got MPSFloatType instead (while checking arguments for embedding)
|
|
1
|
1422
|
July 24, 2024
|
Slow speed with large context
|
|
0
|
13
|
July 24, 2024
|
Finetuning T5 with Input Embeddings
|
|
0
|
35
|
July 24, 2024
|
ValueError: Using fsdp only works in distributed training
|
|
6
|
2124
|
July 24, 2024
|
Unable to create tensor, you should probably activate truncation and/or padding with 'padding=True' 'truncation=True'
|
|
2
|
597
|
July 24, 2024
|
How to set wandb project name for trainer?
|
|
2
|
9178
|
July 23, 2024
|
RuntimeError: Error building extension 'cpu_adam'
|
|
4
|
5286
|
July 23, 2024
|
Use finetuned model for feature extraction
|
|
0
|
62
|
July 23, 2024
|
Past dynamic features in Time Series Transformer
|
|
0
|
33
|
July 22, 2024
|
Getting error in importing TFTrainer
|
|
1
|
1037
|
July 22, 2024
|
Mask2Former not performing as expected
|
|
8
|
2527
|
July 22, 2024
|
Saving checkpoints when using DeepSpeed is taking abnormally long
|
|
0
|
183
|
July 22, 2024
|
AutoModel vs PreTrainedModel difference/relationship
|
|
0
|
62
|
July 21, 2024
|
AutoModel.from_pretrained vs PreTrainedModel.from_pretrained
|
|
0
|
243
|
February 2, 2024
|
Text to structure: a way to standardize outputs
|
|
3
|
3820
|
July 21, 2024
|
Finetuning Whisper: attention mask not set and cannot be inferred
|
|
4
|
5789
|
July 20, 2024
|
Wav2Vec2 CUDA OOM with distributed training
|
|
0
|
5
|
July 20, 2024
|
How can I see the installed version of transformers?
|
|
4
|
44544
|
July 20, 2024
|
Finetune only certain embeddings
|
|
0
|
15
|
July 19, 2024
|
AssertionError: Attempted unscale_ but _scale is None. This may indicate your script did not use scaler.scale(loss or outputs) earlier in the iteration
|
|
0
|
316
|
July 19, 2024
|
TimeSeriesTransformer - mat1 and mat2 shapes cannot be multiplied
|
|
12
|
3921
|
July 18, 2024
|
Understanding DataCollation
|
|
0
|
18
|
July 18, 2024
|