Training deit with new size of image
|
|
0
|
12
|
July 25, 2024
|
PipelineIterator Issue
|
|
1
|
173
|
July 25, 2024
|
The role of the bf16 arguments in SFTConfig
|
|
0
|
350
|
July 25, 2024
|
Running model.generate() in deep speed training
|
|
2
|
523
|
July 25, 2024
|
Dora training taking 8x time? Why?
|
|
0
|
61
|
July 24, 2024
|
Maximum Recursion Depth Error
|
|
0
|
118
|
July 24, 2024
|
How to correctly freeze some of the Wav2Vec2-Bert's layers?
|
|
0
|
79
|
July 24, 2024
|
Wav2Vec2 Loss Function Question
|
|
1
|
160
|
July 24, 2024
|
RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got MPSFloatType instead (while checking arguments for embedding)
|
|
1
|
1334
|
July 24, 2024
|
Slow speed with large context
|
|
0
|
12
|
July 24, 2024
|
Finetuning T5 with Input Embeddings
|
|
0
|
26
|
July 24, 2024
|
ValueError: Using fsdp only works in distributed training
|
|
6
|
2036
|
July 24, 2024
|
Unable to create tensor, you should probably activate truncation and/or padding with 'padding=True' 'truncation=True'
|
|
2
|
460
|
July 24, 2024
|
How to set wandb project name for trainer?
|
|
2
|
8723
|
July 23, 2024
|
RuntimeError: Error building extension 'cpu_adam'
|
|
4
|
5183
|
July 23, 2024
|
Use finetuned model for feature extraction
|
|
0
|
59
|
July 23, 2024
|
Past dynamic features in Time Series Transformer
|
|
0
|
31
|
July 22, 2024
|
Getting error in importing TFTrainer
|
|
1
|
990
|
July 22, 2024
|
Mask2Former not performing as expected
|
|
8
|
2386
|
July 22, 2024
|
Saving checkpoints when using DeepSpeed is taking abnormally long
|
|
0
|
157
|
July 22, 2024
|
AutoModel vs PreTrainedModel difference/relationship
|
|
0
|
58
|
July 21, 2024
|
AutoModel.from_pretrained vs PreTrainedModel.from_pretrained
|
|
0
|
233
|
February 2, 2024
|
Text to structure: a way to standardize outputs
|
|
3
|
3325
|
July 21, 2024
|
Finetuning whisper attention mask not set and canot be inferred
|
|
4
|
5279
|
July 20, 2024
|
Wav2vec2 CUDA OOM with distributed training
|
|
0
|
4
|
July 20, 2024
|
How can I see the installed version of transformers
|
|
4
|
42764
|
July 20, 2024
|
Finetune only certain embeddings
|
|
0
|
11
|
July 19, 2024
|
AssertionError: Attempted unscale_ but _scale is None. This may indicate your script did not use scaler.scale(loss or outputs) earlier in the iteration
|
|
0
|
274
|
July 19, 2024
|
TimeSeriesTransformer - mat1 and mat2 shapes cannot be multiplied
|
|
12
|
3581
|
July 18, 2024
|
Understanding DataCollation
|
|
0
|
14
|
July 18, 2024
|