Pipeline device issue, torch_xla generation() bug, flax models malloc errors
|
|
0
|
61
|
April 21, 2024
|
Config parameters for custom models
|
|
0
|
52
|
April 21, 2024
|
Interface API deployment
|
|
0
|
52
|
April 21, 2024
|
Model Parallism
|
|
0
|
64
|
April 21, 2024
|
Fine tuning a sentence-transformer for cosine sim on 500k sentence pairs without labels-- advice
|
|
2
|
853
|
April 20, 2024
|
I can not download the llama2 model with transformers on gcp
|
|
0
|
83
|
April 19, 2024
|
Batching in "automatic-speech-recognition" pipelines
|
|
1
|
1392
|
April 19, 2024
|
Errors when trying to fine-tune OpenLLaMA using Trainer API
|
|
0
|
80
|
April 19, 2024
|
Getting warning message on creation of WeightedLossTrainer object
|
|
1
|
1383
|
April 19, 2024
|
Concatenating in vision transformer
|
|
0
|
59
|
April 19, 2024
|
PEFT fine-tuning Mistral-7B-Instruct-v0.2 - Warning messages
|
|
0
|
194
|
April 19, 2024
|
Grouping by length makes training loss oscillate and makes evaluation loss worse
|
|
0
|
62
|
April 18, 2024
|
Transformers CausalLM loss is always nan
|
|
0
|
68
|
April 18, 2024
|
Error when increasing max_length for tokenizer - OverflowError: out of range integral type conversion attempted
|
|
0
|
96
|
April 18, 2024
|
Ref_model in DPOTrainer
|
|
0
|
81
|
April 18, 2024
|
Gather Input tensor at index 1 has invalid shape
|
|
1
|
561
|
April 18, 2024
|
Module error not found: "torch.utils._pytree"
|
|
1
|
517
|
April 17, 2024
|
Why is Trainer single-threaded during "Generating split..."?
|
|
0
|
73
|
April 17, 2024
|
Trainer() shows no log for validation loss when using PEFT
|
|
0
|
94
|
April 17, 2024
|
Generating Once for 16 Tokens is Not Same Generating Single Token 16 Times?
|
|
4
|
107
|
April 17, 2024
|
Setting weights as adapter weights
|
|
0
|
46
|
April 17, 2024
|
Would PyTorch's FSDP work with a model loaded using device_map='auto'?
|
|
0
|
60
|
April 17, 2024
|
Triaging cudaErrorIllegalAddress Error
|
|
2
|
1371
|
April 17, 2024
|
Pretrain model not accepting optimizer
|
|
24
|
1680
|
April 16, 2024
|
Trainer freezes/crashes after evaluation step
|
|
6
|
372
|
April 16, 2024
|
How to Modify LLaMA 2 Model for Internal Token Generation Timing
|
|
0
|
98
|
April 16, 2024
|
How Labelled Data is Processed | Transformers Trainer
|
|
10
|
401
|
April 16, 2024
|
Need help to reduce CLIP image embedding time
|
|
0
|
72
|
April 15, 2024
|
Custom config error when model.save_pretrained
|
|
3
|
1440
|
April 15, 2024
|
How to parameter efficient finetune Decoder in encoder-decoder model?
|
|
0
|
68
|
April 15, 2024
|