Java version of transformers library? | 1 | 4 | January 30, 2025
Multiple Loss Tracking on Train and Evaluate Steps | 1 | 4 | January 30, 2025
Llama-2 find answer in a transcript | 1 | 13 | January 30, 2025
ModernBertForQuestionAnswering does not exist? | 4 | 50 | January 29, 2025
Setting up my custom device map for a LLM | 3 | 4281 | January 29, 2025
Can Donut model be used to query Multipage documents? | 3 | 1379 | January 29, 2025
Cannot use Hugging Face cache on a read-only filesystem | 3 | 19 | January 29, 2025
PPOTrainer + LoRA and Continued Training | 0 | 16 | January 28, 2025
ValueError: boxes1 must be in [x0, y0, x1, y1] (corner) format | 2 | 14 | January 28, 2025
Convert RT-DETR model to coreml | 3 | 11 | January 27, 2025
Logits from generate and model call different | 2 | 729 | January 26, 2025
Problem generating with T5ForConditionalGeneration on a custom task | 2 | 10 | January 26, 2025
GPTQ quantization on Custom dataset | 4 | 530 | January 24, 2025
The Best Approach for Weighted Multilabel Classification | 1 | 17 | January 24, 2025
How can I replace modules in a pretrained model? | 4 | 3911 | January 24, 2025
RuntimeError: result type Float can't be cast to the desired output type Long | 1 | 16 | January 24, 2025
Difference in model prediction before saving and after loading | 4 | 212 | January 23, 2025
Upgrading to transformers 4.9.1? | 3 | 658 | January 23, 2025
AI-Generated Text Detection: Is There a Feature in transformers? | 2 | 32 | January 22, 2025
Outputs change if re-using KVCache (past_key_values) for model.forward and generation | 5 | 18 | January 22, 2025
The state_dict export by peft.get_peft_model_state_dict doesn't contain adapter name | 7 | 10 | January 22, 2025
torch.distributed.elastic.multiprocessing.errors.ChildFailedError | 19 | 37301 | January 22, 2025
Seq2SeqTrainer multiple GPUs | 2 | 10 | January 22, 2025
Accessing a model with model.generate via remote only | 1 | 8 | January 22, 2025
GPU utilization almost always 0 during training | 2 | 47 | January 22, 2025
Finetuning llama for classification | 2 | 780 | January 21, 2025
Adding Audio-MAE to Transformers | 1 | 17 | January 21, 2025
Dataset for multilabel classification | 1 | 28 | January 20, 2025
Having the 'The model did not return a loss from the inputs, only the following keys: logits.' error only when predict_with_generate = True | 2 | 27 | January 20, 2025
Fine-Tuning a Text2Text Model using different tokenizer | 5 | 27 | January 20, 2025