Adding Audio-MAE to Transformers
|
|
0
|
1
|
January 21, 2025
|
Outputs change if re-using KVCache (past_key_values) for model.forward and generation
|
|
3
|
6
|
January 21, 2025
|
Dataset for multilabel classification
|
|
1
|
8
|
January 20, 2025
|
Having the 'The model did not return a loss from the inputs, only the following keys: logits.' error only when predict_with_generate = True
|
|
2
|
13
|
January 20, 2025
|
Fine-Tuning a Text2Text Model using different tokenizer
|
|
5
|
22
|
January 20, 2025
|
[Announcement] Generation: Get probabilities for generated output
|
|
63
|
37584
|
January 20, 2025
|
Perhaps your features (`output` in this case) have excessive nesting (inputs type `list` where type `int` is expected)
|
|
19
|
52
|
January 20, 2025
|
Pip install optimum[exporters-tf]
|
|
3
|
13
|
January 18, 2025
|
Pretrained Model for Fine-Tuning has 100% Trainable Parameters
|
|
2
|
22
|
January 17, 2025
|
DONUT: Reading order for pseudo-OCR pre-training task
|
|
0
|
5
|
January 16, 2025
|
SSL Certificate Issue
|
|
7
|
20187
|
January 16, 2025
|
Unable to load a newly trained tokenizer from local files
|
|
4
|
16
|
January 16, 2025
|
Issues Fine Tuning RT-DETR
|
|
1
|
24
|
January 15, 2025
|
Change the classifcation threshold
|
|
2
|
18
|
January 15, 2025
|
Python code for Gemma models
|
|
1
|
22
|
January 15, 2025
|
RuntimeError: Failed to import transformers.models.roberta.modeling_tf_roberta because of the following error (look up to see its traceback): No module named 'keras.engine'
|
|
6
|
4837
|
January 14, 2025
|
Initializing a big model on GPU with random weights
|
|
2
|
11
|
January 14, 2025
|
What is `self.loss_function` in `forward()` of newly released LLM?
|
|
0
|
7
|
January 14, 2025
|
ValueError: Unable to create tensor, you should probably activate truncation and/or padding with 'padding=True' 'truncation=True' to have batched tensors with the same length
|
|
4
|
32624
|
January 13, 2025
|
Qwen Not work anymore
|
|
1
|
15
|
January 13, 2025
|
No matter what I do the HF…
|
|
2
|
23
|
January 13, 2025
|
Expected `tensors` and `new_tensors` to have the same type but found <class 'tuple'> and <class 'torch.Tensor'>
|
|
2
|
10
|
January 12, 2025
|
Fine-tuning an NLLB model for a new language
|
|
7
|
1777
|
January 12, 2025
|
Preparing data for Donut training results in error "ArrowInvalid: offset overflow while concatenating arrays"
|
|
2
|
32
|
January 12, 2025
|
ModernBertForQuestionAnswering does not exist?
|
|
3
|
39
|
January 11, 2025
|
Coherforai I have API but I can't Access
|
|
1
|
9
|
January 10, 2025
|
Llama-2 7B-hf repeats context of question directly from input prompt, cuts off with newlines
|
|
16
|
27450
|
January 10, 2025
|
Mamba2 Cache Position
|
|
3
|
16
|
January 10, 2025
|
Multi-input tag and ,multi-label output for token classification using Bert pretrained model
|
|
1
|
29
|
January 9, 2025
|
TypeError: 'list' object is not callable
|
|
1
|
9
|
January 8, 2025
|