PatchTST issue adding independent variables to the model that help predict one target variable
|
|
0
|
132
|
March 21, 2024
|
Converting CLIPModel to VisionTextDualEncoderModel
|
|
1
|
166
|
March 21, 2024
|
How to use peft+base merged models in offline mode?
|
|
3
|
1132
|
March 21, 2024
|
Disparity between output from `forward` and `generate` for greedy search (using Whisper)
|
|
3
|
1356
|
August 11, 2024
|
Using trasnsformer to get image features
|
|
3
|
3365
|
March 20, 2024
|
Potential error in the documentation relating to Deberta-v2 position_biased_input
|
|
2
|
113
|
March 20, 2024
|
How to train SETFIT on a dataset like NLI
|
|
0
|
110
|
March 20, 2024
|
BERT for text classification
|
|
0
|
445
|
March 20, 2024
|
Which transformers version will support this?
|
|
0
|
80
|
March 20, 2024
|
Tokenizer is not defined
|
|
5
|
11316
|
March 19, 2024
|
[URGENT] Issues with Training RoBERTa Model for Text Prediction with Fill Mask Task
|
|
6
|
220
|
March 19, 2024
|
Problem "Target size must be the same as input size "
|
|
0
|
306
|
March 19, 2024
|
Seq2SeqTrainer produces error during validation when using T5
|
|
0
|
137
|
March 18, 2024
|
Unexpected input type after export
|
|
0
|
115
|
March 18, 2024
|
ValueError: Expected input batch_size to match target batch_size in Token Classification
|
|
8
|
4342
|
March 17, 2024
|
Deepspeed zero-2 cpu offloading killing process = -9 error
|
|
1
|
1825
|
March 17, 2024
|
Can't push model to model hub
|
|
1
|
597
|
March 17, 2024
|
Device while using pipeline
|
|
0
|
80
|
March 16, 2024
|
No instructions in documentationTo train a new IDEFICS model from scratch
|
|
0
|
108
|
March 16, 2024
|
Fine-tuning for translation with facebook mbart-large-50
|
|
1
|
1733
|
March 16, 2024
|
Tokenizer train_new_from_iterator hanging for several models
|
|
0
|
153
|
March 16, 2024
|
I am following a hugging face guide for fine tuning whisper but I run into error when training
|
|
0
|
171
|
March 15, 2024
|
Is it ok to have max_length greater than context_length of the model
|
|
0
|
337
|
March 15, 2024
|
Release timeline for 4.39.0 / mamba?
|
|
0
|
210
|
March 14, 2024
|
Error while using LILT model "index out of range in self"
|
|
5
|
703
|
March 14, 2024
|
Quantizing a model on M1 Mac for qlora
|
|
0
|
1719
|
March 14, 2024
|
`seq_classif_dropout = 0.2` what is the use of adding dropout after the classification network
|
|
0
|
106
|
March 14, 2024
|
Conceptual question: Early loading of the model defeats the purpose of deepspeed!
|
|
0
|
158
|
March 14, 2024
|
How to fine-tune a Mistral-7B model for machine translation?
|
|
1
|
362
|
March 13, 2024
|
Customizing model architecture from predefined models
|
|
0
|
362
|
March 13, 2024
|