Can I use the "AutoModelForSequenceClassification" class for generative models?
|
|
2
|
740
|
April 15, 2024
|
Looking for exploratory study / best practices for LoRA adapters config (LLM fine-tuning)
|
|
0
|
370
|
April 15, 2024
|
Access feature in custom compute_loss method
|
|
0
|
184
|
April 15, 2024
|
TrOCR model not utilising the GPU even though I specified it
|
|
0
|
311
|
April 15, 2024
|
Import transformers fails; installation issue?
|
|
1
|
1840
|
April 15, 2024
|
Is the Hugging Face course enough for understanding transformers and LLMs?
|
|
2
|
184
|
March 10, 2024
|
When I try to use my fine-tuned causal LM to run inference on a prompt, I get nothing but the last word repeated multiple times
|
|
1
|
510
|
April 14, 2024
|
Solving a tensor size mismatch error
|
|
0
|
308
|
April 14, 2024
|
Padding options for LayoutLM processor
|
|
0
|
145
|
April 14, 2024
|
Help with Sparse LLM Implementation
|
|
0
|
199
|
April 14, 2024
|
Model for image regression
|
|
0
|
206
|
April 13, 2024
|
Inverse normalising entities in Whisper
|
|
2
|
1021
|
April 13, 2024
|
How to set up Trainer for a regression?
|
|
6
|
13744
|
April 13, 2024
|
Fine-tuning BERT with multiple classification heads
|
|
10
|
5416
|
January 19, 2024
|
Remove a named module from a pre-trained model
|
|
0
|
240
|
April 12, 2024
|
Mistral model generates the same embeddings for different input texts
|
|
2
|
336
|
April 12, 2024
|
Loss becomes nan
|
|
0
|
832
|
April 12, 2024
|
Caching encoder state for multiple encoder-decoder `.generate()` calls?
|
|
2
|
237
|
April 12, 2024
|
Trainer API is not working. It's complaining about NumPy deprecation issues
|
|
0
|
136
|
April 11, 2024
|
RuntimeError: CUDA error: device-side assert triggered 4x10
|
|
0
|
176
|
April 11, 2024
|
How to properly UPCAST the model weights to float32?
|
|
2
|
450
|
April 11, 2024
|
Shouldn't RobertaForCausalLM generate something?
|
|
8
|
1417
|
April 11, 2024
|
How many GB of RAM do I need to train DBRX?
|
|
2
|
232
|
April 11, 2024
|
Tensor size error when generating embeddings for documents using pre-trained models
|
|
3
|
509
|
April 11, 2024
|
Search models by tokenizer
|
|
0
|
91
|
April 10, 2024
|
Fine-Tune LoRA adapter starting from existing adapter
|
|
1
|
258
|
April 10, 2024
|
Seeking Clarification: Model Evaluation - Train and Val loss
|
|
3
|
694
|
April 10, 2024
|
Development status of huggingface/tflite-android-transformers and modern alternatives
|
|
0
|
324
|
April 10, 2024
|
Exporting UDOP to ONNX fails
|
|
0
|
458
|
April 8, 2024
|
IndexError: index out of range in self while training a language model from scratch
|
|
0
|
295
|
April 9, 2024
|