Different intermediate results given different number of epochs
|
|
0
|
132
|
December 20, 2023
|
Whisper encoder
|
|
0
|
150
|
December 20, 2023
|
Embed size 2 in time series transformer
|
|
7
|
691
|
December 19, 2023
|
Performing Whisper's "transcribe" with Transformer pipelines
|
|
2
|
2732
|
December 19, 2023
|
Getting LLaMA tokenizer from meta
|
|
0
|
119
|
December 19, 2023
|
Whisper Model: Validation loss decreasing but WER increasing/constant
|
|
0
|
272
|
December 18, 2023
|
🤗Transformer with Trainer API on TPU VMs and TPU Pods
|
|
0
|
414
|
December 18, 2023
|
Deploy model in hugging face platform
|
|
0
|
262
|
December 18, 2023
|
Flash Attention 2 Error on Mistral Based Model
|
|
0
|
627
|
December 18, 2023
|
Recommended hardware for running LLMs locally
|
|
2
|
34673
|
December 18, 2023
|
Models slow on M1 Pro 16gb
|
|
0
|
737
|
December 18, 2023
|
Datasets: Limit the number of rows?
|
|
4
|
8629
|
December 17, 2023
|
IndexError while training Roberta with a custom tokenizer
|
|
8
|
1165
|
December 17, 2023
|
RuntimeError when training: Expected floating point type for target with class probabilities, got Long
|
|
0
|
713
|
December 17, 2023
|
Handwriting Identification Model
|
|
0
|
179
|
December 16, 2023
|
Adding model and prompt info to generated image
|
|
0
|
257
|
December 16, 2023
|
Training loss changes as we change learning rate
|
|
0
|
300
|
December 16, 2023
|
Training GPT-2 with OSCAR Dataset in Dutch: Seeking Advice
|
|
0
|
160
|
December 16, 2023
|
ERROR: Failed building wheel for tokenizers
|
|
0
|
8549
|
December 16, 2023
|
[Solved]Empty Card When using c4 Dataset during Quantization wiht GPTQ
|
|
0
|
570
|
December 15, 2023
|
How to utilize AWS and VLLM, to make an Api available to a running llm (any opensource model)on an AWS sage maker gpu
|
|
0
|
823
|
December 15, 2023
|
Load_datasets is extremely slow in loading HF datasets
|
|
1
|
2549
|
December 15, 2023
|
Unable to load a FineTuned LLama Model to GPU for inference
|
|
3
|
2990
|
December 15, 2023
|
Using a new model in an older version of Transformers library
|
|
0
|
233
|
December 15, 2023
|
How to give context when translating single words?
|
|
1
|
345
|
December 15, 2023
|
Every space based on meditron-70b gives this same error!
|
|
0
|
232
|
December 15, 2023
|
(Memory) error when trying to use AutoModel.from_pretrained
|
|
0
|
365
|
December 14, 2023
|
Adding a new context to the prompt when generating text
|
|
0
|
1286
|
December 14, 2023
|
How do I change image size and patch size in Tensorflow
|
|
0
|
250
|
December 14, 2023
|
Please guide the on-promise spec for LLM fine-tuning
|
|
0
|
143
|
December 14, 2023
|