How to obtain latent vectors from model with transformers
|
|
1
|
113
|
April 9, 2024
|
Unpacking transformer's trainer.eval() to see every example's output, loss
|
|
4
|
104
|
April 9, 2024
|
Weighed Loss Function in Regression Task
|
|
1
|
471
|
April 6, 2024
|
Chat bot for Question and answer in csv ,all open source models
|
|
0
|
111
|
April 4, 2024
|
Weight and shape different than the number of channels in input
|
|
0
|
98
|
April 4, 2024
|
Cuda out of memory error
|
|
8
|
26598
|
April 4, 2024
|
Inference after QLoRA fine-tuning
|
|
6
|
3034
|
April 4, 2024
|
LayoutLM data format for bounding box classification
|
|
0
|
97
|
April 3, 2024
|
Efficient Conditional Selection of Data Set Rows
|
|
0
|
73
|
April 3, 2024
|
SAM image size for fine-tuning
|
|
5
|
3410
|
April 3, 2024
|
DPO with Chat Data
|
|
0
|
98
|
April 1, 2024
|
GPT2Tokenizer not putting bos/eos token
|
|
3
|
3948
|
March 31, 2024
|
How to get activation maps of models
|
|
0
|
91
|
March 31, 2024
|
Fine-tuning Mistral/Mixtral for sequence classification on long context
|
|
0
|
778
|
March 30, 2024
|
Finetuning T5 for Summarisation - Poor results
|
|
0
|
88
|
March 30, 2024
|
Docker image: transformers-all-latest-gpu not running
|
|
0
|
182
|
March 30, 2024
|
Replacing the LlamaDecoderLayer Class hugging Face With New LongNet
|
|
0
|
131
|
March 30, 2024
|
CUDA Runtime Error in the Middle of Training
|
|
1
|
114
|
March 30, 2024
|
Training Question/Answer on My Own Codebase
|
|
0
|
69
|
March 29, 2024
|
Dataset download faster
|
|
1
|
76
|
March 29, 2024
|
Error making predictions using LMM (LLaVA) model on multiple GPUs
|
|
0
|
136
|
March 27, 2024
|
Using same instructions for fine-tuning: Is this bad for the model?
|
|
1
|
224
|
March 26, 2024
|
I'm in search of a programmer
|
|
0
|
106
|
March 20, 2024
|
Using a finetuned model for embeddings
|
|
0
|
84
|
March 20, 2024
|
Finding Serverless Inference APIs that support attention outputs (output_attentions = true)
|
|
0
|
91
|
March 19, 2024
|
Question regarding multiple prompt-tuning
|
|
0
|
106
|
March 19, 2024
|
Finetuning LLama2-70B using 4-bit quantization on multi-GPU using Deepspeed ZeRO
|
|
1
|
1579
|
March 19, 2024
|
BERT Fine-tuning for Sequence Classification
|
|
0
|
82
|
March 19, 2024
|
Nested named entity recognition
|
|
2
|
187
|
March 19, 2024
|
Inference API for fine-tuned model not working: No package metadata was found for bitsandbytes
|
|
9
|
883
|
March 18, 2024
|