Q&A evaluation: Mismatch in the number of predictions (775) and references (835)
|
|
4
|
426
|
September 28, 2023
|
Llama-2 7B-hf repeats context of question directly from input prompt, cuts off with newlines
|
|
12
|
4755
|
September 28, 2023
|
Is there way to convert the Donut model to openvino format
|
|
0
|
13
|
September 28, 2023
|
Batched generation_config/kwargs for the `transformers.generation.utils.generate` function
|
|
0
|
18
|
September 28, 2023
|
Convert torch tensor to String Representation Value
|
|
1
|
17
|
September 28, 2023
|
Finetune Llama with PPOTrainer
|
|
1
|
106
|
September 28, 2023
|
How to use llava with huggingface
|
|
2
|
94
|
September 28, 2023
|
Use custom model for mask filling using pipeline
|
|
0
|
24
|
September 27, 2023
|
Trainer and Accelerate
|
|
12
|
439
|
September 27, 2023
|
torch.distributed.elastic.multiprocessing.errors.ChildFailedError
|
|
16
|
13817
|
September 27, 2023
|
Costumizing MASKed tokens
|
|
1
|
57
|
September 27, 2023
|
Target {} is out of bounds
|
|
2
|
5818
|
September 27, 2023
|
Slower train with collator for completion only
|
|
0
|
21
|
September 27, 2023
|
Conversational pipeline by huggingface transformer taking too long to generate output
|
|
0
|
20
|
September 27, 2023
|
Vall-e and Vall-e X implementation
|
|
0
|
18
|
September 27, 2023
|
Llama 2 7B fine-tuned with IA3 errors when performing inference
|
|
1
|
42
|
September 27, 2023
|
Does anyone have an idea how we can run llama2 with multiple GPUs?
|
|
0
|
29
|
September 27, 2023
|
Debugging the compute_loss function for custom dice loss in binary segmentation tasks
|
|
0
|
20
|
September 27, 2023
|
Getting token probabilities of a caption given an image from BLIP2
|
|
1
|
30
|
September 27, 2023
|
Avoid installation of pytorch with transformers for onnx inference
|
|
0
|
23
|
September 26, 2023
|
How does attention key/value caching work with models that have learned absolute position embeddings?
|
|
0
|
27
|
September 26, 2023
|
Error with DataCollator for SpeechT5
|
|
2
|
100
|
September 26, 2023
|
CUDA out of memory when using Trainer with compute_metrics
|
|
19
|
16775
|
September 26, 2023
|
RuntimeError: a leaf Variable that requires grad is being used in an in-place operation
|
|
3
|
85
|
September 25, 2023
|
What is the right way of developing my own model based on a pretrained transformer?
|
|
0
|
39
|
September 25, 2023
|
Rouge-L score in Trainer huggingface
|
|
1
|
73
|
September 25, 2023
|
Time series Prediction: inference process
|
|
0
|
30
|
September 24, 2023
|
Best transformer model to check grammar
|
|
0
|
42
|
September 24, 2023
|
How to evaluate before first training step?
|
|
6
|
1490
|
September 23, 2023
|
How to turn off text streamer to repeat prompt in the output?
|
|
0
|
36
|
September 23, 2023
|