Topic | Replies | Views | Activity
Trainer being very slow to init training setting group_by_length to True | 1 | 279 | February 1, 2025
Calculating Perplexity for Quantized Llama 3 8B & Mistral 7B Models: Evaluate Library vs. Custom Code? | 3 | 102 | March 16, 2025
Unable to log in via web interface | 7 | 644 | February 3, 2025
Using IterableDataset with Trainer - 'IterableDataset' has no len() | 7 | 14138 | December 17, 2024
Inference without gradient computation? | 2 | 6840 | December 26, 2024
ValueError: Target size (torch.Size([8])) must be the same as input size (torch.Size([8, 8])) | 15 | 12486 | January 9, 2025
Rs-bpe tokenizer [PyPI | Python] - Outperforms tiktoken & tokenizers | 2 | 35 | March 19, 2025
Llama-2 Sequence Classification: Much lower accuracy on inference from checkpoint compared to model | 5 | 5901 | February 20, 2024
NLLB tokenizer multiple target/source languages within a training batch | 5 | 1394 | January 10, 2025
Huggingface-cli login hangs | 3 | 2796 | July 8, 2024
I am completely lost on the hugging face site | 6 | 974 | April 8, 2025
Launch timed out, space was not healthy after 30 min | 14 | 3367 | January 12, 2025
How to create dataset from CSV to training Question answering? | 6 | 2420 | January 20, 2025
Runtime error Exit code: 0. Reason: application does not seem to be initialized Container logs: ===== Application Startup at 2025-03-21 09:57:17 ===== | 0 | 57 | March 21, 2025
How to output loss from model.generate()? | 16 | 5854 | January 7, 2025
Download and load fine-tuned model locally (VS Code) | 3 | 4421 | January 24, 2025
AttributeError: 'AcceleratorState' object has no attribute 'distributed_type', Llama 2 70B Fine-tuning, using 'accelerate' on a single GPU | 1 | 1008 | December 25, 2024
Slurm Issues running accelerate | 1 | 914 | November 28, 2024
Japanese keyword audio dataset | 3 | 259 | April 1, 2025
Pre-training: ValueError: You should supply an encoding or a list of encodings to this method that includes input_ids, but you provided [] | 3 | 3541 | February 4, 2025
Question about PAST KEY VALUES in ONNX format decoder | 0 | 12 | March 21, 2025
504 Gateway Time-out in Inference Endpoints | 3 | 650 | January 23, 2025
LLaVA multi-image input support for inference | 8 | 6985 | August 30, 2024
ASR spell correction | 29 | 8669 | April 24, 2024
TCPA Compliance for Opt-Out | 0 | 9 | February 19, 2025
Try to read arrow files get: Invalid: Not an Arrow file | 3 | 2793 | May 31, 2024
Brain Computer Interface Issue + No Consent | 2 | 7 | February 21, 2025
How to download subset of a dataset scripted | 6 | 5677 | December 7, 2023
Getting cannot import name 'is_npu_available' from 'accelerate.utils' | 2 | 6373 | July 6, 2024
[SOLVED] accelerate.Accelerator(): CUDA error: invalid device ordinal | 11 | 9944 | July 6, 2024