Llama 3 70b in the Chat UI Is Super Slow and Nearly Unusable
|
|
2
|
731
|
October 4, 2024
|
Huggingface Jupyter Notebook Login
|
|
1
|
15782
|
October 8, 2024
|
Cannot cancel my gated repo request
|
|
8
|
418
|
April 26, 2025
|
NER fine-tuning
|
|
1
|
4986
|
December 20, 2021
|
Model Parallelism and Pipelining for Model Training
|
|
3
|
3517
|
April 11, 2024
|
LM finetuning on domain specific unlabelled data
|
|
6
|
4705
|
April 21, 2021
|
Load CLIP pretrained model on GPU
|
|
6
|
8364
|
March 6, 2024
|
How do I download llama-2
|
|
1
|
15557
|
September 8, 2023
|
How to make GPT2 Tokenizer actually add special tokens
|
|
4
|
3102
|
February 28, 2025
|
Force Tokens as opposed to Bad_Word_Ids
|
|
1
|
1551
|
March 23, 2023
|
Increasing Perplexity when fine-tuning GPT-2
|
|
0
|
686
|
November 20, 2020
|
Summarization pipeline on long text
|
|
6
|
4595
|
December 14, 2022
|
Generate sentences from keywords only
|
|
4
|
3045
|
November 26, 2021
|
Get Optuna study from hyperparameter-search in Trainer?
|
|
1
|
849
|
July 14, 2021
|
How to train a Model for Erotic Story Writing with Explicit Details?
|
|
5
|
4903
|
June 19, 2025
|
ReadTimeoutError when loading model
|
|
5
|
4883
|
October 18, 2024
|
Using a fine tuned whisper sherpa onnx model to create a android app with flutter
|
|
7
|
1338
|
February 26, 2025
|
How to enable set device as GPU with Tensorflow?
|
|
3
|
5945
|
April 26, 2024
|
Transform Logits to probabilities doesn't work
|
|
4
|
9430
|
February 17, 2022
|
Can't push to a public repo I am an admin of
|
|
2
|
683
|
June 6, 2025
|
ValueError: The model did not return a loss from the inputs
|
|
1
|
4644
|
December 21, 2023
|
TypeError: forward() got an unexpected keyword argument 'token_type_ids'
|
|
3
|
3283
|
June 10, 2022
|
Which model can use to pre-train a BERT model?
|
|
1
|
463
|
December 22, 2021
|
Advice on LLMs that can be used directly in Python
|
|
1
|
256
|
November 15, 2024
|
Multiple Categories (labels)
|
|
4
|
5117
|
February 1, 2023
|
Training Flux Lora Failed
|
|
5
|
1471
|
January 16, 2025
|
Model does not exist
|
|
5
|
824
|
April 14, 2025
|
Resuming training: There were missing keys in the checkpoint model loaded: ['lm_head.weight']
|
|
2
|
2066
|
December 1, 2024
|
Sequence Classification -- Fine Tune?
|
|
3
|
3164
|
January 31, 2021
|
Do you train all layers when fine-tuning T5?
|
|
7
|
7022
|
September 26, 2023
|
Evaluation without using a Trainer
|
|
2
|
3612
|
April 16, 2021
|
Validation VS Test with Transformers Trainer
|
|
2
|
6417
|
June 6, 2022
|
Seeking advice on selecting the best OCR model for business card recognition
|
|
4
|
881
|
March 6, 2025
|
[Tutorial] Phi-3.5 Fine-tuning
|
|
0
|
3493
|
August 22, 2024
|
How Do AI Girlfriend Platforms Balance Text and Voice Training for Deep Emotional Connections?
|
|
6
|
133
|
September 24, 2025
|
Custom metrics with extra data?
|
|
8
|
3660
|
April 12, 2024
|
How to build a multi-label & multi-class dataset correctly?
|
|
4
|
873
|
April 18, 2025
|
Using Huggingface Trainer for custom models
|
|
5
|
4472
|
May 29, 2023
|
Getting "Invalid credentials in Authorization header" or "NetworkError when attempting to fetch resource" error when trying to use any text to image model
|
|
0
|
615
|
February 10, 2025
|
Using LLM for Data Analytics
|
|
1
|
1374
|
June 7, 2025
|
Get all unique labels values in a sorted manner
|
|
2
|
1981
|
December 4, 2024
|
I want to merge my PEFT adapter model with the base model and make a fully new model
|
|
4
|
4829
|
February 5, 2025
|
Question Regarding trainer arguments:: load_best_model_at_end
|
|
2
|
1964
|
April 19, 2021
|
Expected all tensors to be on the same device
|
|
3
|
9505
|
April 30, 2022
|
16 GB vs 20 GB graphics card
|
|
5
|
4362
|
October 21, 2024
|
Bad Performance Finetuning Llama Chat and Instruct Models on GSM8K
|
|
5
|
1371
|
December 5, 2024
|
400 Client Error: Bad Request for url
|
|
2
|
1087
|
August 25, 2025
|
Pros and cons of Pre-trained Models
|
|
0
|
592
|
December 6, 2022
|
Is 512 token in bert, token or character level?
|
|
3
|
9358
|
April 4, 2022
|
Understanding data of dataset_infos.json
|
|
2
|
1917
|
June 29, 2021
|