Beginners

Topic	Replies	Views	Activity
Llama 3 70b in the Chat UI Is Super Slow and Nearly Unusable	2	731	October 4, 2024
Huggingface Jupyter Notebook Login	1	15782	October 8, 2024
Cannot cancel my gated repo request	8	418	April 26, 2025
NER fine-tuning	1	4986	December 20, 2021
Model Parallelism and Pipelining for Model Training	3	3517	April 11, 2024
LM finetuning on domain specific unlabelled data	6	4705	April 21, 2021
Load CLIP pretrained model on GPU	6	8364	March 6, 2024
How do I download llama-2	1	15557	September 8, 2023
How to make GPT2 Tokenizer actually add special tokens	4	3102	February 28, 2025
Force Tokens as opposed to Bad_Word_Ids	1	1551	March 23, 2023
Increasing Perplexity when fine-tuning GPT-2	0	686	November 20, 2020
Summarization pipeline on long text	6	4595	December 14, 2022
Generate sentences from keywords only	4	3045	November 26, 2021
Get Optuna study from hyperparameter-search in Trainer?	1	849	July 14, 2021
How to train a Model for Erotic Story Writing with Explicit Details?	5	4903	June 19, 2025
ReadTimeoutError when loading model	5	4883	October 18, 2024
Using a fine tuned whisper sherpa onnx model to create a android app with flutter	7	1338	February 26, 2025
How to enable set device as GPU with Tensorflow?	3	5945	April 26, 2024
Transform Logits to probabilities doesn't work	4	9430	February 17, 2022
Can't push to a public repo I am an admin of	2	683	June 6, 2025
ValueError: The model did not return a loss from the inputs	1	4644	December 21, 2023
TypeError: forward() got an unexpected keyword argument 'token_type_ids'	3	3283	June 10, 2022
Which model can use to pre-train a BERT model?	1	463	December 22, 2021
Advice on LLMs that can be used directly in Python	1	256	November 15, 2024
Multiple Categories (labels)	4	5117	February 1, 2023
Training Flux Lora Failed	5	1471	January 16, 2025
Model does not exist	5	824	April 14, 2025
Resuming training: There were missing keys in the checkpoint model loaded: ['lm_head.weight']	2	2066	December 1, 2024
Sequence Classification -- Fine Tune?	3	3164	January 31, 2021
Do you train all layers when fine-tuning T5?	7	7022	September 26, 2023
Evaluation without using a Trainer	2	3612	April 16, 2021
Validation VS Test with Transformers Trainer	2	6417	June 6, 2022
Seeking advice on selecting the best OCR model for business card recognition	4	881	March 6, 2025
[Tutorial] Phi-3.5 Fine-tuning	0	3493	August 22, 2024
How Do AI Girlfriend Platforms Balance Text and Voice Training for Deep Emotional Connections?	6	133	September 24, 2025
Custom metrics with extra data?	8	3660	April 12, 2024
How to build a multi-label & multi-class dataset correctly?	4	873	April 18, 2025
Using Huggingface Trainer for custom models	5	4472	May 29, 2023
Getting "Invalid credentials in Authorization header" or "NetworkError when attempting to fetch resource" error when trying to use any text to image model	0	615	February 10, 2025
Using LLM for Data Analytics	1	1374	June 7, 2025
Get all unique labels values in a sorted manner	2	1981	December 4, 2024
I want to merge my PEFT adapter model with the base model and make a fully new model	4	4829	February 5, 2025
Question Regarding trainer arguments:: load_best_model_at_end	2	1964	April 19, 2021
Expected all tensors to be on the same device	3	9505	April 30, 2022
16 GB vs 20 GB graphics card	5	4362	October 21, 2024
Bad Performance Finetuning Llama Chat and Instruct Models on GSM8K	5	1371	December 5, 2024
400 Client Error: Bad Request for url	2	1087	August 25, 2025
Pros and cons of Pre-trained Models	0	592	December 6, 2022
Is 512 token in bert, token or character level?	3	9358	April 4, 2022
Understanding data of dataset_infos.json	2	1917	June 29, 2021