Topic | Replies | Views | Activity
Model File Lookup by SHA256 Hash | 9 | 173 | June 13, 2025
[Not working] QA inference API and conv-ai | 9 | 865 | February 16, 2021
Please Help! How to properly label RTL ground truth data for fine-tuning/training ViT models | 10 | 589 | September 13, 2023
How to set the Pad Token for meta-llama/Llama-3 Models | 6 | 11369 | August 29, 2024
Flux Diffusers Pipeline's unusual runtime in Google colab | 9 | 478 | January 29, 2025
Unable to Access Gated Model meta-llama/Llama-3.2-1B Despite Approved Access | 10 | 222 | April 8, 2025
Any model that takes in a clean PDF and outputs a JSON of all the fillable fields that should be added to it + coordinates? | 11 | 109 | March 25, 2025
Use Fine Tuned Modal Via Hugging Face | 15 | 142 | January 11, 2025
Flan-T5 / T5: what is the difference between AutoModelForSeq2SeqLM and T5ForConditionalGeneration | 5 | 7284 | February 2, 2023
How to get model size? | 6 | 46153 | July 15, 2023
Huggingface transformers longformer optimizer warning AdamW | 2 | 9589 | April 25, 2022
[Announcement] All model cards will be migrated to hf.co model repos | 5 | 8219 | December 10, 2020
How to combine Image and Text embedding for product similarity | 2 | 16619 | May 6, 2025
TypeError: SentenceTransformerTrainer.compute_loss() got an unexpected keyword argument 'num_items_in_batch' | 6 | 6041 | February 10, 2025
Number of epochs in pre-training BERT | 1 | 11571 | December 13, 2020
Bert question answering model without context | 5 | 11107 | October 1, 2023
(feat Tokenizers): How to make models aware of structuring linebreaks? | 4 | 6411 | August 24, 2023
Saving a model and loading it | 3 | 56902 | July 5, 2024
Best practices to use models requiring flash_attn on Apple silicon macs (or non CUDA)? | 2 | 6413 | August 23, 2024
Difference between CausalLM and LMHeadModel | 1 | 4017 | April 25, 2022
What to do when HuggingFace throws "Can't load tokenizer" | 8 | 51188 | May 5, 2024
How to set Llama-2-Chat prompt context | 2 | 15460 | October 18, 2023
Rate limit reached. You reached free usage limit (reset hourly) | 5 | 9770 | May 14, 2024
Does anyone know how to setup meta's llama? | 0 | 2158 | March 4, 2023
[pegasus] evaluation datasets and build scripts are now available | 0 | 2031 | October 21, 2020
How to prevent LLM from generating multiple rounds of conversation? | 3 | 8921 | February 29, 2024
Fine tuning a TTS model | 0 | 1780 | March 7, 2023
Loading Llama 2 with quantization on M1 MacBooks | 2 | 5366 | December 15, 2023
Llama2 pad token for batched inference | 7 | 15525 | March 31, 2024
Text-generation-inference: "You are using a model of type llama to instantiate a model of type ." | 5 | 7380 | November 3, 2023
Embedding from BLIP2 | 0 | 978 | June 20, 2023
Data did not match any variant of untagged enum PyPreTokenizerTypeWrapper at line 6952 column 3 | 1 | 1168 | July 4, 2024
But is there even a single model working here?! | 4 | 367 | May 10, 2025
Fine-tuning BERT with sequences longer than 512 tokens | 7 | 27358 | April 4, 2022
How to Implement Few-Shot Prompting in LLaMA-2 Chat Model | 4 | 6089 | April 9, 2025
Mistral or LLaMA? | 3 | 3667 | May 1, 2024
Best model for translating English to Japanese | 7 | 2514 | April 29, 2025
How to save and load the custom Hugging face model including config.json file using pytorch | 2 | 7226 | February 16, 2023
Tips for training LongT5 | 0 | 669 | June 29, 2022
Unauthorized 401 | 4 | 885 | February 26, 2025
Unable to Read Username for 'https://huggingface.co' | 4 | 2769 | October 29, 2024
Failed to Import transformers.models | 5 | 23753 | March 20, 2024
Text generation pipeline - output_scores parameter | 1 | 3930 | January 20, 2021
Facebook/opt-30b model inferencing | 3 | 2664 | January 19, 2023
Fine-tuning Whisper for Audio Classification | 6 | 3204 | November 8, 2024
Making llama text generation, deterministic | 1 | 9683 | August 16, 2023
How do you use Beam Search in Whisper correctly? | 3 | 1149 | December 15, 2024
How is the data shifted by one token during CausalLM fine tuning | 4 | 3131 | April 14, 2025
Finetuning options with SAM? | 4 | 5195 | May 11, 2023
Issue with LlaMA-2 Chat Template (and out of date documentation) | 7 | 11859 | November 10, 2023