Models

Topic	Replies	Views	Activity
T5 Finetuning Tips	48	56053	November 3, 2024
Global Transformer in Llama 3.2 Vision	1	126	October 31, 2024
Low GPU utilization with the Decision Transformer	6	405	October 30, 2024
I am trying to build a text-to-SQL tool with the Hugging Face chatdb/natural-sql-7b model, but it seems to get stuck every time and does not generate any result. Here is my code. Another problem is that it's not working with "cuda". It's showing "torch is not compiled w	3	30	October 30, 2024
AI reasoning capabilities	1	171	October 29, 2024
Replace Causal Mask of T5 to custom mask	3	398	October 29, 2024
Unable to Read Username for 'https://huggingface.co'	4	2618	October 29, 2024
Build error while cloning the repository	1	69	October 29, 2024
Wget timed out in CI/CD pipeline	3	134	October 28, 2024
What are the best 3 text generation models that can be used via API for free with 128k context?	0	113	October 28, 2024
Unsupervised fine tuning mistral 7b	6	2260	October 27, 2024
Help with preparing train data for fine-tuning llama 3.1 instruct model?	0	86	October 27, 2024
ValueError: Unsupported model type mllama	3	371	October 23, 2024
Training memory footprint depends on instantiating method	1	43	October 23, 2024
How to use custom dataset in a model	0	25	October 23, 2024
Assistance Required for fudan-generative-ai/hallo2 Implementation and Model Weight Issues	1	24	October 23, 2024
Torchview/hiddenlayer produces blank nodes in visualisation	2	40	October 23, 2024
Can you help me interpret the results of my hyperparameter sweep for fine-tuning BLIP2-2.7?	0	38	October 22, 2024
AttributeError: 'TimmBackbone' object has no attribute 'model_type'	0	28	October 22, 2024
How to match locserver performance with Hugging face V3	0	30	October 22, 2024
Generate Code documentation from SQR Code	0	11	October 21, 2024
Arabic models timeout Error	1	46	October 19, 2024
Mistral-7B-Instruct-v0.3 vs Mistral-NEMO-12B	2	799	October 18, 2024
How to split PDF document by table of contents	4	647	October 17, 2024
Finetuning mT5 for specific language pair	0	123	October 17, 2024
HuggingFace offline error	1	1717	October 17, 2024
How to find models that work on low memory/CPU edge devices	3	711	October 17, 2024
Llama Introduction	1	107	October 16, 2024
Finetune whisper-tiny in german for tflite runtime	2	143	October 16, 2024
Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation	5	2355	October 16, 2024