Models

Topic	Replies	Views	Activity
Hiera MAE image reconstruction - visible patch artifacts	1	63	January 7, 2025
Best model to fine-tune for argument (consideration) extraction task?	2	101	November 25, 2024
Flux Diffusers Pipeline's unusual runtime in Google Colab	9	484	January 29, 2025
How to combine pre-trained weights of components from different multimodal LLMs?	2	104	January 8, 2025
Help my model return the expected data	2	136	November 18, 2024
Model Inference API error	7	975	May 7, 2024
Impact of Annotating Occluded Keypoints on Pose Estimation Accuracy	1	34	February 24, 2025
MaziyarPanahi's Mistral 0.2 Merges	4	412	February 17, 2024
Not able to fine-tune (QLoRA) Llama3-Instruct model for CausalLM	2	140	November 14, 2024
Vocab_size value for facebook/w2v-bert-2.0	0	253	November 13, 2024
Gated Repo Permission Still Pending	4	785	November 8, 2024
Data did not match any variant of untagged enum PyPreTokenizerTypeWrapper at line 6952 column 3	1	1169	July 4, 2024
Finetuned LLM model conversion to GGUF - performance drop	4	1761	July 31, 2024
ModernBERT Pretraining using HuggingFace API	3	232	March 17, 2025
Wget timed out in CI/CD pipeline	3	155	October 28, 2024
Training memory footprint depends on instantiating method	1	43	October 23, 2024
Llama 2 access token problem	3	40	February 16, 2025
Torchview/hiddenlayer produces blank nodes in visualisation	2	49	October 23, 2024
Special tokens & Embeddings requires grad?	2	42	February 12, 2025
Why is MMS-TTS-ITA model not available?	2	32	February 10, 2025
Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation	5	3374	October 16, 2024
Multimodal models (DO WE REALLY NEED THEM?) Can a Causal LM suffice?	2	182	October 10, 2024
Finetune whisper-tiny in german for tflite runtime	2	192	October 16, 2024
Shard Cannot Start/Inference endpoint error while deployment	5	183	April 6, 2025
Issue while loading fine-tuned gemma2	3	178	December 29, 2024
Multihead attention	1	81	October 2, 2024
Encoding masks for Mask2Former and Panoptic Segmentation	2	96	October 9, 2024
Why are some weights FP32 in Llama 3.1 405B FBGEMM FP8 Quantization?	7	471	September 27, 2024
LoRA Adapted DeepSeek R1 not working with inference endpoints	2	53	April 22, 2025
Models for reading Schematic PDFs	2	81	January 28, 2025