Models

Topic	Replies	Views	Activity
EleutherAI / lm-evaluation-harness on a custom model	0	2075	April 10, 2024
CNN related problems	1	124	April 9, 2024
Looking for a Detectron2 model zoo that has been trained on aerial(satellite) images	0	264	April 8, 2024
How to fine-tune a mistral LLM for a multi-turn conversation, are there any examples?	0	766	April 8, 2024
Conversion of neural networks to closed-form expressions as human-readable mathematical functions	0	104	April 8, 2024
Speech2TextModel does not support small d_model	0	89	April 5, 2024
Processong speed for text embedding models	0	174	April 5, 2024
BERT2BERT for summarization results	1	515	April 5, 2024
Denoising Diffusion Probabilistic Models (DDPM) - reconstruction is not sharp but blurry and noisy	1	804	April 4, 2024
Model for object classification of every possible object	0	138	April 4, 2024
Looking for Pre-trained Model for Image Categorization (Screenshots, Photos, Scans, etc.)	2	1301	April 3, 2024
Extract information from nested tables and charts (Bar/Pie charts)	0	252	April 3, 2024
Using an encoder-decoder model for Recognizing Textual Entailment (GLUE task)	0	171	March 31, 2024
Help Needed: Fine-Tuned Model for Georgian Language Not Generating Text	0	144	March 31, 2024
Llama2 pad token for batched inference	7	15697	March 31, 2024
Llama/Mistral Finetuning for Inference API	0	169	March 30, 2024
8 bit precision error	0	423	March 30, 2024
DistillGpt2 only predicts endoftext if context is full	0	93	March 30, 2024
Using mlx lora.py with llama-2-13b and mixtral-8x7b	0	483	March 30, 2024
Don't average the loss	1	622	March 30, 2024
Hermes 2's secret origin story? 🧐	0	140	March 29, 2024
Fine tuning existing hugging face model on new dataset (text to sql task)	0	1640	December 6, 2023
Incomplete/ partial response generation	3	1421	March 27, 2024
How to add noise to the intermediate layer of huggingface bert model?	0	136	March 27, 2024
Which Transformers model is suitable for Video to text and summarize the text?	0	226	March 25, 2024
Find models by size	3	2261	March 24, 2024
How would you train a model for hard/soft skill detection based on a taxonomy?	3	450	March 24, 2024
MCQA Model Underfitting the Training Data	0	124	March 22, 2024
NameError: name 'feature_extractor' is not defined	4	961	March 23, 2024
Find LLM to run on single gpu with only 8 GB ram	10	8170	March 22, 2024