EleutherAI / lm-evaluation-harness on a custom model
|
|
0
|
2075
|
April 10, 2024
|
CNN related problems
|
|
1
|
124
|
April 9, 2024
|
Looking for a Detectron2 model zoo that has been trained on aerial(satellite) images
|
|
0
|
264
|
April 8, 2024
|
How to fine-tune a mistral LLM for a multi-turn conversation, are there any examples?
|
|
0
|
766
|
April 8, 2024
|
Conversion of neural networks to closed-form expressions as human-readable mathematical functions
|
|
0
|
104
|
April 8, 2024
|
Speech2TextModel does not support small d_model
|
|
0
|
89
|
April 5, 2024
|
Processong speed for text embedding models
|
|
0
|
174
|
April 5, 2024
|
BERT2BERT for summarization results
|
|
1
|
515
|
April 5, 2024
|
Denoising Diffusion Probabilistic Models (DDPM) - reconstruction is not sharp but blurry and noisy
|
|
1
|
804
|
April 4, 2024
|
Model for object classification of every possible object
|
|
0
|
138
|
April 4, 2024
|
Looking for Pre-trained Model for Image Categorization (Screenshots, Photos, Scans, etc.)
|
|
2
|
1301
|
April 3, 2024
|
Extract information from nested tables and charts (Bar/Pie charts)
|
|
0
|
252
|
April 3, 2024
|
Using an encoder-decoder model for Recognizing Textual Entailment (GLUE task)
|
|
0
|
171
|
March 31, 2024
|
Help Needed: Fine-Tuned Model for Georgian Language Not Generating Text
|
|
0
|
144
|
March 31, 2024
|
Llama2 pad token for batched inference
|
|
7
|
15697
|
March 31, 2024
|
Llama/Mistral Finetuning for Inference API
|
|
0
|
169
|
March 30, 2024
|
8 bit precision error
|
|
0
|
423
|
March 30, 2024
|
DistillGpt2 only predicts endoftext if context is full
|
|
0
|
93
|
March 30, 2024
|
Using mlx lora.py with llama-2-13b and mixtral-8x7b
|
|
0
|
483
|
March 30, 2024
|
Don't average the loss
|
|
1
|
622
|
March 30, 2024
|
Hermes 2's secret origin story? 🧐
|
|
0
|
140
|
March 29, 2024
|
Fine tuning existing hugging face model on new dataset (text to sql task)
|
|
0
|
1640
|
December 6, 2023
|
Incomplete/ partial response generation
|
|
3
|
1421
|
March 27, 2024
|
How to add noise to the intermediate layer of huggingface bert model?
|
|
0
|
136
|
March 27, 2024
|
Which Transformers model is suitable for Video to text and summarize the text?
|
|
0
|
226
|
March 25, 2024
|
Find models by size
|
|
3
|
2261
|
March 24, 2024
|
How would you train a model for hard/soft skill detection based on a taxonomy?
|
|
3
|
450
|
March 24, 2024
|
MCQA Model Underfitting the Training Data
|
|
0
|
124
|
March 22, 2024
|
NameError: name 'feature_extractor' is not defined
|
|
4
|
961
|
March 23, 2024
|
Find LLM to run on single gpu with only 8 GB ram
|
|
10
|
8170
|
March 22, 2024
|