Why can't I reproduce benchmark scores from papers like Phi, Llama, or Qwen? Am I doing something wrong or is this normal?
|
|
2
|
58
|
June 10, 2025
|
Segmant Anything Model (SAM) ValueError: Invalid image type
|
|
2
|
12
|
June 10, 2025
|
Efficient batch inference using stacked past_key_values for multiple continuation candidates
|
|
1
|
23
|
June 10, 2025
|
Error while initializing ZeroGPU
|
|
5
|
71
|
June 10, 2025
|
Any thoughts on Novita AI?
|
|
1
|
98
|
June 10, 2025
|
Building a Multi Lingual Multi Task Model in Finance Domain
|
|
2
|
48
|
June 10, 2025
|
LLM architecture Dots1ForCausalLM conversion to GGUF
|
|
1
|
63
|
June 7, 2025
|
Opus-MT: Translation returns <unk> token
|
|
3
|
15
|
June 6, 2025
|
Convert the models downloaded in .cache/huggingface/hub/ to original format
|
|
1
|
34
|
June 6, 2025
|
Unable to access Llama 3.3 despite attaining it through Llama.com
|
|
1
|
37
|
June 4, 2025
|
Running and testing / BharatGPT-3B-Indic
|
|
5
|
38
|
May 28, 2025
|
Restricting model download
|
|
0
|
29
|
June 2, 2025
|
What are the most effective recent approaches for predicting social media post virality?
|
|
2
|
29
|
May 30, 2025
|
404 error for models
|
|
6
|
1413
|
May 29, 2025
|
Need help to find old Embeddings I lost during PC installation
|
|
4
|
24
|
May 27, 2025
|
Optimal Approach for Fine-Tuning LayoutLMv3 for Token Classification with 80 Labels
|
|
3
|
31
|
May 26, 2025
|
Why in n8n FLUX.1-dev creates only square images?
|
|
0
|
17
|
May 26, 2025
|
Llama4 routing scores
|
|
1
|
105
|
May 23, 2025
|
Looking for: Controlnets reference_only model
|
|
3
|
69
|
May 21, 2025
|
Issue with Using AventIQ for News Sentiment Analysis
|
|
3
|
20
|
May 21, 2025
|
Best model to extract text from old Church records written in cursive?
|
|
2
|
42
|
May 18, 2025
|
RoFormer (Eng language)
|
|
4
|
37
|
May 17, 2025
|
403 Client Error: Forbidden for url:
|
|
17
|
13083
|
May 16, 2025
|
Models for eye gaze data
|
|
4
|
1470
|
May 16, 2025
|
Open-source RL Model for Predicting Sales Conversion from Conversations + Free Agent Platform (Dataset, Model, Paper, Demo)
|
|
0
|
63
|
May 13, 2025
|
Catalog of valid arguments in model cards?
|
|
5
|
28
|
May 12, 2025
|
Will somebody train a model for me?
|
|
0
|
30
|
May 12, 2025
|
But is there even a single model working here?!
|
|
4
|
383
|
May 10, 2025
|
Image Captioning with ViT and GPT 2 Base
|
|
2
|
61
|
May 10, 2025
|
Best model for image object comparison?
|
|
3
|
136
|
May 9, 2025
|