Topic | Replies | Views | Activity
Pruned Llama on lm-evaluation-harness | 0 | 156 | July 29, 2024
Ollama inside Space | 0 | 164 | June 13, 2024
Need help in fine-tuning T5-Base Model for a sequence task | 0 | 164 | May 8, 2024
New Battle in AI field | 0 | 28 | January 26, 2025
Model *and* data parallelism when training on multiple GPUs? | 0 | 27 | January 22, 2025
Whisper Jax setup on guide hugging face CPU | 0 | 29 | December 3, 2024
Inconsistent Training Time with Accelerate | 0 | 29 | November 8, 2024
AttributeError: 'TimmBackbone' object has no attribute 'model_type' | 0 | 29 | October 22, 2024
Setting up Mistral on Inferentia2 with higher number of tokens | 0 | 33 | September 25, 2024
Ask for help: Output inconsistency when using LLM batch inference compared to single input | 4 | 44 | March 20, 2025
Package compatibility issues | 2 | 21 | March 14, 2025
Code not working in when I import from Github | 1 | 29 | March 9, 2025
Repository for XAI explanations | 2 | 21 | February 11, 2025
Docker Space Receives Seemingly Random TERM | 5 | 53 | January 1, 2025
Tokenizer performance is slow, after call to dataset map | 0 | 157 | June 15, 2024
How to get the loss from the Trainer class? | 0 | 157 | May 25, 2024
Llava endpoint on Sagemaker | 0 | 159 | May 10, 2024
Using PDFs in the chat hardly works | 2 | 51 | March 6, 2025
Pip install optimum[exporters-tf] | 3 | 54 | January 18, 2025
Cross-encoder inference API DOWN? | 1 | 61 | October 25, 2024
Intermittent drop outs / slow downloads via datasets server | 7 | 43 | February 24, 2025
Space Service Down for 4+ Hours, No Signs of Recovery | 2 | 90 | September 23, 2024
Fine tune Meta-Llama-3.1-8B OOM error after the 1st training step | 0 | 154 | September 6, 2024
Help with dedicated endpoints | 0 | 164 | May 13, 2024
Finetuning a small LLM on 32GB, 4vCPU | 0 | 159 | July 12, 2024
How to use set_transform when map becomes unfeasible? | 2 | 130 | June 19, 2024
How to Add New Data to an Existing Parquet Dataset? | 1 | 62 | February 7, 2025
Ggml-org/gguf-my-repo the script fails with Flux FP8 | 3 | 82 | September 13, 2024
API Usage for Gradio apps | 0 | 151 | August 18, 2024
Deploying chainlit app on huggingface | 0 | 151 | August 3, 2024
Terminal print statements in a Gradio interface? | 0 | 151 | June 18, 2024
Questions about Mistral and apply_chat_template with Text Generation Inference, openai API and messages API | 0 | 162 | May 15, 2024
Error when using Video component with Subtitle SRT | 0 | 159 | May 15, 2024
HF Inference Endpoints Error 429 | 2 | 55 | March 27, 2025
Docker: ollama: Server unavailable, error code: 349453 | 0 | 152 | July 29, 2024
Custom Dockerfile in spaces | 0 | 149 | July 5, 2024
How to use GPT4 with trl PPO script | 0 | 159 | May 28, 2024
Requests Fail with 404 on HuggingFace Inference Due to X-Forwarded-Host Header | 0 | 30 | April 8, 2025
New ArXiv Daily Newsletter Tool - SciSummarize | 0 | 36 | March 10, 2025
Rival - Open-source tool for visually comparing AI model responses | 0 | 26 | March 3, 2025
/tmp files directory | 0 | 28 | February 21, 2025
Deterministic prompt + probabilistic any prompt reasoning | 0 | 26 | February 13, 2025
Running DPOTrainer with custom gpu management | 0 | 26 | February 7, 2025
Please tell me that HF doesn't actually humour reports from PRC nationalists to ban ablating the censorship from Chinese models | 0 | 27 | February 5, 2025
DONUT: Reading order for pseudo-OCR pre-training task | 0 | 30 | January 16, 2025
Thenlper/gte-large model not initializing on hugging face endpoints | 0 | 32 | January 8, 2025
Why does PALIGemma use 256 tokens for a 224x224 image | 0 | 28 | December 8, 2024
BERT token classification / regression question | 0 | 32 | November 5, 2024
Invert automatic mask for forge | 0 | 28 | September 6, 2024
If I want to find an app/model that does "inpainting" how do I search? | 2 | 18 | April 26, 2025