Handling Extreme Class Imbalance for Multi-Class Classification
|
|
1
|
39
|
May 14, 2025
|
Quantization not yet implemented
|
|
0
|
98
|
June 1, 2024
|
Why is aggregation_strategy="simple" not combining subwords properly in Hugging Face token classification (DeBERTa fine-tuned model)
|
|
0
|
17
|
April 22, 2025
|
Forward method inconsistent for time series transformer
|
|
0
|
95
|
May 26, 2024
|
Accessing AltCLIP text encoder
|
|
0
|
95
|
March 14, 2024
|
Accessibility of Huggingface's OpenLLMLeaderboard Benchmark Test Sets
|
|
3
|
47
|
July 23, 2024
|
Runtime error Exit code: 0. Reason: application does not seem to be initialized Container logs: ===== Application Startup at 2025-03-21 09:57:17 =====
|
|
0
|
95
|
March 21, 2025
|
"Can someone help me? I'm looking for free software that can put a character's face into an existing video and does it relatively quickly. Does anyone know of one?"
|
|
0
|
94
|
November 13, 2024
|
SmolAgents Helium example
|
|
1
|
37
|
March 16, 2025
|
How to forbade Gemma 2 from using a certain phrase and use another one in its place?
|
|
7
|
18
|
May 21, 2025
|
Why isn't quantization config reducing memory usage?
|
|
0
|
90
|
August 16, 2024
|
Text-to-Sql model keeps missing "<" token
|
|
2
|
29
|
June 11, 2025
|
Data format for fine-tune base model
|
|
2
|
29
|
March 10, 2025
|
SFTTrainer for Llama-2
|
|
0
|
89
|
August 3, 2024
|
TGI & guidance making a strange behavior
|
|
3
|
25
|
March 24, 2025
|
VisEncoderDecoderModel generate text incomplete when predict image with long text label
|
|
0
|
88
|
May 21, 2024
|
Translation with marianmt. Early stopping stucked
|
|
4
|
22
|
June 17, 2025
|
YOLOv8 Hand Detection Fails at Close Range After TensorFlow.js Conversion
|
|
0
|
15
|
February 28, 2025
|
Creating a custom Multi Task model using a custom config
|
|
0
|
15
|
November 7, 2024
|
How can I dynamically update the system configuration for different users using my demo?
|
|
6
|
31
|
August 28, 2024
|
Inference endpoint
|
|
1
|
32
|
August 11, 2024
|
How to pass large context to pipeline once instead of again and again for each query?
|
|
0
|
14
|
February 6, 2025
|
Developing a cartoon story
|
|
3
|
39
|
September 16, 2024
|
Qlora Training with Custom Trainer
|
|
0
|
75
|
September 19, 2024
|
AI House material change
|
|
0
|
13
|
June 12, 2025
|
Stateful PEFT adapter
|
|
0
|
13
|
June 5, 2025
|
Best tool/method for AI model traceability management?
|
|
0
|
13
|
October 14, 2024
|
How to Deploy a trained transformer-based model - Emmanuel Katto Uganda
|
|
1
|
51
|
July 17, 2024
|
Together Inference Credit finished
|
|
1
|
28
|
March 13, 2025
|
Can anybody recommend a good image filename generating AI?
|
|
1
|
27
|
April 24, 2025
|
Instructions to raise PR for addition of shared library files(.so) and .cpp files
|
|
0
|
12
|
January 3, 2025
|
Implementing a Custom Contrastive Trainer for Code Embeddings with Multiple Correct and Incorrect Solutions
|
|
1
|
26
|
March 25, 2025
|
Why is the memory quickly filled up in the first few iterations when using Trainer of transformers to train the network, and then drops to a very low level as the training progresses?
|
|
0
|
11
|
May 25, 2025
|
Abstracted Application Access via Dynamic URL Distribution
|
|
0
|
11
|
October 4, 2024
|
TRL - Fine tuned small model (facebook350m) yields many empty inferences
|
|
1
|
24
|
June 19, 2025
|
Regarding the Image Generation
|
|
1
|
24
|
June 6, 2025
|
Crisp AI to AI language the road to AGI
|
|
1
|
24
|
May 29, 2025
|
Trouble fine-tuning Flan-T5 (with LoRA) for structured map generation – model repeats prompt or instructions
|
|
1
|
24
|
May 26, 2025
|
Use Trainer with2 optimizers?
|
|
0
|
60
|
July 17, 2024
|
Help in developing app
|
|
0
|
56
|
July 8, 2024
|
Finetuning a pre-trained model
|
|
0
|
55
|
August 21, 2024
|
Cannot get tools to work: InferenceClient + hf-inference + Qwen/Qwen3-235B-A22B -- Internal Server Error
|
|
3
|
15
|
June 17, 2025
|
Pretrain swin Former on xview2 dataset (satellite dataset different from imagenet)
|
|
2
|
17
|
January 8, 2025
|
The Alpha Version of TrixGenius Has Arrived — Supercharge Your Trix Editing Experience with AI!
|
|
0
|
9
|
April 10, 2025
|
Seeking Advice on Processing Support Conversations for Efficient RAG Model Search
|
|
0
|
50
|
September 9, 2024
|
DoRA for depthwise-convolutional layers
|
|
0
|
44
|
July 18, 2024
|
Test Time Fine Tuning
|
|
0
|
39
|
December 5, 2024
|
Trying to reproduce hugging face results
|
|
0
|
39
|
July 12, 2024
|
ValueError in Seq2SeqTrainer uses the Whisper model
|
|
0
|
38
|
July 13, 2024
|
Classifying text based on intent using bert
|
|
0
|
37
|
July 29, 2024
|