Injecting Multiple Modalities into a Transformer Decoder via Cross-Attention
|
|
1
|
94
|
March 9, 2025
|
ZeroGPU illegal duration. The requested GPU duration (300s) is larger than the maximum allowed
|
|
5
|
246
|
March 9, 2025
|
Does the model "Qwen/Qwen2.5-Coder-32B-Instruct" free?
|
|
1
|
162
|
March 9, 2025
|
How to Train an Image Captioning Model for specific language
|
|
3
|
18
|
March 9, 2025
|
Delving in to AI: asking for advice
|
|
0
|
24
|
March 8, 2025
|
Support for LLaMA in EncoderDecoder framework
|
|
1
|
523
|
March 8, 2025
|
AANN: Agents As Neural Networks
|
|
0
|
45
|
March 8, 2025
|
Training LLM model for asking questions
|
|
4
|
92
|
March 8, 2025
|
Unable to run Application
|
|
2
|
21
|
March 8, 2025
|
Guidance on getting started with fine tuned uncensored model
|
|
2
|
847
|
March 8, 2025
|
How to train this model
|
|
0
|
75
|
March 8, 2025
|
Freezing layers with SFTTrainer
|
|
2
|
265
|
March 8, 2025
|
Giving personality to LLM
|
|
1
|
535
|
March 8, 2025
|
Train bus routes csv
|
|
3
|
19
|
March 8, 2025
|
I canât get Email confirmation link
|
|
79
|
8787
|
March 7, 2025
|
SFTTrainer Doubling Speed on a Single GPU with DeepSpeed: Proposal for an Update to the Official Documentation and Verification Report
|
|
1
|
58
|
March 7, 2025
|
Issue with ALLaM-7B Model in Inference API - Size Limitation Error
|
|
1
|
54
|
March 7, 2025
|
Use microsoft phi-2
|
|
3
|
23
|
March 7, 2025
|
After fine tuning openai whisper model, there shows OSError WinError 123
|
|
1
|
14
|
March 7, 2025
|
Dataset of course is not available anymore
|
|
1
|
28
|
March 7, 2025
|
About Hyperparameter Search with Ray Tune
|
|
2
|
21
|
March 7, 2025
|
Is this needed: bnb 4bit use double quant = True?
|
|
3
|
2510
|
March 7, 2025
|
Cannot upload large files for a single repository
|
|
3
|
1127
|
March 7, 2025
|
Is there any difference between GPT-J and GPT-2?
|
|
3
|
2751
|
March 7, 2025
|
Trainer.train() runs for long and appears to be stuck. How do I know that it's processing and not in loop
|
|
2
|
593
|
March 7, 2025
|
Benchmarking Vision Models for Specific Use Cases
|
|
0
|
52
|
March 7, 2025
|
Cursor vs. Local IDE with Extensions
|
|
0
|
45
|
March 7, 2025
|
Google Colab vs. Local IDE for Vision Projects
|
|
0
|
20
|
March 7, 2025
|
Hugging face inference support and quota
|
|
3
|
112
|
March 7, 2025
|
As of transformers v4.44, default chat template is no longer allowed
|
|
2
|
3499
|
March 7, 2025
|