Cuda out of memory error | 11 | 42560 | January 27, 2025
Unexpected Things | 2 | 27 | January 25, 2025
LLM fine tuning for E-commerce product recommendation | 1 | 1657 | January 25, 2025
Compute Perplexity using compute_metrics in SFTTrainer | 1 | 978 | January 22, 2025
PydanticUserError: The `__modify_schema__` method is not supported in Pydantic v2. Use `__get_pydantic_json_schema__` instead in class `SecretStr` | 1 | 499 | January 22, 2025
Custom dataset maskformer | 15 | 92 | January 18, 2025
Generate without using the generate method | 8 | 6256 | January 17, 2025
Darshan Hiranandani : How to Create Datasets from PDF Files? | 2 | 124 | January 17, 2025
Darshan Hiranandani : Optimizing Model for Handling Large Transcripts with Metadata: Suggestions Needed | 0 | 18 | January 16, 2025
How to improve pattern detection accuracy | 3 | 32 | January 9, 2025
Opinion: Training Argument Fine Tuning MLM RoBERTa | 1 | 281 | January 9, 2025
Use LongT5 model for binary classification | 0 | 89 | January 9, 2025
Pyannotate pipeline() not working | 6 | 359 | January 9, 2025
The Correct Attention Mask For Examples Packing | 6 | 3107 | January 8, 2025
Non Maximum Merging for Oriented BBox | 1 | 130 | January 8, 2025
Pretrain swin Former on xview2 dataset (satellite dataset different from imagenet) | 2 | 20 | January 8, 2025
Want to host a production level server for runnin llm for code generation | 0 | 73 | January 7, 2025
Getting model version/commit id? | 0 | 45 | January 7, 2025
How can I evaluate a fine tuned LLM? | 4 | 1333 | January 7, 2025
Docker image "THIS IMAGE IS DEPRECATED and is scheduled for DELETION." message | 0 | 105 | January 6, 2025
Lora: missing adapter keys while loading the checkpoint | 2 | 1350 | January 6, 2025
Why is my setfit model only outputting two possible class confidence scores? | 1 | 39 | January 5, 2025
Combine multiple Lora's for group photo? | 1 | 628 | January 3, 2025
Instructions to raise PR for addition of shared library files(.so) and .cpp files | 0 | 13 | January 3, 2025
Darshan Hiranandani : How to Replace Specific Layer Weights in One Model with Weights from Another Model? | 0 | 29 | January 2, 2025
Open-LLM-Leaderboard for dummies | 3 | 359 | December 30, 2024
ValueError: {'code': None, 'message': 'ModelMetaclass object argument after ** must be a mapping, not str' | 4 | 68 | December 27, 2024
How do I backpropagate specific output tokens using Trainer? | 0 | 38 | December 25, 2024
Speech synthesis model with Styles Like Emoticons or emphasis | 3 | 255 | December 25, 2024
Torchrun, trainer, dataset setup | 4 | 963 | December 20, 2024