Uploading a heavy dataset to Jean-Zay
|
|
3
|
56
|
February 17, 2025
|
Pipeline output no longer matches the provided example
|
|
1
|
8
|
February 17, 2025
|
TGI with guidance generates weird output when asked to answer in a "structured" way
|
|
3
|
123
|
February 17, 2025
|
DocVQA test dataset evaluation on qwen2.5-VL-3B
|
|
0
|
71
|
February 16, 2025
|
New here just checking things out
|
|
1
|
46
|
February 16, 2025
|
LayoutLM data format for bounding box classification
|
|
1
|
260
|
February 13, 2025
|
Format Reward Function in GRPO Training Doesn't Stabilise
|
|
0
|
553
|
February 12, 2025
|
Using TRL on TPU
|
|
1
|
171
|
February 11, 2025
|
Speaker Verification: All Speakers Getting Perfect 1.000 Similarity Scores
|
|
0
|
25
|
February 10, 2025
|
DownloadAndLoadFlorence2Model 401 Client Error
|
|
1
|
277
|
February 10, 2025
|
Model size-quantization tradeoff for local offline inference
|
|
1
|
76
|
February 7, 2025
|
Creating A Team Of LLMs
|
|
2
|
191
|
February 6, 2025
|
How to pass large context to pipeline once instead of again and again for each query?
|
|
0
|
14
|
February 6, 2025
|
ValueError: You should supply an encoding or a list of encodings to this method that includes input_ids, but you provided ['label']
|
|
3
|
18073
|
February 4, 2025
|
Generation is called twice when using two GPUs
|
|
1
|
17
|
February 3, 2025
|
Custom BenchMark creation
|
|
5
|
74
|
February 2, 2025
|
TRL + PPO + Using Conditioned Reference Model
|
|
3
|
61
|
January 27, 2025
|
Fine-tune model with CoT
|
|
1
|
380
|
January 27, 2025
|
Cuda out of memory error
|
|
11
|
41588
|
January 27, 2025
|
Unexpected Things
|
|
2
|
26
|
January 25, 2025
|
LLM fine tuning for E-commerce product recommendation
|
|
1
|
1627
|
January 25, 2025
|
Facebook FAISS on Databricks
|
|
1
|
558
|
January 23, 2025
|
Compute Perplexity using compute_metrics in SFTTrainer
|
|
1
|
922
|
January 22, 2025
|
PydanticUserError: The `__modify_schema__` method is not supported in Pydantic v2. Use `__get_pydantic_json_schema__` instead in class `SecretStr`
|
|
1
|
428
|
January 22, 2025
|
Custom dataset maskformer
|
|
15
|
72
|
January 18, 2025
|
Generate without using the generate method
|
|
8
|
5994
|
January 17, 2025
|
Darshan Hiranandani : How to Create Datasets from PDF Files?
|
|
2
|
115
|
January 17, 2025
|
Darshan Hiranandani : Optimizing Model for Handling Large Transcripts with Metadata: Suggestions Needed
|
|
0
|
18
|
January 16, 2025
|
How to improve pattern detection accuracy
|
|
3
|
32
|
January 9, 2025
|
Opinion: Training Argument Fine Tuning MLM RoBERTa
|
|
1
|
178
|
January 9, 2025
|