| Topic | Replies | Views | Activity |
|---|---|---|---|
| Using TRL on TPU | 0 | 9 | February 7, 2025 |
| Fine tuning LLM for text classification -- error with SFTTrainer | 1 | 1301 | February 7, 2025 |
| Model size-quantization tradeoff for local offline inference | 1 | 9 | February 7, 2025 |
| Creating A Team Of LLMs | 2 | 26 | February 6, 2025 |
| How to pass large context to pipeline once instead of again and again for each query? | 0 | 7 | February 6, 2025 |
| ValueError: You should supply an encoding or a list of encodings to this method that includes input_ids, but you provided ['label'] | 3 | 17375 | February 4, 2025 |
| Blip2 peft training | 1 | 11 | February 3, 2025 |
| Generation is called twice when using two GPUs | 1 | 9 | February 3, 2025 |
| Custom BenchMark creation | 5 | 20 | February 2, 2025 |
| TRL + PPO + Using Conditioned Reference Model | 3 | 29 | January 27, 2025 |
| Fine-tune model with CoT | 1 | 91 | January 27, 2025 |
| Cuda out of memory error | 11 | 38012 | January 27, 2025 |
| Unexpected Things | 2 | 24 | January 25, 2025 |
| LLM fine tuning for E-commerce product recommendation | 1 | 1474 | January 25, 2025 |
| Facebook FAISS on Databricks | 1 | 411 | January 23, 2025 |
| Compute Perplexity using compute_metrics in SFTTrainer | 1 | 798 | January 22, 2025 |
| PydanticUserError: The `__modify_schema__` method is not supported in Pydantic v2. Use `__get_pydantic_json_schema__` instead in class `SecretStr` | 1 | 74 | January 22, 2025 |
| Resize embeddings on Peft model | 3 | 39 | January 22, 2025 |
| Custom dataset maskformer | 15 | 42 | January 18, 2025 |
| Generate without using the generate method | 8 | 5253 | January 17, 2025 |
| Darshan Hiranandani : How to Create Datasets from PDF Files? | 2 | 45 | January 17, 2025 |
| Darshan Hiranandani : Optimizing Model for Handling Large Transcripts with Metadata: Suggestions Needed | 0 | 16 | January 16, 2025 |
| How to improve pattern detection accuracy | 3 | 27 | January 9, 2025 |
| Opinion: Training Argument Fine Tuning MLM RoBERTa | 1 | 30 | January 9, 2025 |
| Use LongT5 model for binary classification | 0 | 18 | January 9, 2025 |
| Pyannotate pipeline() not working | 6 | 45 | January 9, 2025 |
| The Correct Attention Mask For Examples Packing | 6 | 2261 | January 8, 2025 |
| Non Maximum Merging for Oriented BBox | 1 | 32 | January 8, 2025 |
| Pretrain swin Former on xview2 dataset (satellite dataset different from imagenet) | 2 | 7 | January 8, 2025 |
| Want to host a production level server for runnin llm for code generation | 0 | 25 | January 7, 2025 |