LLM fine-tune with domain specific pdf documents
|
|
20
|
25248
|
November 5, 2024
|
How to improve Performance?
|
|
2
|
299
|
November 5, 2024
|
Questions about classification models
|
|
0
|
33
|
November 4, 2024
|
T5 Finetuning Tips
|
|
48
|
56949
|
November 3, 2024
|
Global Transformer in Llama 3.2 Vision
|
|
1
|
151
|
October 31, 2024
|
Low GPU utilization with the Decision Transformer
|
|
6
|
480
|
October 30, 2024
|
I am trying to build one text-to-sql with huggingface chatdb/natural-sql-7b model, it seems it is getting stuck every time and not generating any result. here is my code. Another problem is its notworking with "cuda". It's showing "torch is not compiled w
|
|
3
|
40
|
October 30, 2024
|
AI reasoning capabilities
|
|
1
|
200
|
October 29, 2024
|
Replace Causal Mask of T5 to custom mask
|
|
3
|
428
|
October 29, 2024
|
Unable to Read Username for 'https://huggingface.co'
|
|
4
|
2984
|
October 29, 2024
|
Build error while cloning the repository
|
|
1
|
74
|
October 29, 2024
|
Wget timed out in CI/CD pipeline
|
|
3
|
181
|
October 28, 2024
|
What are the best 3 text generation models that can be used via API for free with 128k context?
|
|
0
|
262
|
October 28, 2024
|
Unsupervised fine tuning mistral 7b
|
|
6
|
2479
|
October 27, 2024
|
Help with preparing train data for fine-tuning llama 3.1 instruct model?
|
|
0
|
104
|
October 27, 2024
|
ValueError: Unsupported model type mllama
|
|
3
|
429
|
October 23, 2024
|
Training memory footprint depends on instantiating method
|
|
1
|
43
|
October 23, 2024
|
How to use custom dataset in a model
|
|
0
|
28
|
October 23, 2024
|
Assistance Required for fudan-generative-ai/hallo2 Implementation and Model Weight Issues
|
|
1
|
31
|
October 23, 2024
|
Torchview/hiddenlayer produces blank nodes in visualisation
|
|
2
|
61
|
October 23, 2024
|
Can you help me interepret the results of my hyperparameter sweep for fine-tuning BLIP2-2.7?
|
|
0
|
51
|
October 22, 2024
|
AttributeError: 'TimmBackbone' object has no attribute 'model_type'
|
|
0
|
34
|
October 22, 2024
|
How to match locserver performance with Hugging face V3
|
|
0
|
30
|
October 22, 2024
|
Generate Code documentation from SQR Code
|
|
0
|
12
|
October 21, 2024
|
Arabic models timeout Error
|
|
1
|
47
|
October 19, 2024
|
How to split PDF document by table of contents
|
|
4
|
773
|
October 17, 2024
|
Finetuning mT5 for specific language pair
|
|
0
|
159
|
October 17, 2024
|
HuggingFace offline error
|
|
1
|
3024
|
October 17, 2024
|
How to find models that work on low memory/CPU edge devices
|
|
3
|
936
|
October 17, 2024
|
Llama Introduction
|
|
1
|
115
|
October 16, 2024
|