How do I load an SFTTrainer model finetuned falcon-7b-sharded-bf16 using custom dataset, and make prediction with it
|
|
2
|
1278
|
August 1, 2023
|
Error when running code from recently-posted Deeplearning.ai video that uses HF libraries (among others)
|
|
0
|
470
|
August 1, 2023
|
Why using ground-truth noise in a diffusion model does not work?
|
|
0
|
349
|
August 1, 2023
|
Deploy big model to AWS Sagemaker fails
|
|
5
|
1082
|
July 31, 2023
|
Mac M1 Colab ImportError
|
|
0
|
246
|
July 31, 2023
|
Disable interaction of row using a checkbox
|
|
0
|
149
|
July 31, 2023
|
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument index in method wrapper__index_select)
|
|
2
|
1859
|
July 30, 2023
|
Can't connect to huggingface
|
|
1
|
1796
|
July 30, 2023
|
Moving Beyond Spec-Obsession: Embracing Application-Focused AI Models
|
|
1
|
288
|
July 30, 2023
|
How to add additional features last layer of T5 before finetuning on seq2seq
|
|
0
|
126
|
July 30, 2023
|
Using GenerationMixin.generate on my own model
|
|
2
|
639
|
July 30, 2023
|
What does load_best_model_at_end=True and evaluation_strategy="no" mean?
|
|
0
|
1300
|
July 29, 2023
|
Suggestions about a model for a single individual human
|
|
0
|
189
|
July 29, 2023
|
K fold cross validation
|
|
5
|
13039
|
July 29, 2023
|
Trainer Ignoring Weight Decay, Beta arguments
|
|
1
|
910
|
July 28, 2023
|
AttributeError: module 'gradio' has no attribute 'themes'
|
|
0
|
2121
|
July 28, 2023
|
How do you calculate max steps
|
|
2
|
2323
|
July 28, 2023
|
How to fine-tune mT5 model for QA task?
|
|
0
|
486
|
July 28, 2023
|
Train LLM Model using multiple datasets
|
|
0
|
789
|
July 28, 2023
|
Identifying duplicates in csv
|
|
0
|
212
|
July 28, 2023
|
I don't understand the difference between asymmetric retrieval, sentence similarity, and semantic search
|
|
2
|
6326
|
July 28, 2023
|
Inference API - Response of Higher Length
|
|
0
|
850
|
April 22, 2021
|
Pipeline-parrallel vram consumation
|
|
0
|
183
|
July 27, 2023
|
How to set "src_lang" and "tgt_lang" for Hosted Inference API?
|
|
0
|
375
|
July 27, 2023
|
Visibility of code vs. visibility of an app (space)
|
|
1
|
778
|
July 27, 2023
|
How to add a custom CRF head on top of BERT for token classification?
|
|
1
|
2070
|
July 27, 2023
|
Http://localhost:3000 has been blocked by CORS policy - running nextjs localhost with huggingface spaces
|
|
1
|
987
|
July 27, 2023
|
How do I evaluate a pretrained model on a test dataset?
|
|
1
|
8835
|
February 24, 2022
|
Size of saved model: Is there a way to make it smaller for deploy?
|
|
1
|
604
|
July 27, 2023
|
Share your pain points building AI model
|
|
2
|
286
|
July 27, 2023
|