Refiner SD-XL-1.0 is degraded latent of base model
|
|
0
|
113
|
August 14, 2023
|
Unable to load checkpoint after finetuning
|
|
0
|
87
|
August 14, 2023
|
The model did not return a loss from the inputs, only the following keys: logits. For reference, the inputs it received are input_values
|
|
5
|
10651
|
August 11, 2023
|
Train loss goes to zero after some epochs
|
|
0
|
65
|
August 11, 2023
|
Past_key_value with multiple new tokens
|
|
1
|
95
|
August 10, 2023
|
CUDA OOM. Is it possible to distribute the usage of memory across 2gpu evenly?
|
|
1
|
108
|
August 9, 2023
|
TypeError: Repository.__init__() got an unexpected keyword argument 'token'
|
|
8
|
5094
|
August 9, 2023
|
Train instruct pix2pix task with dreambooth
|
|
0
|
62
|
August 6, 2023
|
Regenerate Prompt tuning result with appended prompt on base model
|
|
0
|
125
|
August 6, 2023
|
Issues when using `accelerate` with `fp16`
|
|
2
|
2780
|
August 5, 2023
|
Multi-gpu batch processing fails when using Peft Lora with Huggingface
|
|
0
|
162
|
August 4, 2023
|
Multi-Task dataset with Custom Sampler and Sharding
|
|
4
|
751
|
August 1, 2023
|
DocVQA for Recognizing Page Numbers in Older Text
|
|
0
|
56
|
August 1, 2023
|
How to make a QA model generate full sentences
|
|
0
|
78
|
July 31, 2023
|
How can I use evaluate's perplexity metric on a model that's already loaded?
|
|
0
|
123
|
July 28, 2023
|
Out of memory training 3B param model on 8 GPU (320GB memory) with FSDP
|
|
1
|
387
|
July 28, 2023
|
Implementation of NER model with relationship extraction?
|
|
2
|
3118
|
July 27, 2023
|
What is the official way to run a wandb sweep with hugging face (HF) transformers?
|
|
2
|
402
|
July 25, 2023
|
TextIteratorStreamer compatibility with batch processing
|
|
1
|
200
|
July 25, 2023
|
Create a new model from scratch
|
|
0
|
97
|
July 25, 2023
|
Invalid key for dataset -- is this a bug with Trainers or with my code?
|
|
1
|
78
|
July 24, 2023
|
Finetune Donut with new tokenizer
|
|
5
|
624
|
July 24, 2023
|
Train loss is not decreasing on siamese model based on xlm-roberta
|
|
0
|
93
|
July 24, 2023
|
Sagemaker model parallelism- running the model results in Maximum recursion limit
|
|
7
|
109
|
July 24, 2023
|
DeepSpeed giving Assertion Error
|
|
2
|
402
|
July 22, 2023
|
Different outputs when using pipeline
|
|
2
|
663
|
July 20, 2023
|
Custom loss: does this word exist
|
|
0
|
76
|
July 20, 2023
|
I Fine-tuned a llama 7b on a custom dataset, The response from inference generation start good, then words start to connect with out space
|
|
4
|
689
|
July 19, 2023
|
Using alpaca with local embedding
|
|
1
|
783
|
July 19, 2023
|
Help with Tokenizer Word Length Limit
|
|
2
|
189
|
July 16, 2023
|