Image Comparison Models for Line Drawings
|
|
0
|
336
|
September 1, 2023
|
How to understand the answer_start parameter of Squad dataset for training BERT-QA model + practical implications for creating custom dataset?
|
|
1
|
1009
|
September 1, 2023
|
Accelerate: 'RobertaModel' object has no attribute 'roberta'
|
|
1
|
506
|
August 29, 2023
|
SegformerFeatureExtractor not working as expected - Feature extractor not returning the label object
|
|
0
|
288
|
August 26, 2023
|
How to ensuring a new instance of a Language Model (LLM) agent is created or simply specific function executed with every refresh of a web application, as demonstrated in the provided Python code
|
|
0
|
480
|
August 26, 2023
|
How to use tensorflow is a QACHAIN
|
|
0
|
317
|
August 25, 2023
|
Using Tensorboard SummaryWriter with HuggingFace TrainerAPI
|
|
4
|
11495
|
August 24, 2023
|
Showing the data type of model files
|
|
0
|
244
|
August 23, 2023
|
Unable to lower to STABLEHLO hugging face ViT model
|
|
0
|
316
|
August 23, 2023
|
Explanation of the default "auto" values for DeepSpeed stage 3?
|
|
1
|
467
|
August 22, 2023
|
Generate low contrast images after training instruct pix2pix
|
|
0
|
261
|
August 16, 2023
|
Modify network architecture from default model
|
|
0
|
245
|
August 20, 2023
|
TPU Out of memory in Pix2Struct ForConditionalGeneration model
|
|
0
|
248
|
August 13, 2023
|
Add_faiss_index with multiple columns
|
|
0
|
743
|
August 19, 2023
|
InstructBLIP number of parameters
|
|
0
|
279
|
August 18, 2023
|
Accessing model from a callback to predict between epochs
|
|
1
|
1496
|
August 17, 2023
|
Blip2 with a new LLM
|
|
0
|
804
|
August 15, 2023
|
Train loss goes to zero after some epochs
|
|
0
|
283
|
August 11, 2023
|
Past_key_value with multiple new tokens
|
|
1
|
1370
|
August 10, 2023
|
CUDA OOM. Is it possible to distribute the usage of memory across 2gpu evenly?
|
|
1
|
329
|
August 9, 2023
|
TypeError: Repository.__init__() got an unexpected keyword argument 'token'
|
|
8
|
14706
|
August 9, 2023
|
Train instruct pix2pix task with dreambooth
|
|
0
|
257
|
August 6, 2023
|
Regenerate Prompt tuning result with appended prompt on base model
|
|
0
|
888
|
August 6, 2023
|
Multi-Task dataset with Custom Sampler and Sharding
|
|
4
|
1379
|
August 1, 2023
|
DocVQA for Recognizing Page Numbers in Older Text
|
|
0
|
125
|
August 1, 2023
|
How to make a QA model generate full sentences
|
|
0
|
342
|
July 31, 2023
|
How can I use evaluate's perplexity metric on a model that's already loaded?
|
|
0
|
1684
|
July 28, 2023
|
Out of memory training 3B param model on 8 GPU (320GB memory) with FSDP
|
|
1
|
1705
|
July 28, 2023
|
What is the official way to run a wandb sweep with hugging face (HF) transformers?
|
|
2
|
2048
|
July 25, 2023
|
Create a new model from scratch
|
|
0
|
302
|
July 25, 2023
|