Intermediate

Topic	Replies	Views	Activity
Image Comparison Models for Line Drawings	0	336	September 1, 2023
How to understand the answer_start parameter of Squad dataset for training BERT-QA model + practical implications for creating custom dataset?	1	1009	September 1, 2023
Accelerate: 'RobertaModel' object has no attribute 'roberta'	1	506	August 29, 2023
SegformerFeatureExtractor not working as expected - Feature extractor not returning the label object	0	288	August 26, 2023
How to ensuring a new instance of a Language Model (LLM) agent is created or simply specific function executed with every refresh of a web application, as demonstrated in the provided Python code	0	480	August 26, 2023
How to use tensorflow is a QACHAIN	0	317	August 25, 2023
Using Tensorboard SummaryWriter with HuggingFace TrainerAPI	4	11495	August 24, 2023
Showing the data type of model files	0	244	August 23, 2023
Unable to lower to STABLEHLO hugging face ViT model	0	316	August 23, 2023
Explanation of the default "auto" values for DeepSpeed stage 3?	1	467	August 22, 2023
Generate low contrast images after training instruct pix2pix	0	261	August 16, 2023
Modify network architecture from default model	0	245	August 20, 2023
TPU Out of memory in Pix2Struct ForConditionalGeneration model	0	248	August 13, 2023
Add_faiss_index with multiple columns	0	743	August 19, 2023
InstructBLIP number of parameters	0	279	August 18, 2023
Accessing model from a callback to predict between epochs	1	1496	August 17, 2023
Blip2 with a new LLM	0	804	August 15, 2023
Train loss goes to zero after some epochs	0	283	August 11, 2023
Past_key_value with multiple new tokens	1	1370	August 10, 2023
CUDA OOM. Is it possible to distribute the usage of memory across 2gpu evenly?	1	329	August 9, 2023
TypeError: Repository.__init__() got an unexpected keyword argument 'token'	8	14706	August 9, 2023
Train instruct pix2pix task with dreambooth	0	257	August 6, 2023
Regenerate Prompt tuning result with appended prompt on base model	0	888	August 6, 2023
Multi-Task dataset with Custom Sampler and Sharding	4	1379	August 1, 2023
DocVQA for Recognizing Page Numbers in Older Text	0	125	August 1, 2023
How to make a QA model generate full sentences	0	342	July 31, 2023
How can I use evaluate's perplexity metric on a model that's already loaded?	0	1684	July 28, 2023
Out of memory training 3B param model on 8 GPU (320GB memory) with FSDP	1	1705	July 28, 2023
What is the official way to run a wandb sweep with hugging face (HF) transformers?	2	2048	July 25, 2023
Create a new model from scratch	0	302	July 25, 2023