Intermediate

Topic	Replies	Views	Activity
Does higher work with huggingface (hugging face, HF) models? e.g. ViT?	1	351	March 19, 2023
Fine-tuning LLM model for E-commerce Chatbot recomendation	0	1450	March 17, 2023
Inference Endpoint - Simultaneous Generations taking a long time	0	243	March 14, 2023
FAQ question generation and answering using few shot learning	1	1145	March 14, 2023
Custom bert embedding cause "RuntimeError: Trying to backward through the graph a second time"	0	952	March 10, 2023
HF Dataset as a Replay Buffer for RL applications	6	486	March 9, 2023
Write user-inputted data from app to csv in space directory	0	315	March 7, 2023
Multi-GPU support lost when overwriting functions for Custom Trainer	1	650	March 5, 2023
Combining tokenizer.decode and model.generate scores for probability prediction	2	1352	March 1, 2023
How to correct TypeError: zip argument #1 must support iteration training in multiple GPU	1	894	February 28, 2023
Example of hyper-parameter search of fine tuned fill mask model	0	215	February 27, 2023
Ensemble Learning with various BERT models	1	1586	February 25, 2023
How to fine-tune to 3 very different sized datasets (very large to very small)	0	791	February 24, 2023
Padding to the left of the inputs, GPT2LMHeadModel gives different answer	2	1300	February 21, 2023
Trainer "load_best_model_at_end" doesn't load the best model	0	2592	February 21, 2023
T5: why do we have more tokens expressed via cross attentions than the decoded sequence?	1	387	February 21, 2023
How to set input to validate of T5 Model	1	479	February 21, 2023
How to implement Key Query Layer Normalized Transformers/LLMs in Huggingface?	0	1265	February 18, 2023
Error saving quantized model	4	3955	February 16, 2023
Code example of getting cross attention from T5?	0	366	February 15, 2023
Mlflow with Hugging Face	0	645	February 14, 2023
Dreambooth Training not reading instance data	0	327	February 12, 2023
Create speech to text training dataset using text to speech model	0	406	February 8, 2023
Fine-tuning Zero-shot models	4	6355	February 7, 2023
Extract most important words from model	3	3047	February 6, 2023
QA model with human like answers	1	499	February 4, 2023
Errors with Distributed Fine Tuning T5 for seq2seq on sagemaker	0	307	February 3, 2023
Ways to incentivize recall over precision	0	223	February 2, 2023
Resuming accelerate-based pretraining with different batch size	0	772	January 31, 2023
How to load my own pretrained model to huggingface code	1	865	January 31, 2023