Hugging Face Forums

Topic	Replies	Views	Activity
What does the `use_cache` in `generate` actually do? 🤗Transformers	1	2181	May 9, 2024
AWD-LSTM beats finetuned BERT as train ds decreases?! :person_shrugging:t4: 🤗Transformers	2	124	May 9, 2024
How to count how many forward passes were done in model.generate when using assistant_model 🤗Transformers	0	81	May 9, 2024
SavedModel file does not exist at: Models	0	426	May 9, 2024
Encode token without spaced between them 🤗Tokenizers	0	143	May 9, 2024
Example DeTr Object Detectors not predicting after fine tuning Beginners	6	1336	May 9, 2024
How to get log probs if we already have a generation? Beginners	1	491	May 9, 2024
How to pass multiple datasets into Trainer for Knowledge distillation in NMT 🤗Transformers	3	331	May 9, 2024
RoBERTa large: HF vs. FAIRseq Models	1	206	May 9, 2024
Multimodal Transformers with signal inputs Beginners	0	88	May 9, 2024
Llama3 8b instruct not answering question Amazon SageMaker	6	435	May 9, 2024
Deploying Fine-tune LLama3 🤗AutoTrain	0	270	May 9, 2024
Trainer doesn't show the loss at each step 🤗Transformers	20	34840	May 9, 2024
Lazy model initialization 🤗Transformers	3	899	May 8, 2024
Using Fine-Grained Access Tokens for Inference Endpoints Inference Endpoints on the Hub	0	420	May 8, 2024
Seperating Paragraphs in Text File Based on Topics for Zero-Shot Classification Beginners	1	213	May 8, 2024
Memory Requirements for Running LLM Beginners	2	7001	May 8, 2024
Batch size limit 32 Beginners	2	1135	May 8, 2024
Storing and restoring GPT-J model Beginners	3	783	May 8, 2024
Getting zero gradients for image patch embeddings when implementing GRADCAM for ViLT 🤗Transformers	0	91	May 8, 2024
ONNX T5 - Decoding seq2seq tokens 🤗Tokenizers	1	490	May 8, 2024
Deploying Llama2 7B fine tuned model on inf2.xlarge Amazon Inferentia & Trainium	0	191	May 8, 2024
Input to reshape is a tensor with 3763200 values, but the requested shape requires a multiple of 20384 🤗Transformers	0	84	May 8, 2024
Problem with data collator Beginners	1	222	May 8, 2024
Llama2 tools instruction wierd reponse Intermediate	2	152	May 8, 2024
Having multiple candidate labels in a zero shot classification model 🤗Transformers	3	561	May 8, 2024
Why eval_accumulation_steps takes so much memory 🤗Transformers	5	1300	May 8, 2024
Add metrics to object detection example 🤗Transformers	12	3771	May 8, 2024
Need help in fine-tuning T5-Base Model for a sequence task Beginners	0	164	May 8, 2024
Unisloth 4-bit Llama models acting weirdly when used in a Function Beginners	0	164	May 8, 2024