Intermediate

Topic	Replies	Views	Activity
Tranformers Trainer API	0	63	November 25, 2024
Accelerate - WeightedRandomSampler Dataloader	1	250	June 18, 2024
Blank Responses	1	250	October 5, 2023
Vector search from text-image pairs : separate or common space?	0	353	July 11, 2023
Mismatched Tokenizer and LLM leading to odd evaluation result	0	353	May 18, 2023
How do GPT2 pretrained models allow custom hyperparams?	0	352	March 10, 2021
🔧 Optimizing Phi-4 MM Instruct Vision Model (ONNX Inference)	1	44	April 24, 2025
Howto train translation model WITHOUT VALIDATION data?	0	349	November 8, 2023
Working of MultipleChoiceModel	0	349	October 30, 2020
Model Performance and Sanity check	0	348	March 7, 2024
Gpt2 model training , Loss nan	0	347	July 10, 2023
Whisper fine-tuning without Seq2SeqTrainer	0	346	December 15, 2023
Why does increasing sequence length reduce Q&A performance on my test set?	0	347	August 30, 2021
Darshan Hiranandani : How to Create Datasets from PDF Files?	2	112	January 17, 2025
Fill mask bad_words_ids	0	343	October 20, 2022
DistillBERT pre-training for a new text corpus	0	342	April 29, 2021
Default parameters when querying models with TGI	0	342	April 23, 2024
HfHubHTTPError: 502 Server Error: Bad Gateway for url: https://api-inference.huggingface.co/models/HuggingFaceH4/zephyr-7b-beta	0	341	March 13, 2024
How to make a QA model generate full sentences	0	341	July 31, 2023
Hosted inference API - Limit output, text classification	0	339	July 13, 2023
Run portion of model during inference	0	339	January 27, 2022
EncoderDecoder LM output is perfect ... except that the ending is missing or duplicated	0	339	May 6, 2021
Having issues with my finetuned llama v2 model understanding instructions	0	336	September 3, 2023
QNLI on custom dataset using RoBERTa/BERT	0	335	May 20, 2021
Image Comparison Models for Line Drawings	0	334	September 1, 2023
Adapter support for summarisation task	0	334	October 28, 2022
Special tokens and inference	0	333	November 16, 2020
How Can I Understand the Exact Cost of My Inference API Requests?	2	111	April 16, 2025
Loss computed for single token in GPT-2	0	331	April 12, 2023
Tokenizer causes TRL completion data collator failure	0	330	March 3, 2024
StoppingCriteria - do not include the last triggering token	0	330	January 18, 2023
Back Translation Using T5	0	327	June 21, 2023
Horovod Image for USing multi server	0	326	April 4, 2023
Dreambooth Training not reading instance data	0	326	February 12, 2023
Runtime error Exit code: 1. Reason: Traceback (most recent call last): File "/home/user/ann/ann.nu" line 3 in modules Logs Build Container Application Startup at 2024-09-15 08:20:37 Traceback (most recent call last): File "/home/user/app/app.py",	1	230	September 15, 2024
Machine Translation using Hugging Face problem	0	323	May 8, 2023
MaskFormer Jagged Edges Issues of output masks	1	228	June 5, 2024
Refine BERT to pay more attention to key words	0	320	November 24, 2023
Vector DB - Exhaustive search in RAG	0	320	November 14, 2023
BERTology compute_heads_importance without zero grad	0	320	October 7, 2020
Lacking config attribute of model after finetuning	0	319	September 28, 2022
Custom BenchMark creation	5	73	February 2, 2025
Training Instruct-pix-2-pix with my own dataset: torch error	0	315	March 13, 2024
Pegasus fine-tuned model from pytorch to tensorflow	0	315	July 2, 2021
How to use tensorflow is a QACHAIN	0	314	August 25, 2023
Unable to lower to STABLEHLO hugging face ViT model	0	313	August 23, 2023
ValueError - number of spatial dimensions	0	312	January 19, 2023
Please explain how HF TFSequenceClassifier implements variable input length	0	312	August 21, 2022
Creating an Instruction-to-Code model for a custom library: Strategies and Guidelines?	0	311	May 23, 2023
Execute finetuned QA model in parallel	0	311	September 29, 2022