Intermediate

Topic	Replies	Views	Activity
How to setup JSON based workflow/flowchart generation based on user prompt?	1	39	May 9, 2025
Build pretrained huggingface whisper for tensorrt-llm	0	311	May 20, 2024
Distillation code works on TPU?	0	310	October 19, 2020
Want to host a production level server for runnin llm for code generation	0	55	January 7, 2025
Tanglebox model runner / inferencing UI	0	309	April 14, 2023
Run_squad occasionally finds an answer to a question asked of a text fragment	0	308	September 9, 2020
Errors with Distributed Fine Tuning T5 for seq2seq on sagemaker	0	307	February 3, 2023
DPO with Chat Data	0	306	April 1, 2024
Accelerate socket timeout on multi-node LLM training	0	304	June 14, 2024
Get indices of image patches that are inside a bounding box	0	303	October 2, 2022
How to reproduce the performance of bert-large-uncased-whole-word-masking-finetuned-squad?	0	302	July 25, 2021
[text classification] different result format for inference API and inference endpoint	0	301	October 25, 2023
Create a new model from scratch	0	301	July 25, 2023
Which chunker to utilize for code based data	1	120	March 12, 2025
Fine Tuning Format/Structure for data for llma3.1 models	0	53	October 28, 2024
How to train an already finetuned LLM(LLama2)?	0	298	March 13, 2024
Grid Beam Search implementation	0	298	July 14, 2023
Pythia Tuning Question	0	298	June 14, 2023
Need some assistance understanding [Community Event] Doc Tests Sprint #16292	0	297	October 16, 2022
Difficulties encountered while adapting the SpeechT5 Fine-Tuning tutorial for Fon language data	0	296	October 27, 2023
Struggling in Cross lingual Summarization by mt5-base	0	296	October 11, 2023
Learning sets and disabling positional embedding knowledge?	0	295	May 10, 2023
Torchscript with Encoder-Decoder architecture	0	295	October 11, 2021
Error while generating more then one Beam output in T5	0	295	September 26, 2021
Hyper parameter tuning on Colab?	0	293	September 10, 2021
Character level attention with Longformer for sequence classification	0	293	February 25, 2021
Memory overhead/usage calculation	3	26	June 20, 2025
HF pipelines for simulating user-agent conversations	0	52	November 19, 2024
Error while training Mixtral in 8bit	0	292	January 16, 2024
How to get out of stagnant loss	7	58	February 21, 2025
Fine-tuning translator based on a single language	0	289	September 22, 2021
Is it possible to use BART model for question answering purpose which responses like a human like conversation	0	286	May 31, 2023
Manual download distilbert-base-uncased	1	202	November 6, 2023
SegformerFeatureExtractor not working as expected - Feature extractor not returning the label object	0	285	August 26, 2023
Issues after fine-tuning BLOOMZ-3b using peft library	0	285	July 1, 2023
What Model and approach should i use for my use case	2	164	May 20, 2024
How to load trials and analyze results on trainer.hyperparameter_search	0	283	January 3, 2023
Electra Question answering	0	283	January 12, 2021
OpenAI AI Assistant Alternative Using HuugingFace Models	0	281	December 7, 2023
How to obatin gradients on different GPUs to do custom accumulations	0	281	September 2, 2023
Do we need a new programming language optimized for AI to write code?	3	79	June 6, 2025
Autogen AI Issues	0	280	January 19, 2024
How to change the label names in Hosted Inference API results	0	280	September 5, 2023
Parallelizing inputs to ONNX model	0	280	March 31, 2023
InstructBLIP number of parameters	0	277	August 18, 2023
How to stop LLM from going up to the max token limit?	1	110	September 25, 2024
Position Embedding error in HuggingFace	1	195	September 21, 2023
Requirements Llama2	0	275	April 13, 2024
Weight and shape different than the number of channels in input	0	275	April 4, 2024
Is it reasonable to add controlent to instruct pix2pix?	0	275	November 2, 2023