How to setup JSON based workflow/flowchart generation based on user prompt?
|
|
1
|
39
|
May 9, 2025
|
Build pretrained huggingface whisper for tensorrt-llm
|
|
0
|
311
|
May 20, 2024
|
Distillation code works on TPU?
|
|
0
|
310
|
October 19, 2020
|
Want to host a production level server for runnin llm for code generation
|
|
0
|
55
|
January 7, 2025
|
Tanglebox model runner / inferencing UI
|
|
0
|
309
|
April 14, 2023
|
Run_squad occasionally finds an answer to a question asked of a text fragment
|
|
0
|
308
|
September 9, 2020
|
Errors with Distributed Fine Tuning T5 for seq2seq on sagemaker
|
|
0
|
307
|
February 3, 2023
|
DPO with Chat Data
|
|
0
|
306
|
April 1, 2024
|
Accelerate socket timeout on multi-node LLM training
|
|
0
|
304
|
June 14, 2024
|
Get indices of image patches that are inside a bounding box
|
|
0
|
303
|
October 2, 2022
|
How to reproduce the performance of bert-large-uncased-whole-word-masking-finetuned-squad?
|
|
0
|
302
|
July 25, 2021
|
[text classification] different result format for inference API and inference endpoint
|
|
0
|
301
|
October 25, 2023
|
Create a new model from scratch
|
|
0
|
301
|
July 25, 2023
|
Which chunker to utilize for code based data
|
|
1
|
120
|
March 12, 2025
|
Fine Tuning Format/Structure for data for llma3.1 models
|
|
0
|
53
|
October 28, 2024
|
How to train an already finetuned LLM(LLama2)?
|
|
0
|
298
|
March 13, 2024
|
Grid Beam Search implementation
|
|
0
|
298
|
July 14, 2023
|
Pythia Tuning Question
|
|
0
|
298
|
June 14, 2023
|
Need some assistance understanding [Community Event] Doc Tests Sprint #16292
|
|
0
|
297
|
October 16, 2022
|
Difficulties encountered while adapting the SpeechT5 Fine-Tuning tutorial for Fon language data
|
|
0
|
296
|
October 27, 2023
|
Struggling in Cross lingual Summarization by mt5-base
|
|
0
|
296
|
October 11, 2023
|
Learning sets and disabling positional embedding knowledge?
|
|
0
|
295
|
May 10, 2023
|
Torchscript with Encoder-Decoder architecture
|
|
0
|
295
|
October 11, 2021
|
Error while generating more then one Beam output in T5
|
|
0
|
295
|
September 26, 2021
|
Hyper parameter tuning on Colab?
|
|
0
|
293
|
September 10, 2021
|
Character level attention with Longformer for sequence classification
|
|
0
|
293
|
February 25, 2021
|
Memory overhead/usage calculation
|
|
3
|
26
|
June 20, 2025
|
HF pipelines for simulating user-agent conversations
|
|
0
|
52
|
November 19, 2024
|
Error while training Mixtral in 8bit
|
|
0
|
292
|
January 16, 2024
|
How to get out of stagnant loss
|
|
7
|
58
|
February 21, 2025
|
Fine-tuning translator based on a single language
|
|
0
|
289
|
September 22, 2021
|
Is it possible to use BART model for question answering purpose which responses like a human like conversation
|
|
0
|
286
|
May 31, 2023
|
Manual download distilbert-base-uncased
|
|
1
|
202
|
November 6, 2023
|
SegformerFeatureExtractor not working as expected - Feature extractor not returning the label object
|
|
0
|
285
|
August 26, 2023
|
Issues after fine-tuning BLOOMZ-3b using peft library
|
|
0
|
285
|
July 1, 2023
|
What Model and approach should i use for my use case
|
|
2
|
164
|
May 20, 2024
|
How to load trials and analyze results on trainer.hyperparameter_search
|
|
0
|
283
|
January 3, 2023
|
Electra Question answering
|
|
0
|
283
|
January 12, 2021
|
OpenAI AI Assistant Alternative Using HuugingFace Models
|
|
0
|
281
|
December 7, 2023
|
How to obatin gradients on different GPUs to do custom accumulations
|
|
0
|
281
|
September 2, 2023
|
Do we need a new programming language optimized for AI to write code?
|
|
3
|
79
|
June 6, 2025
|
Autogen AI Issues
|
|
0
|
280
|
January 19, 2024
|
How to change the label names in Hosted Inference API results
|
|
0
|
280
|
September 5, 2023
|
Parallelizing inputs to ONNX model
|
|
0
|
280
|
March 31, 2023
|
InstructBLIP number of parameters
|
|
0
|
277
|
August 18, 2023
|
How to stop LLM from going up to the max token limit?
|
|
1
|
110
|
September 25, 2024
|
Position Embedding error in HuggingFace
|
|
1
|
195
|
September 21, 2023
|
Requirements Llama2
|
|
0
|
275
|
April 13, 2024
|
Weight and shape different than the number of channels in input
|
|
0
|
275
|
April 4, 2024
|
Is it reasonable to add controlent to instruct pix2pix?
|
|
0
|
275
|
November 2, 2023
|