Finetuned model of Codellam
|
|
0
|
229
|
September 21, 2023
|
How to use set_transform when map becomes unfeasible?
|
|
2
|
132
|
June 19, 2024
|
falcon-40B inference on older version of torch
|
|
0
|
228
|
June 27, 2023
|
Hugging Face Evaluator
|
|
1
|
161
|
December 26, 2023
|
Finetuning Scibert and encountering ValueError
|
|
0
|
225
|
September 8, 2023
|
A specific documents AI API for Hugging Face?
|
|
0
|
225
|
May 12, 2023
|
InformationRetrievalEvaluator with training semantic search model
|
|
0
|
222
|
September 6, 2024
|
Ways to incentivize recall over precision
|
|
0
|
222
|
February 2, 2023
|
Teams Transcript Summarisation
|
|
0
|
220
|
April 17, 2023
|
Alternating between batches of different datasets
|
|
0
|
220
|
February 8, 2024
|
Size Mismatch when loading Lora Adapter for Phi3
|
|
0
|
216
|
July 30, 2024
|
Audio classification VS Transcribing and using classifier
|
|
0
|
215
|
December 29, 2023
|
Predictions format sent to compute_metrics depends on model used
|
|
0
|
214
|
December 4, 2023
|
Example of hyper-parameter search of fine tuned fill mask model
|
|
0
|
214
|
February 27, 2023
|
Export to Onnx and run inference Bigbirdpegasus summariser
|
|
0
|
213
|
January 24, 2023
|
Converting HFT Checkpoint to CoreML
|
|
0
|
212
|
November 20, 2023
|
Sockpuppet detector based on NLP: where to start?
|
|
0
|
212
|
May 21, 2023
|
Smolagent - handle multiuser chat
|
|
1
|
84
|
April 1, 2025
|
Cloning dataset in MLM training
|
|
0
|
211
|
April 13, 2023
|
Training "don't know" and "don't understand" responses
|
|
0
|
210
|
May 31, 2023
|
Correct configuration to train Mask2Former on Amazon Sagemaker multi GPU ml.p4d.24xlarge instance
|
|
2
|
68
|
March 24, 2025
|
OSError: dggokul21/Testcase_Generator does not appear to have a file named pytorch_model.bin, tf_model.h5, model.ckpt or flax_model.msgpack
|
|
0
|
209
|
February 21, 2024
|
Trainer for MT with source and target tokenizers
|
|
0
|
209
|
June 9, 2023
|
Training large language models to consider two texts to generate output text
|
|
0
|
208
|
April 26, 2023
|
Issues after finetuning BLOOMZ-3b with peft library
|
|
0
|
207
|
July 1, 2023
|
Checking if two column have the language i want
|
|
1
|
26
|
May 1, 2025
|
Help with Quantizing phi-4 MM Fine-Tuned Vision Model and Converting to ONNX
|
|
3
|
60
|
May 2, 2025
|
How to setup mt5-xxl server with a checkpoint?
|
|
0
|
205
|
November 3, 2023
|
How to finetune any transformer custom layers using tf
|
|
0
|
204
|
January 27, 2024
|
Multiple Classification Heads (For two tier labelling)
|
|
0
|
204
|
January 10, 2024
|
Understanding How GPT Models Differentiate Between Questions and Instructions in API Usage
|
|
1
|
81
|
November 25, 2024
|
How do I backpropagate specific output tokens using Trainer?
|
|
0
|
36
|
December 25, 2024
|
AutoModel Classifier distilBERT on Parallel GPUs
|
|
0
|
36
|
November 13, 2024
|
What is the correct way to parse data for DPO? Do you seperate out prompt or not?
|
|
0
|
202
|
May 19, 2024
|
I need a hint on how to start developing a new `.ipynb` project for Jupyter Notebook on Time Series with a specific demands
|
|
0
|
202
|
September 3, 2023
|
Custom loss: does this word exist
|
|
0
|
200
|
July 20, 2023
|
Remove PE/Encoder on BartModel
|
|
0
|
198
|
December 18, 2023
|
What's a **fair** way to compute similarities for Contrastive Learning?
|
|
0
|
196
|
February 18, 2024
|
Error when loading weights
|
|
0
|
196
|
September 21, 2023
|
CUDA issue after a few hours
|
|
0
|
195
|
October 5, 2023
|
Timm & HuggingFace
|
|
0
|
194
|
May 16, 2023
|
GradioUI + Smolagents + MCP "Event loop is closed"
|
|
1
|
79
|
May 16, 2025
|
New Model Architecture
|
|
0
|
193
|
November 16, 2023
|
Uploading a heavy dataset to Jean-Zay
|
|
3
|
54
|
February 17, 2025
|
Question regarding multiple prompt-tuning
|
|
0
|
192
|
March 19, 2024
|
How do you know whether the model is merged and uploaded?
|
|
0
|
34
|
December 20, 2024
|
Should i use LLama-2 for text summerization?
|
|
0
|
191
|
April 18, 2024
|
No accuracy of model in autotrain
|
|
0
|
190
|
December 12, 2023
|
In your experience, which Linux distro version generally has the fewest compatibility issues?
|
|
0
|
189
|
November 4, 2023
|
How to Deploy an Vision Language model in azure?
|
|
1
|
76
|
April 3, 2025
|