Fine-Tuning Help for Personal Project

IlginH · March 28, 2025, 2:04pm

Hello there! First of all, i am a internt in a financial corp who works on a R&D project. My goal is that train an AI that can help customers and able to reasoning. It is basically classic app assistant but i need it to make it reasoning. I tried unsloth, grpo, lora and feed it with a data-set that i made with the app’s spesific button mapping. But it keep responding absurd answers. So if there are anyone that want to help, i can provide more information about process. Thank you in advance.

John6666 · March 28, 2025, 3:03pm

It seems that the know-how for Reasoning LLM training is also becoming quite well-established, and the course is about to be released.

There are many people on HF Discord who have know-how regarding LLM training, so if you have any specialized questions, it’s quicker to ask them there.

Topic		Replies	Views
How to transition from linguistic prompt engineering to NLP/ML/FT Beginners	1	586	November 1, 2024
Feeling lost when starting this course Beginners	1	44	June 18, 2025
Help this newbie Beginners	6	157	May 15, 2025
How to fine-tune a pretrained LLM on custom code libraries? Beginners	3	7429	April 26, 2025
Building a School AI: Encouraging Critical Thinking, Not Just Answers Models	2	38	April 3, 2025

Fine-Tuning Help for Personal Project

Related topics