Training for langgraph agent

chgrdj · July 11, 2024, 2:54pm

Hello everyone,

I wanna use a LLM to create an agent in langgraph with this kind of architecture:

The idea is similar to a React agent where the model has to provide a prompt for my terminal tool and observe if it achieve a given objective.
I have a question : how should i train such model?
Could i use DPO/ORPO procedure to align my model with multi-step context ?
Or is there a smarter way to do that?
Thanks

Topic		Replies	Views
Create your LLM model Beginners	1	1924	December 9, 2024
Best LLM to pretrain? 🤗Transformers	0	834	February 29, 2024
Create my LLM model Beginners	1	1589	April 1, 2024
Using LLM for Data Analytics Beginners	1	1297	June 7, 2025
Training LLM using local application Community Calls	2	1113	May 1, 2024

Training for langgraph agent

Related topics