LLM fine-tune with domain specific pdf documents

br4tp1t · March 15, 2024, 7:20am

what i did to train my LLM on our documents, ive used GPT-4 API and wrote python code, to send text from document, and asked GPT to give me 20 questions to each document and the resposne was in json format with INPUT (question + doc text) and OUTPUT as answer. than ive finetuned my model with this data, and its working preety impressive, ive also added vector database where i store new documents and even on documents that are not in LLM is working very well.

Topic		Replies	Views
Fine tune LLMs on PDF Documents Models	34	35993	October 14, 2025
Fine tuning llm model Models	2	4517	May 16, 2024
Creating Own model for custom data Beginners	1	317	November 5, 2024
Generate dataset for fine tuning on PDF(s) 🤗Transformers	7	4056	August 3, 2025
Need Suggestion Research	2	247	April 19, 2024

LLM fine-tune with domain specific pdf documents

Related topics