Need Suggestion

khurramnaseem · April 13, 2024, 3:06pm

Hi all,

I’m new to this LLM’s world and I need suggestions on the following idea. I want to fine-tune a LLM model based on exam past papers, which are currently available on PDF format.

Objective: The model understand and explain past answers and able to generate new questions and answers.

Which LLM model I should select for this purpose? Keep in mind I want to start from very basic as I mentioned earlier I’m very novice.
The past papers are in PDF format with pictures as well, Do I need to convert them on some specific format like JSON?

PervaizKhan · April 18, 2024, 10:19pm

Hi,

You may look at huggingface llms leaderboard (search on google for exact link), and try to pick the latest models for your tasks.
For pdf, I am not sure, but I guess you may convert them to text format some , maybe in csv format or json etc.
For finetuning llm, please look at the topics such parameter efficient finetunig (PEFT).

I am nit exactly sure but you may also look at “vision llm” that works on images and text.
I hope this helps.
Good luck.

khurramnaseem · April 19, 2024, 11:45am

Thank you @PervaizKhan for your reply.

Topic		Replies	Views
Fine tune LLMs on PDF Documents Models	29	32242	March 3, 2025
Fine tuning llm model Models	2	4416	May 16, 2024
LLM fine-tune with domain specific pdf documents Models	20	25040	November 5, 2024
Generate dataset for fine tuning on PDF(s) 🤗Transformers	6	3386	September 3, 2024
How to use a LLM for specific task Beginners	2	93	March 14, 2025

Need Suggestion

Related topics