How to use a LLM for specific task

Hello,
For example I want my LLM learn a pdf file.
Its good to send pdf text for it or finetunning?
if I want to my llm send response in a specific format, Its good to use system-instructions or fine tune?
Can you give me a guide or some links about it?

1 Like

If you want to treat a PDF as text, you can simply use a Python library to extract the text data, clean it up, and use it for fine-tuning.

On the other hand, if you want to treat PDFs as images that contain both text and layout, it becomes more complicated, and it is more in the realm of VLM or multimodal models than LLM. In this case, you can either convert the PDF to an image first, or use a more complicated method.

Also, if you want to have a chatbot accurately interpret PDFs, it is probably easier in the end to use a system called RAG. Find a method that seems to fit your use case. I think it’s a good idea to try out various finished products in Spaces first.

PDF (RAG / LLM / VLM, …) Spaces

PDF extraction tools

about RAG

VLM

1 Like

This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.