I need train the model for my own domain. The data format is text and I can insert it has pdf. Is there any guideline to do that. I trained some model using internet. But they are not giving good result. Simply I need to create a bot that is specific for my own domain. This is for learning purpose. So, I need open source
1 Like
Could it be a case similar to these?
We are currently seeking assistance in fine-tuning the Mistral model using approximately 48 PDF documents. Specifically, our challenge lies in training the model using peft and preparing the documents for optimal fine-tuning. We are facing difficulties in locating suitable resources for this task, and we are also uncertain about the proper procedures for document preparation, storage, and supply.
If anyone within the community has expertise in this area or can provide guidance on the aforementi…
Total newbie here when it comes to ML etc.
I have a pdf with pages that look like this which I can export to jpegs:
[IMG_2688]
I want to train my model to be able to get the:
Question number
The question linked to the number
The number of marks linked to that question
Any diagrams linked to the question
Any answer spaces linked to the question
I’m having a go at using Label Studio to label the areas. Then train with Tensorflow? Is this correct first steps?
Once I labelled them, how do I …