Total newbie here when it comes to ML etc.
I have a pdf with pages that look like this which I can export to jpegs:
I want to train my model to be able to get the:
- Question number
- The question linked to the number
- The number of marks linked to that question
- Any diagrams linked to the question
- Any answer spaces linked to the question
I’m having a go at using Label Studio to label the areas. Then train with Tensorflow? Is this correct first steps?
Once I labelled them, how do I know that it also extract or taking into account the actual text or content - not just ‘how it looks like’?
Extra: Once trained, how can I integrate my model into embeddings(?) so that I can use LLM (GPT etc) to query/chat bot etc?
Would appreciate any help or how you would approach this?
Many thanks in advance.