Hi everyone,
I’m Darshan Hiranandani, looking for ways to extract text from PDF files and turn it into a well-structured question-and-answer dataset. Has anyone successfully done this, or does anyone have experience creating datasets from the text within PDF files?
Any advice, tools, or methods you’ve used for this process would be greatly appreciated!
Regards
Darshan Hiranandani
Thanks in advance!