I have collected a lot of Pashto data for my research, but all of it is in PDF format, and there is no OCR for the Pashto language that could convert it to .txt format.
Can I load a PDF dataset using the Hugging Face datasets library?
Any solution would be appreciated.