New to NLP / transformers - tried some examples and it is awesome. Love it! great work.
I am trying to create a Q&A system - to answer questions from a corpus of pdf documents in English.
questions where i need help to correct my understanding -
any example of fine tuning a pre-trained model on your own custom data set from PDF documents available? My understanding is i need to use a pretrained model for QA and then fine tune it with my own questions and answers from my corpus to increase the model accuracy.
need more info on pipelines - specially text2text-generation. how do i see what models and parameters does it use behind the abstraction? in python how do i access the model metadata being used when i use pipelines. I love the abstraction but would also like control on tweaking the parameters being used by the pipeline.
whats the best way to save the models in the cloud so that they can be pointed to for inference instead of getting downloaded?
again - awesome work!