Fundamental newbie questions

altafr · December 6, 2020, 7:03am

New to NLP / transformers - tried some examples and it is awesome. Love it! great work.

I am trying to create a Q&A system - to answer questions from a corpus of pdf documents in English.

questions where i need help to correct my understanding -

any example of fine tuning a pre-trained model on your own custom data set from PDF documents available? My understanding is i need to use a pretrained model for QA and then fine tune it with my own questions and answers from my corpus to increase the model accuracy.
need more info on pipelines - specially text2text-generation. how do i see what models and parameters does it use behind the abstraction? in python how do i access the model metadata being used when i use pipelines. I love the abstraction but would also like control on tweaking the parameters being used by the pipeline.
whats the best way to save the models in the cloud so that they can be pointed to for inference instead of getting downloaded?

again - awesome work!

sgugger · December 6, 2020, 2:19pm

For 1, your use case does seem a little specific, so there is no example of that exactly in our examples. Otherwise all our examples are in the examples folder of the repo and there is a tutorial on how to fine-tune your model on a custom dataset in the documentation.

For 2, if you need more control, you should directly use the tokenizer/model and not the pipeline API. The task summary tutorial shows examples of both ways on most tasks supported by the library.

For 3, you can upload models to our model hub. There is a (paying) inference API to use them directly without downloads.

Topic		Replies	Views
Issues with Finetuning QuestionAnswer model Beginners	0	363	May 27, 2021
Preparing datasets for NLP tasks 🤗Datasets	1	547	July 28, 2021
How can we customize pipeline? 🤗Transformers	5	743	January 19, 2021
Https://huggingface.co/allenai/longformer-large-4096-finetuned-triviaqa Model cards	0	1143	March 28, 2022
[RFC] Transformers Pipeline v2 🤗Transformers	4	1860	October 14, 2020

Fundamental newbie questions

Related topics