How to get a model on patent data for question answering

seunghon · October 15, 2021, 2:25am

Dear list,

I want to have a question answering model for US patent text. For example, I want to ask it to read a patent’s text and ask questions such as ‘what is the specific problem to solve in this text?’. I tried with some general question answering models such as ‘distilbert-base-cased-distilled-squad’ but the answers were not satisfactory.

Now I am considering if I can get a better model through fine-tuning the model with patent data. So, I wonder if this is the right approach and if it is, then how can I fine-tune a model with patent data so that I can get more satisfactory answers?

Thanks in advance.

rosenjcb · October 15, 2021, 6:21am

You’re going to have to finetune as you said, luckily you can finetune Squad pretty easily. See here:

In a Python Notebook, import your data into a Pandas dataframe and export the table so that it matches the schema of the Squad dataset (see https://huggingface.co/datasets/viewer/?dataset=squad ). In this case, you need 5 fields in your exported file: id, title, context, question and answers.

Once you’ve formatted your data to the schema and exported the JSON/CSV locally, run the run_qa.py file and pass the train and test/validation files like so;

python run_qa.py \
  --model_name_or_path bert-base-uncased \
  --train_file=train-v1.1.json \
  --validation_file=dev-v1.1.json

And of course pass any other (hyper)parameters that you have for your finetuning task.

Topic		Replies	Views
Evaluate question answering with squad dataset Beginners	2	1308	October 10, 2021
Fine tuning llm model Models	2	4412	May 16, 2024
How to identity a QA model and fine tune it with custom data? Beginners	0	98	April 12, 2024
How to prepare dataset using patent pdf? 🤗Datasets	0	11	January 29, 2025
How to get answer with RobertaForQuestionAnswering Models	1	1073	October 26, 2021

How to get a model on patent data for question answering

Related topics