How to fine-tune LUKE for NER?

Hello. I am wondering if I can fine-tune LUKE on my own NER dataset. I am aware that LUKE uses a unique model architecture, so the code in the example notebook is off the table. I know that Studio Ousia provides fine-tuning code at GitHub - studio-ousia/luke: LUKE -- Language Understanding with Knowledge-based Embeddings, but if I go that route, I am wondering whether I can convert the resulting fine-tuned model to be transformers-compatible.
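For context: LUKE is supported natively in transformers (LukeTokenizer, LukeForEntitySpanClassification, etc.), and Studio Ousia publishes a CoNLL-2003 fine-tuned checkpoint on the Hub, so a transformers-compatible model can be used directly for span-level NER. A minimal inference sketch (the example sentence and spans are just illustrative):

```python
from transformers import LukeTokenizer, LukeForEntitySpanClassification

tokenizer = LukeTokenizer.from_pretrained("studio-ousia/luke-large-finetuned-conll-2003")
model = LukeForEntitySpanClassification.from_pretrained("studio-ousia/luke-large-finetuned-conll-2003")

text = "Beyoncé lives in Los Angeles"
# Character-level (start, end) spans of the candidate entities
entity_spans = [(0, 7), (17, 28)]

inputs = tokenizer(text, entity_spans=entity_spans, return_tensors="pt")
logits = model(**inputs).logits
predicted = logits.argmax(-1).squeeze().tolist()
for (start, end), idx in zip(entity_spans, predicted):
    print(text[start:end], "->", model.config.id2label[idx])
```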

Hi Kerenza,

Were you able to make any progress on fine-tuning LukeForEntitySpanClassification for custom labels? I am also looking to fine-tune LUKE for an NER task with multi-token entities. Any help is much appreciated.
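For anyone landing here later, here is a rough sketch of what fine-tuning LukeForEntitySpanClassification with a custom label set can look like (the label names, example sentence, and spans below are made up for illustration; plug the loss into your own optimizer loop or the example script linked further down):

```python
import torch
from transformers import LukeTokenizer, LukeForEntitySpanClassification

# Hypothetical custom label set ("O" = non-entity span)
labels = ["O", "PER", "ORG", "LOC"]
model = LukeForEntitySpanClassification.from_pretrained(
    "studio-ousia/luke-base",
    num_labels=len(labels),
    id2label=dict(enumerate(labels)),
    label2id={l: i for i, l in enumerate(labels)},
)
tokenizer = LukeTokenizer.from_pretrained("studio-ousia/luke-base")

text = "Jane Doe joined Acme Corp in Berlin."
# Character-level spans; multi-token entities are simply wider spans
entity_spans = [(0, 8), (16, 25), (29, 35)]
span_labels = torch.tensor([[labels.index("PER"), labels.index("ORG"), labels.index("LOC")]])

inputs = tokenizer(text, entity_spans=entity_spans, return_tensors="pt")
outputs = model(**inputs, labels=span_labels)
outputs.loss.backward()  # then step your optimizer as usual
```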

Thanks

Hi, read through the README at GitHub - studio-ousia/luke: LUKE -- Language Understanding with Knowledge-based Embeddings

Hi,

We now have an example script that illustrates how to fine-tune LUKE for NER (and other token classification tasks): transformers/examples/research_projects/luke at master · huggingface/transformers · GitHub
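The key idea in that setup is that NER is cast as classifying every candidate word span (up to some maximum width) as either an entity type or a non-entity label, which is why multi-token entities need no special handling. A simplified illustration of the span enumeration, not the script itself (the max width of 16 is just an illustrative default):

```python
def enumerate_candidate_spans(words, max_span_words=16):
    """Enumerate character-level (start, end) spans for every contiguous
    run of up to `max_span_words` words; each span is then classified
    (entity type or non-entity) by LukeForEntitySpanClassification."""
    text, word_starts = "", []
    for word in words:
        if text:
            text += " "
        word_starts.append(len(text))
        text += word

    spans = []
    for i in range(len(words)):
        for j in range(i, min(i + max_span_words, len(words))):
            spans.append((word_starts[i], word_starts[j] + len(words[j])))
    return text, spans

text, spans = enumerate_candidate_spans(["John", "Smith", "works", "at", "Acme", "Corp"])
# spans now covers "John", "John Smith", ..., "Acme", "Acme Corp", etc.
```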

Hi @nielsr, I have been trying to run the script, but it fails on the conll2003 dataset during tokenization. Am I doing something wrong? Are there any specific versions I should use?

Hi @nielsr, thanks for sharing the script. I was able to run training on the conll2003 dataset with small modifications. However, the performance is extremely low: {'precision': 0.8983516483516484, 'recall': 0.40541976620616366, 'f1': 0.5587014888943129, 'accuracy': 0.8970917690009956}

The dataset should not be an issue, as I am using the standard conll2003 dataset. I also tried the exact hyperparameters mentioned in the paper, but with no luck. Here is the exact command I ran:

python run_luke_ner_no_trainer.py \
  --model_name_or_path studio-ousia/luke-base \
  --dataset_name conll2003 \
  --task_name $TASK_NAME \
  --max_length 128 \
  --per_device_train_batch_size 8 \
  --learning_rate 1e-5 \
  --num_train_epochs 5 \
  --output_dir ./$TASK_NAME/

Any suggestions on how to match the performance reported in the paper would be highly appreciated.

Thanks in advance!
