Autotrain NER models

wineguru · December 22, 2022, 10:51am

Hi, I have a fairly large NER dataset (about 12 million rows) and I am looking for an economical way to train a model on it. Hugging Face’s AutoTrain looks like it could help, but it appears that the training set needs to be uploaded in CSV format rather than as a dataset that has already been pre-processed (i.e., tokenized, aligned, etc.).

In general, is there a way to use Hugging Face’s AutoTrain for NER models? Can I use my already pre-processed dataset? If not, what is the correct format to provide it in, for example as CSV?

abhishek · December 22, 2022, 11:03am

Hi,

AutoTrain does support processed datasets if they are on the Hub. However, this is a fairly large dataset, which might require a custom deployment. I suggest mailing us at autotrain@hf.co so we can discuss the details.
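
For readers wondering what getting a processed dataset onto the Hub could look like, here is a minimal sketch and not an official AutoTrain recipe. It writes token/tag pairs to a JSON Lines file, and a second helper pushes that file to the Hub with the `datasets` library. The column names `tokens` and `ner_tags`, the helper names, and the repo id passed to the upload helper are all assumptions for illustration; the thread does not say which columns AutoTrain expects.

```python
import json
import tempfile
from pathlib import Path


def write_ner_jsonl(examples, path):
    """Write (tokens, tags) pairs as one JSON object per line.

    The column names "tokens" and "ner_tags" are assumed, not confirmed
    by the thread as what AutoTrain requires.
    """
    count = 0
    with open(path, "w", encoding="utf-8") as f:
        for tokens, tags in examples:
            # Every token needs exactly one tag, otherwise the row is unusable.
            if len(tokens) != len(tags):
                raise ValueError(f"row {count}: {len(tokens)} tokens, {len(tags)} tags")
            row = {"tokens": tokens, "ner_tags": tags}
            f.write(json.dumps(row, ensure_ascii=False) + "\n")
            count += 1
    return count


def push_jsonl_to_hub(path, repo_id):
    """Load the JSON Lines file and upload it to the Hub.

    Requires the `datasets` package and a logged-in Hub account.
    `repo_id` is a placeholder such as "your-username/your-ner-dataset".
    """
    from datasets import load_dataset  # imported lazily; only needed for the upload

    ds = load_dataset("json", data_files=str(path), split="train")
    ds.push_to_hub(repo_id)


# Tiny made-up sample, only to show the shape of the data.
examples = [
    (["Chateau", "Margaux", "is", "in", "Bordeaux"], ["B-ORG", "I-ORG", "O", "O", "B-LOC"]),
    (["Hello", "world"], ["O", "O"]),
]
out_path = Path(tempfile.mkdtemp()) / "train.jsonl"
n_written = write_ner_jsonl(examples, out_path)
```

Calling `push_jsonl_to_hub(out_path, "your-username/your-ner-dataset")` would then upload the file. For a dataset of around 12 million rows, the custom-deployment caveat above still applies.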
