What is the purpose of training the model in https://huggingface.co/blog/how-to-train

hkbluesky · May 15, 2023, 2:26am

Hi,

I am trying to learn about transformers by following this article: How to train a new language model from scratch using Transformers and Tokenizers

But why do we need to train the EsperBERTo model? What problem does this article try to solve? Is it a classification problem?
I don’t understand how it verifies the model actually works.
Could someone help?

Thanks,

Tom

Guldeniz · May 26, 2023, 11:37am

Hello Tom

BERT-like models can take a long time to train, so I think they wanted an easy-to-learn dataset for this blog post. As you can read in the post:

Esperanto is a constructed language with a goal of being easy to learn. We pick it for this demo for several reasons:

it is a relatively low-resource language (even though it’s spoken by ~2 million people) so this demo is less boring than training one more English model

its grammar is highly regular (e.g. all common nouns end in -o, all adjectives in -a) so we should get interesting linguistic results even on a small dataset.

finally, the overarching goal at the foundation of the language is to bring people closer (fostering world peace and international understanding) which one could argue is aligned with the goal of the NLP community

You can find other datasets on the Hugging Face Hub.

I hope I understood your question correctly.
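
On the question of what the model is trained to do: as far as I can tell, the blog post pretrains EsperBERTo with a masked language modeling objective rather than as a classifier, and then checks the result by asking the model to fill in a masked word. Here is a rough pure-Python sketch of how masked training examples can be built. It assumes whitespace tokenization and a simple random mask; the function name `mask_tokens`, the mask rate, and the example sentence are only illustrative and are not the blog's code (real BERT-style masking is somewhat more involved).

```python
import random

MASK = "<mask>"  # placeholder mask string, chosen for illustration

def mask_tokens(tokens, mask_prob=0.15, seed=0):
    """Randomly replace tokens with MASK.

    Returns (masked_tokens, labels). labels holds the original token
    at masked positions and None everywhere else, so a model would
    only be scored on the positions it has to guess.
    """
    rng = random.Random(seed)
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(MASK)
            labels.append(tok)    # the model should predict this token
        else:
            masked.append(tok)
            labels.append(None)   # position is ignored by the loss
    return masked, labels

# Toy Esperanto sentence, split on whitespace (an assumption for the sketch).
tokens = "La suno brilas super la urbo .".split()
masked, labels = mask_tokens(tokens, mask_prob=0.3, seed=1)
print(masked)
print(labels)
```

A model trained this way "works" if it predicts plausible words for the masked positions, which is a quick sanity check before fine-tuning it on a downstream task.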

hkbluesky · June 19, 2023, 2:33pm

Got you. Thanks
