PreTrain Electra/T5 for Korean from scratch

JCooky · June 24, 2021, 8:35am

This project will be train the pretrained-model (Electra, T5, …) from Korean corpus called as a 모두의말뭉치.

2. Language

The model will be trained in Korean.

We can make use of example to train the model.>

Addtionaly, we will be make the train codes origin from the transformers example

A pretrained weights. After, fine-tune both text classification and text summarization

chyjis · July 1, 2021, 3:54pm

I’m interested in pretrained korean model, too! I wanna join in. ol

patrickvonplaten · July 1, 2021, 6:22pm

Awesome, finalizing you guys!

PAU · July 2, 2021, 7:45am

Good! I’d like to join this.

Topic		Replies	Views
PreTrain ELECTRA from scratch in Portuguese Flax/JAX Projects	2	1328	November 8, 2021
Pretrain GPT2 from scratch in Korean Flax/JAX Projects	3	989	July 16, 2021
Pretrain T5 for Arabic Flax/JAX Projects	17	2683	June 11, 2023
Pretrain and Fine Tune Byte-level model for multilingual extractive QA (Like ByT5) Flax/JAX Projects	13	1983	July 2, 2021
Pretrain T5 from scratch in Dutch Flax/JAX Projects	2	2089	July 7, 2021