Train a GPT-2 model for contextual common sense reasoning using the COSMOS QA dataset

Idea: A unique task wherein we train a GPT-2 model for contextual common sense reasoning using the COSMOS QA dataset. The goal is to test GPT-2's common sense reasoning ability.

Model: We need to add support for a ‘FlaxGPT2ForMultipleChoice’ model, which can be built fairly easily on top of the existing ‘FlaxGPT2Model’.
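To make the shape handling concrete, here is a minimal sketch of how a multiple-choice head typically wraps a base model: inputs of shape `(batch, num_choices, seq_len)` are flattened so each candidate sequence is scored independently, then the scores are reshaped back into per-example choice logits. `dummy_model` is a hypothetical stand-in for `FlaxGPT2Model` plus a classification head, and the whole thing is written in plain numpy just to illustrate the reshaping, not the actual Flax implementation.

```python
import numpy as np

def dummy_model(input_ids):
    # Placeholder scorer: in the real model this would be FlaxGPT2Model
    # followed by a linear head on a pooled hidden state. Here we just
    # sum the token ids to return one float score per sequence.
    return input_ids.sum(axis=-1).astype(np.float32)

def multiple_choice_logits(input_ids):
    # input_ids: (batch, num_choices, seq_len)
    batch, num_choices, seq_len = input_ids.shape
    # Flatten choices into the batch dimension so the base model sees
    # ordinary (batch * num_choices, seq_len) inputs.
    flat = input_ids.reshape(batch * num_choices, seq_len)
    scores = dummy_model(flat)                 # (batch * num_choices,)
    # Reshape back: one logit per candidate answer per example.
    return scores.reshape(batch, num_choices)  # (batch, num_choices)

# 2 examples, 4 candidate answers, sequence length 3.
input_ids = np.arange(2 * 4 * 3).reshape(2, 4, 3)
logits = multiple_choice_logits(input_ids)
print(logits.shape)  # → (2, 4)
```

The same flatten/score/reshape pattern is what existing `*ForMultipleChoice` heads in Transformers use, so the Flax GPT-2 version should be able to follow it closely.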

Dataset: cosmos_qa, available on the Hugging Face Hub.
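A rough sketch of how one COSMOS QA example would be flattened into four candidate texts for the multiple-choice model. The field names (`context`, `question`, `answer0`–`answer3`, `label`) follow the dataset card, but the example record below is invented for illustration; the real data should be loaded with `datasets.load_dataset("cosmos_qa")`.

```python
# Invented example in the cosmos_qa schema (context, question,
# answer0..answer3, label) — not a real dataset record.
example = {
    "context": "It rained all day, so the picnic was moved indoors.",
    "question": "Why was the picnic moved?",
    "answer0": "Because of the rain.",
    "answer1": "Because it was too hot.",
    "answer2": "Because nobody came.",
    "answer3": "None of the above choices.",
    "label": 0,
}

def to_choice_texts(ex):
    # One text per candidate answer; the tokenizer would then turn
    # these into input_ids of shape (num_choices, seq_len).
    return [
        f"{ex['context']} {ex['question']} {ex[f'answer{i}']}"
        for i in range(4)
    ]

texts = to_choice_texts(example)
print(len(texts))  # → 4
```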

Available training scripts: We need to prepare a multiple-choice training script for our Flax model.
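The training objective for such a script would presumably be standard softmax cross-entropy over the per-choice logits against the gold label, as in the existing multiple-choice examples. A small numpy sketch of that loss (the real script would use jax/optax equivalents):

```python
import numpy as np

def choice_loss(logits, labels):
    # logits: (batch, num_choices); labels: (batch,) gold choice index.
    # Numerically stable log-softmax over the choice dimension.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
    # Negative log-likelihood of the correct choice, averaged over batch.
    return -log_probs[np.arange(len(labels)), labels].mean()

logits = np.array([[2.0, 0.5, 0.1, -1.0],
                   [0.0, 3.0, 0.2, 0.1]])
labels = np.array([0, 1])
loss = choice_loss(logits, labels)
print(loss)
```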

We would appreciate some guidance on the project and would love to hear feedback (@patrickvonplaten and @valhalla, please do have a look). We are very excited to contribute this task to Hugging Face.

We are currently a team of two:
1. Rohan V. Kashyap (@Rohan)
2. Vivek V. Kashyap (@Vivek)

I’m in. Really excited for this upcoming project, @Rohan


Awesome, let’s finalize it - 2 should be enough, but let’s hope more people will join 🙂


Sure, let’s make it final and see if more people want to join and contribute to this project.


Thanks @patrickvonplaten, we are very much looking forward to this. We are confident of getting it done and hope more people can join our project and contribute to it.

@patrickvonplaten it would be great if you could add our project to the Google Sheet. We would like to make this a part of Hugging Face, and we are absolutely excited to get started.


Great, adding it!

I would love to be part of this project! I am a student with limited experience in Machine Learning, but I would love to help in whatever way I can!


I would love to be a part of this project. I have previously contributed to HF Transformers, and I am currently working as an ML engineer.


added you guys


Just checking here: Are you guys active?

Hi @patrickvonplaten, we are working on it. We have finished writing our training script for BERT multiple choice using Flax, along with the COSMOS QA dataset processing, and it’s working perfectly fine. We just need access to the TPU to train the GPT-2 multiple-choice model the same way. That should be completed by tomorrow. We have also created the Discord channel.