Teaming Up for Kaggle NLP Competitions

sadhaklal · April 20, 2022, 11:19am

Hi All,

I’m planning to attack a few Kaggle NLP competitions soon. Is anyone interested in attacking them together?

Prerequisites: 1. A decent grasp of raw PyTorch. 2. Familiarity with HF Transformers (e.g., the material covered in the official HF Course). 3. A minimum time commitment of 10 hours a week. If you’re busy or don’t have the time, this won’t work out.

If you meet the above prerequisites, then I propose a “cooperate to dominate” strategy for each new NLP competition:

We’ll start off with a Zoom call to discuss the problem statement & do some basic EDA.
Jointly discuss a watertight CV strategy, architecture options, training procedure ideas, etc.
Divide the work of reviewing the top Kaggle notebooks each week, extract the most useful ideas from each, and combine them into the strongest model we can build.
Divide up the experimentation - for example, one fold / one seed by each person.
Divide the responsibility of reading & summarising relevant research papers & notebooks from past competitions.
Create a central knowledge repository for each live competition (what worked, what didn’t, etc.).
Use an experiment tracking tool (e.g., Weights & Biases / Neptune.ai) to track all the experiments by all the team members.
Ensemble the best models to get the best possible rank.
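
As a rough illustration of the "one fold per person" and ensembling steps above, here is a minimal sketch using only the Python standard library. Everything in it is an assumption made for illustration: the function names (make_folds, assign_folds, average_ensemble), the member names, and the plain shuffled K-fold split. A real competition would probably call for a stratified or grouped split, and possibly a weighted blend, depending on the data and metric.

```python
import random
from statistics import fmean


def make_folds(n_samples, n_folds=5, seed=42):
    """Return a fold id for every sample.

    Using a fixed seed means every teammate gets the identical split,
    so out-of-fold scores from different people are comparable.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    fold_of = [0] * n_samples
    for position, sample_idx in enumerate(indices):
        fold_of[sample_idx] = position % n_folds
    return fold_of


def assign_folds(members, n_folds=5):
    """Hand out folds round-robin: fold k goes to members[k % len(members)]."""
    plan = {member: [] for member in members}
    for fold in range(n_folds):
        plan[members[fold % len(members)]].append(fold)
    return plan


def average_ensemble(predictions):
    """Equal-weight average of per-sample predictions from several models."""
    return [fmean(sample_preds) for sample_preds in zip(*predictions)]


if __name__ == "__main__":
    # Hypothetical team and toy predictions, for illustration only.
    folds = make_folds(n_samples=10, n_folds=5, seed=42)
    plan = assign_folds(["alice", "bob", "carol"], n_folds=5)
    blended = average_ensemble([[0.0, 1.0], [1.0, 1.0]])
    print("fold ids:", folds)
    print("who trains what:", plan)
    print("blended predictions:", blended)
```

The same idea extends to "one seed per person": keep the fold split fixed and have each member vary only the training seed, then average the resulting predictions.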

Looking forward to your responses!

P.S. The Kaggle team member limit is 5. If more than 5 people are interested, then we can create multiple teams.

chatuur · April 26, 2022, 6:33am

I’d be up for it.

StoneZhang · April 26, 2022, 10:26am

I would like to do it

sadhaklal · April 26, 2022, 2:26pm

Excellent! Getting back to you with details soon!

sadhaklal · April 26, 2022, 2:27pm

Superb! Getting back to you with details shortly!

mab · April 27, 2022, 6:34pm

Hey, sounds interesting! I would be up for it!

sadhaklal · April 28, 2022, 5:14pm

Awesome! Will get back to you shortly with details…

mab · May 9, 2022, 11:22am

Hey! Any updates?
