Teaming Up for Kaggle NLP Competitions

Hi All,

I’m planning to tackle a few Kaggle NLP competitions soon. Is anyone interested in teaming up?

Prerequisites:

  1. Decent grasp of raw PyTorch.
  2. Familiarity with HF Transformers (e.g., the material covered in the official HF Course).
  3. Minimum time commitment of 10 hours a week. If you’re busy / don’t have time, then this won’t work out.

If you meet the above prerequisites, then I propose a “cooperate to dominate” strategy for each new NLP competition:

  1. We’ll start off with a Zoom call to discuss the problem statement & do some basic EDA.
  2. Jointly discuss a watertight CV strategy, architecture options, training procedure ideas, etc.
  3. Divide the work of reviewing the top Kaggle notebooks each week, extract the most useful ideas from each, and combine them into the strongest model we can build.
  4. Divide up the experimentation: for example, each person trains one fold or one seed.
  5. Divide the responsibility of reading & summarising relevant research papers & notebooks from past competitions.
  6. Create a central knowledge repository for each live competition (what worked, what didn’t, etc.).
  7. Use an experiment tracking tool (e.g., Weights & Biases / Neptune.ai) to track all the experiments by all the team members.
  8. Ensemble the best models to get the best possible rank.
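
To make steps 2, 4, and 8 a bit more concrete, here is a rough sketch of how a shared CV split, a fold-per-person assignment, and a simple averaging ensemble could fit together. Everything in it is an assumption for illustration: the helper names (`make_folds`, `assign_folds`, `average_ensemble`), the member names, the 5-fold setup, the fixed seed, and the equal-weight average. A real competition would likely call for a stratified or grouped split and tuned ensemble weights.

```python
import random
from statistics import mean


def make_folds(n_samples, n_folds=5, seed=42):
    """Map each sample index to a fold id, reproducibly.

    Everyone on the team uses the same seed, so everyone gets the same split.
    (Plain shuffled K-fold; hypothetical helper, not a Kaggle or HF API.)
    """
    rng = random.Random(seed)
    indices = list(range(n_samples))
    rng.shuffle(indices)
    fold_of = [0] * n_samples
    for position, idx in enumerate(indices):
        fold_of[idx] = position % n_folds
    return fold_of


def assign_folds(members, n_folds=5):
    """Round-robin assignment: fold k is trained by members[k % len(members)]."""
    return {fold: members[fold % len(members)] for fold in range(n_folds)}


def average_ensemble(predictions):
    """Equal-weight average of per-sample predictions from several models."""
    return [mean(sample_preds) for sample_preds in zip(*predictions)]


if __name__ == "__main__":
    # Hypothetical team of five, matching the Kaggle team size limit.
    team = ["member_a", "member_b", "member_c", "member_d", "member_e"]
    folds = make_folds(n_samples=20, n_folds=5, seed=42)
    print("fold of each sample:", folds)
    print("who trains which fold:", assign_folds(team, n_folds=5))

    # Made-up test-set probabilities from two models, for illustration only.
    model_1 = [0.2, 0.9, 0.4]
    model_2 = [0.4, 0.7, 0.6]
    print("ensembled:", average_ensemble([model_1, model_2]))
```

Fixing the seed in one shared place matters more than the exact split method: if two people train on different splits, their CV scores and out-of-fold predictions can’t be compared or blended reliably.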

Looking forward to your responses!

P.S. The Kaggle team member limit is 5. If more than 5 people are interested, then we can create multiple teams.

I’d be up for it.

I would like to do it

Excellent! Getting back to you with details soon!

Superb! Getting back to you with details shortly!

Hey, sounds interesting!! I would be up for it!!

Awesome! Will get back to you shortly with details…

Hey! Any updates?