@sgugger Progress Update Aug 4 -> Aug 19

sgugger · August 19, 2020, 5:19pm

Following @sshleifer's example, here is what I worked on over the past few weeks and what I plan to work on in the near future.

Model outputs:

Finished cleaning up all model outputs and made sure PyTorch and TensorFlow have the same API.

Documentation:

Worked on cleaning up and updating the docstrings and documentation of the main classes: config, tokenizer, models and pipelines.
Added automatic conversion of the tutorials into notebooks, along with an “Open in Colab” button.

Trainer:

A bit of clean-up to make sure Trainer and TFTrainer have the same API.
Exposed customization points for users who want to subclass Trainer and override its behavior.
Initial work to add hyperparameter search (see #6576).
Initial work to have an easy bridge between nlp and Trainer (see #6449).

Repository consistency:

People love that each model file is self-contained and that the code is not refactored, since it lets them experiment quickly, but this can be hard to maintain! Added a script that checks that all models are tested (by the common tests) and documented.
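As a rough illustration only, a check of this kind could look like the sketch below. It is not the actual repository script: the function name, the directory layout and the file-naming scheme (modeling_*.py, test_modeling_*.py, one .rst page per model) are all assumptions made for the example, and it only checks that files exist rather than inspecting the common tests.

```python
from pathlib import Path
from typing import Dict, List


def find_unchecked_models(models_dir: Path, tests_dir: Path, docs_dir: Path) -> Dict[str, List[str]]:
    """Return the model names that lack a test file or a documentation page.

    Hypothetical layout assumed for this sketch:
      models_dir/modeling_<name>.py
      tests_dir/test_modeling_<name>.py
      docs_dir/<name>.rst
    """
    missing: Dict[str, List[str]] = {"untested": [], "undocumented": []}
    for model_file in sorted(models_dir.glob("modeling_*.py")):
        # Strip the "modeling_" prefix to get the bare model name.
        name = model_file.stem[len("modeling_"):]
        if not (tests_dir / f"test_modeling_{name}.py").exists():
            missing["untested"].append(name)
        if not (docs_dir / f"{name}.rst").exists():
            missing["undocumented"].append(name)
    return missing
```

Run in CI, a check like this could fail the build whenever either list is non-empty, so a new model cannot be merged without its tests and documentation.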

Funnel Transformer:

Paper - Initial work to understand the implementation and port it to Transformers. PyTorch version is almost done.

Plans:

Continue the work on Trainer with hyperparameter search and nlp interface.
Work on refactoring all examples to use Trainer and nlp.
Finish porting Funnel Transformer.

valhalla · August 19, 2020, 5:34pm

Very excited about Optuna integration and Funnel Transformer! Also absolutely love your code reviews and suggestions!

sshleifer · August 19, 2020, 6:01pm

+1 to that. Your code reviews have been super helpful and appreciated.

BramVanroy · August 19, 2020, 8:10pm

Optuna integration would be ground-breaking, at least for my use case. Until now I have been writing my own config class that acts as an iterator over all possible hyperparameter combinations and just loops over them. Tedious to maintain. Being able to rely on a well-tested, well-supported package would be a load off my mind!
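For readers unfamiliar with the hand-rolled approach described here, a minimal sketch of such a config iterator might look like the following. The class name GridConfig and the hyperparameter names are hypothetical, chosen only for illustration; this is an exhaustive grid, not what Optuna does.

```python
from itertools import product
from typing import Any, Dict, Iterator, List


class GridConfig:
    """Iterate over every combination of the given hyperparameter values."""

    def __init__(self, **choices: List[Any]) -> None:
        # Maps each hyperparameter name to the list of values to try.
        self.choices = choices

    def __len__(self) -> int:
        # The grid size is the product of the number of values per parameter.
        total = 1
        for values in self.choices.values():
            total *= len(values)
        return total

    def __iter__(self) -> Iterator[Dict[str, Any]]:
        names = list(self.choices)
        # product() yields one tuple per combination, in the order of `names`.
        for combo in product(*(self.choices[name] for name in names)):
            yield dict(zip(names, combo))


# Hypothetical search space: 2 learning rates x 2 batch sizes = 4 runs.
grid = GridConfig(learning_rate=[1e-5, 3e-5], batch_size=[16, 32])
configs = list(grid)
```

The grid grows multiplicatively with every added hyperparameter, which is part of why a dedicated search package is attractive.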

prajjwal1 · August 20, 2020, 2:08am

Looking forward to Optuna integration.

stefan-it · August 20, 2020, 1:57pm

Great to see that you’re working on the Funnel Transformer implementation.

I’ve already trained a model from scratch and can’t wait to properly evaluate it with Transformers.
