Please explain how HF TFSequenceClassifier implements variable input length

Serj · August 21, 2022, 12:29pm

Background: I’m creating Keras models that are based on HF models such as TFDistilBertForSequenceClassification (uncased).

I’m using DistilBERT as a feature extractor and adding a classifier on top. That means I need to define the inputs, call DistilBERT on them, bind the output, and create a new Keras model.

Here’s the thing: when creating a model like that, I need to define a fixed input length, for instance 512. Because of that, any sentence passed to this model needs to be padded to 512, even at inference time, which is a bummer because the extra padding is just wasted compute. Any shorter, unpadded sentence causes an error.
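To make the padding overhead concrete, here is a rough sketch in plain Python. It is not my Keras model, just an illustration of the two padding strategies. The pad id of 0, the example token ids, and the helper names `pad_batch`, `pad_fixed`, and `pad_dynamic` are all made up for this sketch.

```python
PAD_ID = 0     # assumed padding token id (hypothetical for this sketch)
MAX_LEN = 512  # the fixed input length baked into my Keras model


def pad_batch(batch, target_len, pad_id=PAD_ID):
    """Right-pad every sequence to target_len; return (input_ids, attention_mask)."""
    input_ids, attention_mask = [], []
    for seq in batch:
        if len(seq) > target_len:
            raise ValueError("sequence is longer than the target length")
        n_pad = target_len - len(seq)
        input_ids.append(list(seq) + [pad_id] * n_pad)
        # 1 marks a real token, 0 marks padding
        attention_mask.append([1] * len(seq) + [0] * n_pad)
    return input_ids, attention_mask


def pad_fixed(batch):
    """What my model forces today: always pad to MAX_LEN."""
    return pad_batch(batch, MAX_LEN)


def pad_dynamic(batch):
    """What I would like instead: pad only to the longest sequence in the batch."""
    return pad_batch(batch, max(len(seq) for seq in batch))


# Illustrative token ids only, not real tokenizer output.
batch = [[101, 7592, 102], [101, 2129, 2024, 2017, 102]]

fixed_ids, fixed_mask = pad_fixed(batch)
dynamic_ids, dynamic_mask = pad_dynamic(batch)

print(len(fixed_ids[0]))    # 512 positions per sentence
print(len(dynamic_ids[0]))  # 5 positions per sentence
```

With the fixed length, these two short sentences occupy 1024 positions in total; padded to the longest one they occupy 10. That difference is the waste I would like to avoid at inference time.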

When using the HF implementation directly, I can use whichever input length I want, up to 512.
How did you achieve this feat?

I’m going over your repository: https://github.dev/huggingface/transformers/blob/main/src/transformers/models/distilbert/modeling_tf_distilbert.py

But I still don’t get it. Please help.
