Can i use Transformer-XL for text classification task?

whoisltd · May 14, 2022, 3:06am

I want to use transformer xl for text classification tasks. But I don’t know the architect model for the text classification task. I use dense layers with activation softmax for logits output from the transformer xl model, but this doesn’t seem right. when training I see the loss not reduce and accuracy is very low.
I was build, training this form scratch with imdb dataset
My training step:

whoisltd · May 14, 2022, 3:07am

My logits:

Topic		Replies	Views
CodeClassifier: Shall i use Transformers or my own Custom Architecture Beginners	0	92	April 27, 2024
How to use Transformer XL for sequence classification? 🤗Transformers	2	592	October 6, 2021
Text classification on small dataset (8K) Intermediate	1	895	July 27, 2021
XLM-Roberta for many-topic classification Beginners	1	1166	December 31, 2021
Is it possible to use Decision Transformers on text? 🤗Transformers	0	231	December 22, 2022

Can i use Transformer-XL for text classification task?

Related topics