Hi, is there any info available about which model compression method was used to create tiny-tapas (pruning, distillation, etc.)? There is no info on the model card and I have been unsuccessful in finding any online.
Hi,
There’s no distillation happening there. It’s simply the smallest architecture of all the TAPAS variants the authors released: only 2 hidden layers (as can be seen here) and 2 attention heads, compared to the base model, which uses 12 hidden layers and 12 attention heads.
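You can see the difference yourself by comparing the configurations. A minimal sketch using the `transformers` library, constructing the configs locally (the exact `hidden_size` of the tiny variant is an assumption here; for the authoritative values, load the checkpoint's config with `AutoConfig.from_pretrained`):

```python
from transformers import TapasConfig

# Tiny variant: 2 hidden layers, 2 attention heads
# (hidden_size=128 is assumed, following the BERT-tiny convention)
tiny = TapasConfig(num_hidden_layers=2, num_attention_heads=2, hidden_size=128)

# Default TapasConfig matches the base model: 12 layers, 12 heads
base = TapasConfig()

print(tiny.num_hidden_layers, tiny.num_attention_heads)  # 2 2
print(base.num_hidden_layers, base.num_attention_heads)  # 12 12
```

So the tiny model was trained directly at this smaller size, rather than being derived from a larger model by pruning or distillation.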