Hi! Hope you are well!
I would like to know the following about the two pre-trained models listed below:
- whether they were trained on the same dataset
- which one performs better in terms of summarization metrics (e.g. ROUGE)
- whether there are any specific differences, e.g. a different maximum input size, or being trained to generate outputs of different lengths
- sshleifer/distilbart-cnn-12-6
- sshleifer/distill-pegasus-cnn-16-4
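
In case it helps, here is a rough sketch of how I would try to compare the two configs myself. I am assuming the relevant fields are `max_position_embeddings`, `max_length`, `min_length` and `length_penalty`; these may be missing or stored elsewhere depending on the `transformers` version. The helper names `diff_configs` and `compare_models` are my own, not part of any library.

```python
from typing import Any, Dict, Iterable, Tuple

# Assumed field names; a field that is absent from a config shows up as None.
FIELDS = ("max_position_embeddings", "max_length", "min_length", "length_penalty")


def diff_configs(
    a: Dict[str, Any], b: Dict[str, Any], fields: Iterable[str] = FIELDS
) -> Dict[str, Tuple[Any, Any]]:
    """Return {field: (value_in_a, value_in_b)} for the fields whose values differ."""
    differences = {}
    for field in fields:
        value_a, value_b = a.get(field), b.get(field)
        if value_a != value_b:
            differences[field] = (value_a, value_b)
    return differences


def compare_models(model_id_a: str, model_id_b: str) -> Dict[str, Tuple[Any, Any]]:
    """Download both configs from the Hub and diff the assumed fields.

    Needs the `transformers` package and network access, so it is imported lazily.
    """
    from transformers import AutoConfig

    config_a = AutoConfig.from_pretrained(model_id_a).to_dict()
    config_b = AutoConfig.from_pretrained(model_id_b).to_dict()
    return diff_configs(config_a, config_b)
```

Calling `compare_models("sshleifer/distilbart-cnn-12-6", "sshleifer/distill-pegasus-cnn-16-4")` should then list the fields that differ, but I am not sure this covers everything that matters, and it says nothing about the training data or the metrics.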
Thanks!