Concatenate Sentences

david-waterworth · March 8, 2021, 10:13am

Is there a helper class in one of the Hugging Face libraries that can concatenate multiple short sentences into a single tensor? I want to tokenise each input example, shuffle the examples, and then combine them until each sequence is ~512 tokens long. I thought that was part of the RoBERTa training process, but I could be mistaken.
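
For reference, the steps described above (tokenise, shuffle, combine up to ~512 tokens) can be sketched in plain Python. This is only an illustration under stated assumptions, not a Hugging Face helper: `pack_examples`, `max_len`, `sep_id` and `seed` are hypothetical names, and the input is assumed to be a list of token-id lists such as those returned by `tokenizer(text)["input_ids"]`.

```python
import random
from typing import List, Optional


def pack_examples(
    tokenized: List[List[int]],
    max_len: int = 512,
    sep_id: Optional[int] = None,
    seed: int = 0,
) -> List[List[int]]:
    """Shuffle tokenized examples and greedily concatenate them into
    sequences of at most max_len token ids (hypothetical helper)."""
    examples = list(tokenized)  # copy so the caller's list is not reordered
    random.Random(seed).shuffle(examples)

    packed: List[List[int]] = []
    current: List[int] = []
    for ids in examples:
        if not ids:
            continue  # skip empty examples
        ids = ids[:max_len]  # truncate a single example that is too long
        # Length this example adds, including a separator if one is needed.
        extra = len(ids) + (1 if sep_id is not None and current else 0)
        if current and len(current) + extra > max_len:
            # The example does not fit: close the current sequence.
            packed.append(current)
            current = []
        if current and sep_id is not None:
            current.append(sep_id)  # assumed separator between examples
        current.extend(ids)
    if current:
        packed.append(current)
    return packed
```

Each returned list could then be padded to `max_len` and stacked into a tensor. The greedy strategy keeps every example intact, at the cost of some padding; concatenating everything and cutting fixed-size chunks would avoid padding but split examples across sequences.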
