Treating punctuation restoration as a Seq2Seq task

Punctuation restoration is usually treated as a multi-class classification problem: for each word, predict the punctuation mark that should follow it from a label set such as {O, “,”, “!”, “:”}.
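For illustration, the framing looks roughly like this (the example sentence and label set are just placeholders):

```python
# Each word of the unpunctuated input gets the label of the punctuation
# mark that should follow it, or "O" for no punctuation.
words  = ["hello", "world", "how", "are", "you"]
labels = [",",     "O",     "O",   "O",   "?"]   # restored text: "hello, world how are you?"
```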

I tried a seq2seq approach for this and the trained BART model seems really good at it. To compare it with previous methods, I need to cast the results into a classification format like this one: https://github.com/nkrnrnk/BertPunc
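The conversion I am aiming for is roughly this (a sketch, assuming the model only inserts punctuation and leaves the words themselves unchanged): skip the subword tokenizer entirely, split the generated text on whitespace, peel one trailing punctuation mark off each word, and treat that mark as the predicted label for the corresponding input word.

```python
PUNCT = {",", ".", "?", "!", ":", ";"}  # illustrative label set

def labels_from_punctuated(text):
    """Convert punctuated text into (word, label) pairs.

    Assumes the seq2seq model only inserts punctuation after words and
    does not change the words themselves.
    """
    pairs = []
    for token in text.split():
        # Peel one trailing punctuation mark off the word, if present.
        if token and token[-1] in PUNCT:
            word, label = token[:-1], token[-1]
        else:
            word, label = token, "O"
        pairs.append((word.lower(), label))
    return pairs

# labels_from_punctuated("hello, world how are you?")
# -> [('hello', ','), ('world', 'O'), ('how', 'O'), ('are', 'O'), ('you', '?')]
```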

What would be the best way to get precision, recall, and F1 scores for a seq2seq task?

  • I tried using the tokenizer to separate the punctuation marks, but it usually merges the punctuation into adjacent subwords and assigns a different id (see the word-level sketch below).
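My current workaround is to compare at the word level rather than the subword level, so the tokenizer's merging behaviour does not matter, and then hand the aligned labels to scikit-learn. This is only a sketch, reusing the hypothetical `labels_from_punctuated` helper above with an illustrative label set:

```python
from sklearn.metrics import precision_recall_fscore_support

def punct_scores(reference_texts, generated_texts):
    """Word-level precision/recall/F1 per punctuation class."""
    y_true, y_pred = [], []
    for ref, gen in zip(reference_texts, generated_texts):
        ref_pairs = labels_from_punctuated(ref)
        gen_pairs = labels_from_punctuated(gen)
        # Only score examples where the word sequences still line up;
        # if generation altered the words, the alignment would be unsafe.
        if [w for w, _ in ref_pairs] != [w for w, _ in gen_pairs]:
            continue
        y_true.extend(label for _, label in ref_pairs)
        y_pred.extend(label for _, label in gen_pairs)
    # Score only the punctuation classes, excluding the "O" class.
    classes = [",", ".", "?", "!", ":", ";"]
    return precision_recall_fscore_support(
        y_true, y_pred, labels=classes, zero_division=0
    )
```

With `average=None` (the default) this returns per-class arrays; passing `average='micro'` instead collapses them into single aggregate precision/recall/F1 numbers. Not sure this alignment trick is the cleanest approach, hence the question.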