I wonder if an untrained Electra model loaded from config with same number of classes(Tokens) as bert tokenizer, will it be able to use bert tokenizer and do the training part?
1 Like
I wonder if an untrained Electra model loaded from config with same number of classes(Tokens) as bert tokenizer, will it be able to use bert tokenizer and do the training part?