Is this the same for the BertForSequenceClassification model? The documentation describes a similar method for obtaining hidden-layer embeddings for both models. I just want to confirm.
Also, it should be a vector of size 768 with float values between -1 and 1, correct?
Or is it normalized (between 0 and 1)? I get values between -1 and 1. Is that how it should look?
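
For reference, here is a minimal sketch of one way to inspect the size and value range of such a vector. It assumes a recent `transformers` release (v4 or later) with PyTorch, where the model output exposes a `hidden_states` attribute. To avoid downloading anything, it uses a small randomly initialised 2-layer model built from `BertConfig`, so the printed values are only illustrative and will differ from a pretrained checkpoint. The random `input_ids` and the variable names are placeholders, not taken from the documentation.

```python
import torch
from transformers import BertConfig, BertForSequenceClassification

# Assumption: a small, randomly initialised model (no pretrained weights).
# hidden_size keeps its default of 768; only the layer count is reduced.
config = BertConfig(num_hidden_layers=2, output_hidden_states=True)
model = BertForSequenceClassification(config)
model.eval()

# Placeholder input: one sequence of 8 random token ids.
input_ids = torch.randint(0, config.vocab_size, (1, 8))

with torch.no_grad():
    outputs = model(input_ids=input_ids)

# Tuple with the embedding output plus one tensor per encoder layer.
hidden_states = outputs.hidden_states
last_layer = hidden_states[-1]   # shape: (batch, seq_len, hidden_size)
first_token = last_layer[0, 0]   # vector at the first position of the sequence

print(len(hidden_states), tuple(first_token.shape))
print(first_token.min().item(), first_token.max().item())
```

Printing the minimum and maximum this way on a real checkpoint would show directly whether the values stay within a fixed range for a given input.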