Probability of a word within a given context / Reasonability of a sequence of words

I am looking for an NLP model that tells me how probable/reasonable a given word is within the context of some other words or word sequences.

For example:
Consider the sentence “I build a house.”, which is a reasonable sentence.

Now, what I want to know is P(“build” | “I”, “a house”) or, to put it differently, model.score(“I build a house”).

In other words, I want to know how reasonable this sentence is, and I would expect a score or probability significantly different from zero.

Please note: I do not want to predict a word; rather, I want to know whether an already existing sequence of words is reasonable, i.e., makes sense.

A negative example, i.e., a sentence/word sequence that does not make sense, would be “I build a soup”.

In this case, model.score(“I build a soup”) or P(“build” | “I”, “a soup”) should be close to zero, or at least extremely low.

Do you know of any model that can accomplish this task?

Because you want probabilities for words in the middle of a sequence, a bidirectional encoder model would suit you best. I recommend bert-base-uncased. One of the self-supervised tasks it was trained with is token masking, whereby it attempts to predict masked-out tokens in a given sentence. Details are available in its paper.
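As a minimal sketch of that masked-token lookup, assuming the Hugging Face transformers library (names like mask_idx and build_id are just illustrative):

```python
import torch
from transformers import BertTokenizer, BertForMaskedLM

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")
model.eval()

# Mask the word whose probability we want: P("build" | "I [MASK] a house.")
inputs = tokenizer("I [MASK] a house.", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

# Find the position of [MASK] and turn its logits into a distribution over the vocabulary
mask_idx = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0]
probs = logits[0, mask_idx].softmax(dim=-1)

# Look up the probability assigned to the original word
# (assumes "build" is a single token in the BERT vocabulary)
build_id = tokenizer.convert_tokens_to_ids("build")
print(probs[0, build_id].item())
```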

A thorough how-to is here, but the steps you’d want to take are as follows (a code sketch follows the list):

  1. Load the tokenizer and model with BertTokenizer, BertForMaskedLM, etc.
  2. Take the sentence for which you want P(x | S), where S = {X_1, …, X_n} is the surrounding context, and replace x with “[MASK]”
    e.g., “I build a house” → “I [MASK] a house”
  3. Encode the masked string and pass it to the model to get the output logits
  4. Convert the logits at the masked position into probabilities and read off the probability (or loss) of the x you had in mind
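To get something like the model.score(…) you asked for, a common extension of these steps is to mask each token in turn and sum the log-probabilities (the pseudo-log-likelihood of Salazar et al., “Masked Language Model Scoring”, 2020). A rough sketch, again assuming the transformers library; pseudo_log_likelihood is just an illustrative name:

```python
import torch
from transformers import BertTokenizer, BertForMaskedLM

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")
model.eval()

def pseudo_log_likelihood(sentence: str) -> float:
    """Sum of log P(token | rest of sentence), masking one token at a time."""
    input_ids = tokenizer(sentence, return_tensors="pt")["input_ids"][0]
    total = 0.0
    # Positions 0 and -1 hold the [CLS] and [SEP] special tokens; skip them
    for i in range(1, input_ids.size(0) - 1):
        masked = input_ids.clone()
        masked[i] = tokenizer.mask_token_id
        with torch.no_grad():
            logits = model(masked.unsqueeze(0)).logits
        # Log-probability of the original token at the masked position
        log_probs = logits[0, i].log_softmax(dim=-1)
        total += log_probs[input_ids[i]].item()
    return total

# The reasonable sentence should score higher (less negative) than the nonsensical one
print(pseudo_log_likelihood("I build a house."))
print(pseudo_log_likelihood("I build a soup."))
```

Note that this is not a true sentence probability (BERT is not a left-to-right language model), but it does rank sentences by plausibility, which is what you’re after.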

Hope that helps.