How to determine if a sentence is correct?

zokica · September 15, 2021, 8:23pm

Is there any way to calculate whether a sentence is correct? I have tried calculating sentence perplexity using GPT-2, as described here: GPT-2 Perplexity Score Normalized on Sentence Lenght?.

The results I get are quite close, considering that the second sentence is obviously wrong in every way.
I am a man. 50.63967
I is an man. 230.10565
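For reference, sentence perplexity is the exponential of the average negative log-probability per token, so scores like the two above come from a computation along these lines. This is only a minimal sketch, not the code from the linked topic: the helper names `perplexity_from_logprobs` and `gpt2_perplexity` are made up for illustration, and it assumes `torch` and `transformers` are installed and that the stock `gpt2` checkpoint is the model in use.

```python
import math


def perplexity_from_logprobs(token_logprobs):
    """Perplexity = exp(-(mean natural-log probability per token))."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


def gpt2_perplexity(sentence, model_name="gpt2"):
    """Score one sentence with GPT-2 (assumed checkpoint: 'gpt2')."""
    # Imported lazily so the pure-Python helper above works without torch.
    import torch
    from transformers import GPT2LMHeadModel, GPT2TokenizerFast

    tokenizer = GPT2TokenizerFast.from_pretrained(model_name)
    model = GPT2LMHeadModel.from_pretrained(model_name)
    model.eval()

    # Prepend the BOS token so the first real token is also predicted.
    ids = tokenizer(tokenizer.bos_token + sentence, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits

    # The logits at position i predict the token at position i + 1.
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    targets = ids[0, 1:]
    token_logprobs = log_probs.gather(1, targets.unsqueeze(1)).squeeze(1)
    return perplexity_from_logprobs(token_logprobs.tolist())
```

Calling `gpt2_perplexity("I am a man.")` and `gpt2_perplexity("I is an man.")` gives two scores to compare; the exact values depend on the checkpoint and on how the sentence is tokenized. Because perplexity is exponential in the average log-probability, it can help to compare ratios rather than differences: 230.1 against 50.6 is a factor of about 4.5. The sketch reloads the model on every call, which is fine for a quick check but wasteful in a loop.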

Is there any other way to calculate whether a sentence is correct? The two scores above seem quite close to me.

Maybe finetune T5 on examples, if there is a training set?

I have built some huge 3-gram and 4-gram models and they seem to be useless: even though I used around 800 GB of text, I can't tell whether a sentence is good or not.

ehalit · September 16, 2021, 5:04am

Although I cannot vouch for their quality, there are a number of grammar correction models on the model hub: Models - Hugging Face

They seem to finetune T5 or GPT as you mentioned. However, there will never be a guarantee that the model output is 100% grammatically correct. I think a rule-based approach suits grammar the most, since it mostly follows well-defined rules.
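To make "rule-based" concrete, here is a toy sketch of the idea: each rule is a pattern plus a message, and a sentence is flagged when any pattern matches. The three rules and the names `RULES` and `check` are invented for this illustration. They are crude spelling heuristics (for example, "h" and "u" are left out of the letter sets to avoid flagging "an hour" or "a university") and come nowhere near a real grammar checker.

```python
import re

# Hypothetical toy rules: (rule id, pattern, message).
RULES = [
    ("AGREEMENT_I_BE",
     re.compile(r"\bI\s+(is|are)\b"),
     "'I' takes 'am', not 'is'/'are'"),
    ("AN_BEFORE_CONSONANT",
     re.compile(r"\ban\s+[bcdfgjklmnpqrstvwxyz]\w*", re.IGNORECASE),
     "use 'a' before a consonant sound"),
    ("A_BEFORE_VOWEL",
     re.compile(r"\ba\s+[aeio]\w*", re.IGNORECASE),
     "use 'an' before a vowel sound"),
]


def check(sentence):
    """Return a list of (rule_id, message, matched_text) for every rule hit."""
    hits = []
    for rule_id, pattern, message in RULES:
        for match in pattern.finditer(sentence):
            hits.append((rule_id, message, match.group(0)))
    return hits


def is_correct(sentence):
    """A sentence passes only if no rule fires."""
    return not check(sentence)
```

With these rules, `check("I is an man.")` reports both the agreement error and the article error, while `check("I am a man.")` reports nothing. The trade-off is the usual one for rules: every hit is explainable, but anything not covered by a rule passes silently.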

nielsr · September 16, 2021, 7:55am

Hi,

The task you are referring to is one of the subtasks in the GLUE benchmark (which is an important benchmark in NLP): the CoLA dataset (CoLA is short for Corpus of Linguistic Acceptability). This is simply a binary classification task: given a sentence, the model needs to determine whether the sentence is grammatically correct or not.

Hence, you can use a BERT model (or one of its variants, such as RoBERTa, DistilBERT, etc.) fine-tuned on this dataset. This is already available on the hub, for example this one.
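As a rough sketch of how such a checkpoint can be used once it is loaded: the `text-classification` pipeline in `transformers` returns a label and a score per sentence, which can then be turned into a yes/no decision. The model name `textattack/bert-base-uncased-CoLA` below is an assumption picked for illustration and is not necessarily the checkpoint linked above. The mapping of `LABEL_1` to "acceptable" follows the CoLA convention (1 = acceptable), but it should be checked against the model card of whatever checkpoint you use. The helper names `is_acceptable` and `classify` are hypothetical.

```python
def is_acceptable(prediction, acceptable_label="LABEL_1", threshold=0.5):
    """Turn one pipeline prediction ({'label': ..., 'score': ...}) into a bool.

    Assumes a two-class model, so the probability of the other class
    is 1 - score.
    """
    score = prediction["score"]
    p_acceptable = score if prediction["label"] == acceptable_label else 1.0 - score
    return p_acceptable >= threshold


def classify(sentences, model_name="textattack/bert-base-uncased-CoLA"):
    """Run a CoLA-fine-tuned classifier over a list of sentences.

    model_name is an assumed checkpoint; swap in the one you actually use.
    """
    # Imported lazily so is_acceptable() works without transformers installed.
    from transformers import pipeline

    classifier = pipeline("text-classification", model=model_name)
    predictions = classifier(sentences)  # one {'label', 'score'} dict per sentence
    return [is_acceptable(p) for p in predictions]
```

For example, `classify(["I am a man.", "I is an man."])` returns one boolean per sentence. Raising `threshold` makes the check stricter, at the cost of rejecting more borderline but valid sentences.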

zokica · September 19, 2021, 7:04pm

Rule-based grammar: which library? But I really doubt such libraries can determine whether a sentence is good or not. I found that CoLA can sometimes determine it well.

zokica · September 19, 2021, 7:06pm

It is good sometimes; I mean, it worked for shorter sentences. But there is no information on how to use it, only on how to load the model.

Coolcoder009 · July 9, 2024, 7:47am

Hi, if you're looking for a model that predicts whether a given sentence is correct or not, you can go with Gramformer; it also has a corrector.
