A common method in the literature for detecting whether a text was generated by AI is to take the output probabilities of a large language model (LLM) and check how well they match the given text. Naturally, this is an LLM-dependent feature, since each LLM assigns different token probabilities in a given context. Given how widely applicable and significant this approach is, I was curious whether the transformers library offers a similar feature. I looked in the pipeline section, as it seemed the most appropriate place, but I didn’t find anything relevant.
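To make the idea concrete, here is a minimal sketch of the kind of score I have in mind: the log-probability the model assigns to each token of the text, summarized as a perplexity. It assumes a causal model such as `gpt2` loaded through `AutoModelForCausalLM`. The function names `token_log_probs` and `perplexity` are my own and are not part of transformers.

```python
import math


def token_log_probs(text, model_name="gpt2"):
    """Return the log-probability the model assigns to each token of `text`.

    The first token has no preceding context, so it gets no score.
    `model_name` is an assumption; any causal LM checkpoint should work.
    """
    # Imported here so that `perplexity` below can be used without
    # the heavy dependencies installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    model.eval()

    input_ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(input_ids).logits  # shape: (1, seq_len, vocab_size)

    # The logits at position i predict the token at position i + 1.
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    targets = input_ids[0, 1:]
    picked = log_probs.gather(1, targets.unsqueeze(1)).squeeze(1)
    return picked.tolist()


def perplexity(log_probs):
    """Perplexity = exp of the negative mean token log-probability."""
    if not log_probs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(log_probs) / len(log_probs))
```

Calling `perplexity(token_log_probs("Some text to score."))` would give a single number for the text. A low value only means the chosen model finds the text predictable; turning that into an "AI-generated" decision would still need a threshold calibrated on reference data, which this sketch does not attempt.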
I would like to gather your insights on two points:
- Is this feature of interest to other members of the community?
- Has it already been implemented, or are there any ongoing efforts in this direction?
Thank you for your feedback!