Belgpt is NSFW in inference API

cedpsam · February 7, 2021, 4:28pm

belgpt with default prompt generate a sightly NSFW sentence

Mon nom est Julien et j’aime beaucoup lecher les sexes . Le premier est un peu

with the prompt “Mon nom est Julien et j’aime”

for information it deepl translation is just

My name is Julien and I love to lick the sexes. The first one is a little

certainly a good prompt for erotic fanfiction…

BramVanroy · February 7, 2021, 4:47pm

Who chose this example… Almost feels like it was intended, in poor taste, but might be unfortunate coincidence as well.

Not sure if the examples on the model card should be checked for proper language. Might be a lot of work to do manually for all new models.

FL33TW00D · February 7, 2021, 5:47pm

I would be curious to see how long a well programmed string matching algorithm could test for unscrupulous language against a list such as:

Could perhaps avoid a PR mishap for HuggingFace.

cedpsam · February 7, 2021, 7:45pm

hugginface hosted api repetabily generate this on the site, I was just testing the dialog and it generated the start of an erotic fiction apparently. Not that it disturbs me but it could be a problem from some uses

cedpsam · February 7, 2021, 7:51pm

Maybe a certain type of text classification should be better than a simple list of words for this purpose. I wonder if there are datasets for this, with sexual, offensive or hateful tags

sgugger · February 8, 2021, 3:22am

cc @julien-c

BramVanroy · February 8, 2021, 8:52am

Still, it is a lot of overhead for every submitted (and edited) model card to run the model and test for obscenity. It would be the better approach but maybe not worth while (this has to be multilingual, too, and reliably test on the correct languages).

julien-c · February 8, 2021, 11:12am

@cedpsam Thanks for reporting. I’m assuming you’re referring to this model: antoiloui/belgpt2 · Hugging Face

This model doesn’t override the default input samples in its model card so the model page is using the defaults defined for French in widgets-server/DefaultWidget.ts at 50744183ba833548630b5240c8d4ac3ab57fb8b4 · huggingface/widgets-server · GitHub

Perhaps we can change the default inputs to be less susceptible of triggering this kind of language?

cedpsam · February 8, 2021, 2:09pm

i confirm that it’s this one, I just tried the default sample

Topic		Replies	Views
Guidance on getting started with fine tuned uncensored model Beginners	2	1017	March 8, 2025
Seeking uncensored Chatgpt for Creative Writing Models	1	3722	September 26, 2024
How to train a Model for Erotic Story Writing with Explicit Details? Beginners	5	2639	June 19, 2025
French NLP - Introduction 🇫🇷 Languages at Hugging Face	4	1221	January 18, 2024
My generate reponse is wrong Beginners	0	232	February 9, 2024

Belgpt is NSFW in inference API

Related topics