Rephrasing hate speech comments into neutral ones using generative AI

I fine-tuned a Twitter RoBERTa model on a hate speech dataset from Kaggle to classify whether a given comment is neutral, hate speech, or offensive. The classification works fine, but I have not been able to get generative AI models to transform a hate speech comment into a neutral one. I have tried multiple models, fine-tuning on the hate speech and neutral comment dataset, and prompting, but nothing has worked. Can anyone help or offer some advice? Thank you.
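
To make the setup concrete, here is a minimal sketch of the pipeline described above, under some assumptions: the prompt wording, the label strings, the retry loop, and the names `build_prompt`, `rephrase`, `toy_classify`, and `toy_generate` are all hypothetical and only illustrate the idea. The two toy functions are placeholders for the fine-tuned classifier and for whichever generative model is used. Re-checking each rewrite with the classifier is an added assumption, not something from my current code.

```python
from typing import Callable, Optional


def build_prompt(comment: str) -> str:
    """Build an instruction-style prompt asking for a neutral rewrite (hypothetical wording)."""
    return (
        "Rewrite the following comment so that it is neutral and non-offensive, "
        "keeping its original meaning.\n"
        f"Comment: {comment}\n"
        "Neutral version:"
    )


def rephrase(
    comment: str,
    classify: Callable[[str], str],
    generate: Callable[[str], str],
    max_tries: int = 3,
) -> Optional[str]:
    """Return a rewrite that the classifier labels 'neutral', or None if none is found.

    classify: maps text to a label string such as 'neutral', 'offensive', 'hate speech'.
    generate: maps a prompt to generated text (any generative model can sit behind it).
    """
    if classify(comment) == "neutral":
        return comment  # already neutral, nothing to rewrite
    prompt = build_prompt(comment)
    for _ in range(max_tries):
        candidate = generate(prompt).strip()
        # Accept the candidate only if the classifier agrees it is neutral.
        if candidate and classify(candidate) == "neutral":
            return candidate
    return None


# Toy placeholders so the sketch runs without downloading any model.
def toy_classify(text: str) -> str:
    return "offensive" if "stupid" in text.lower() else "neutral"


def toy_generate(prompt: str) -> str:
    return " I disagree with this idea. "


if __name__ == "__main__":
    print(rephrase("This is a stupid idea", toy_classify, toy_generate))
```

In a real run, `classify` would wrap the fine-tuned RoBERTa classifier and `generate` would wrap the generative model, with sampling enabled so that repeated tries can produce different candidates.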