Bounty: replicate a platform like HeyGen

Looking for someone to help build a platform similar to HeyGen:
translating videos into other languages, with lip-synced video.
Our research suggested the following pipeline:
transcribe with Whisper →
translate with DeepL →
generate the translated audio with Bark, using a learned voice model →
lip-sync the mp4 to the new mp3 (or wav) [this step is still missing] →
merge mp3 and mp4 with ffmpeg
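
To make the ffmpeg ends of this pipeline concrete, here is a minimal sketch in Python. It only builds the ffmpeg command lines for pulling the speech track out of the source video (as input for transcription) and for attaching the new audio to the video at the end. The function names, file names, and the 16 kHz mono WAV choice are my own assumptions for illustration, not something the tools above require, and the lip-sync step is not covered here.

```python
import subprocess
from pathlib import Path


def extract_audio_cmd(video: Path, audio_out: Path) -> list[str]:
    """Build an ffmpeg command that extracts the audio track as mono 16 kHz WAV.

    The sample rate and channel count are assumptions; adjust to whatever
    the transcription step expects.
    """
    return [
        "ffmpeg", "-y",
        "-i", str(video),
        "-vn",                 # drop the video stream
        "-ac", "1",            # mono
        "-ar", "16000",        # 16 kHz sample rate
        "-c:a", "pcm_s16le",   # uncompressed 16-bit PCM
        str(audio_out),
    ]


def merge_cmd(video: Path, new_audio: Path, out: Path) -> list[str]:
    """Build an ffmpeg command that replaces the video's audio with new_audio."""
    return [
        "ffmpeg", "-y",
        "-i", str(video),      # input 0: original (or lip-synced) video
        "-i", str(new_audio),  # input 1: translated speech (mp3 or wav)
        "-map", "0:v:0",       # take video from input 0
        "-map", "1:a:0",       # take audio from input 1
        "-c:v", "copy",        # do not re-encode the video
        "-c:a", "aac",         # encode audio to AAC for the mp4 container
        "-shortest",           # stop at the shorter of the two streams
        str(out),
    ]


def run(cmd: list[str]) -> None:
    """Run a command and raise if ffmpeg exits with an error."""
    subprocess.run(cmd, check=True)
```

With ffmpeg installed, `run(merge_cmd(Path("talk.mp4"), Path("talk_de.wav"), Path("talk_de.mp4")))` would produce the dubbed file (the file names are placeholders). Copying the video stream keeps this step fast, but it also means the translated audio has to roughly match the original timing, otherwise `-shortest` will cut one of the two streams.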

Any suggestions? Let's discuss how big the bounty should be to get noticed :slight_smile: 1k? 2k? 10k? Offers are welcome.