ASR on multilingual audio data (code-switching)

Hello , i want to finetune whisper model on audio files of an arabic dialect however the audios can have foreign words in french or english (code-switching) how can i handle these words to be transcribed correctly?