Fix sample code to deploy and inference tasks using audio/image files

Does anyone know where to contribute that?