Trying to interface with RoBERTa (TensorFlow)

I am trying to reproduce code from a Kaggle competition on tweet sentiment and toxic phrase location identification. I am not completely clear on how the input and output data streams connect to the model. The inputs are X = [input_ids, attention_mask, token_type_ids] and the targets are Y = [start_tokens, end_tokens]. Any links to code examples would help.
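
To make the question concrete, here is a minimal sketch of how I understand these five arrays to be laid out. It is not the competition code. Everything in it is an assumption made for illustration: `toy_encode` is a hypothetical whitespace tokenizer standing in for the real RoBERTa tokenizer, `MAX_LEN` is an arbitrary padded length, the special-token ids 0, 1, 2 are what I believe roberta-base uses for `<s>`, `<pad>`, `</s>`, and the `<s> tweet </s></s> sentiment </s>` layout is my guess at the input format.

```python
import numpy as np

MAX_LEN = 16             # assumed padded length; rows longer than this are not handled
BOS, PAD, EOS = 0, 1, 2  # assumed ids of <s>, <pad>, </s> in roberta-base


def toy_encode(text, vocab):
    """Hypothetical stand-in tokenizer: one made-up id per whitespace-separated word."""
    return [vocab.setdefault(word, len(vocab) + 4) for word in text.split()]


def find_span(tokens, sub):
    """Return (first, last) indices of the first occurrence of sub inside tokens."""
    for i in range(len(tokens) - len(sub) + 1):
        if tokens[i:i + len(sub)] == sub:
            return i, i + len(sub) - 1
    raise ValueError("selected text not found in tweet")


def build_arrays(rows):
    """rows: list of (tweet, selected_text, sentiment) strings."""
    n = len(rows)
    vocab = {}
    input_ids = np.full((n, MAX_LEN), PAD, dtype="int32")
    attention_mask = np.zeros((n, MAX_LEN), dtype="int32")
    token_type_ids = np.zeros((n, MAX_LEN), dtype="int32")  # left as all zeros
    start_tokens = np.zeros((n, MAX_LEN), dtype="int32")
    end_tokens = np.zeros((n, MAX_LEN), dtype="int32")

    for k, (tweet, selected, sentiment) in enumerate(rows):
        tweet_ids = toy_encode(tweet, vocab)
        first, last = find_span(tweet_ids, toy_encode(selected, vocab))
        # Assumed layout: <s> tweet </s></s> sentiment </s>
        ids = [BOS] + tweet_ids + [EOS, EOS] + toy_encode(sentiment, vocab) + [EOS]
        input_ids[k, :len(ids)] = ids
        attention_mask[k, :len(ids)] = 1   # 1 for real tokens, 0 for padding
        start_tokens[k, first + 1] = 1     # +1 skips the leading <s>
        end_tokens[k, last + 1] = 1

    return [input_ids, attention_mask, token_type_ids], [start_tokens, end_tokens]


rows = [("my day was so awful today", "so awful", "negative"),
        ("what a great game", "great", "positive")]
X, Y = build_arrays(rows)
print([a.shape for a in X], [a.shape for a in Y])
```

If this layout is right, I assume a Keras model with three inputs and two outputs would be trained with something like `model.fit(X, Y)`, with each output being a per-token probability that is compared against the one-hot start or end row.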