Pre-training: ValueError: You should supply an encoding or a list of encodings to this method that includes input_ids, but you provided []

You are loading the data directly, You need to encode the data before sending to the model. Looks like you are directly sending the data without any encoding to the model.