ValueError: Need either a dataset name or a training/validation file

I am fine tuning gpt-2 following the example script:

export TRAIN_FILE=/path/to/dataset/wiki.train.raw
export TEST_FILE=/path/to/dataset/wiki.test.raw

python run_lm_finetuning.py
–output_dir=output
–model_type=gpt2
–model_name_or_path=gpt2
–do_train
–train_data_file=$TRAIN_FILE
–do_eval
–eval_data_file=$TEST_FILE

I am using run_clm.py in this link:

I got:
Traceback (most recent call last):
File “/Users/wenzhao/Downloads/Keras/run_clm.py”, line 627, in
main()
File “/Users/wenzhao/Downloads/Keras/run_clm.py”, line 222, in main
model_args, data_args, training_args = parser.parse_args_into_dataclasses()
File “/Users/wenzhao/anaconda3/lib/python3.10/site-packages/transformers/hf_argparser.py”, line 346, in parse_args_into_dataclasses
obj = dtype(**inputs)
File “”, line 15, in init
File “/Users/wenzhao/Downloads/Keras/run_clm.py”, line 200, in post_init
raise ValueError(“Need either a dataset name or a training/validation file.”)
ValueError: Need either a dataset name or a training/validation file.

Hi,

The flag is called --train_file, not –train_data_file