Due to proxies and various other restrictions and policies, I cannot download the data using the APIs like:
from datasets import load_dataset
raw_datasets = load_dataset("glue", "mrpc")
I had the same problem when downloading pretrain models, but there is an alternative, to download the model files and load the model locally, for example:
git lfs install
git clone https://huggingface.co/bert-base-uncased
Then i can use
model = AutoModelForSequenceClassification.from_pretrained("path/to/locally/downloaded/model/files")
Can I download the dataset files in a similar fashion directly and for example use? if yes how?
raw_datasets = load_dataset("path/to/locally/downloaded/dataset/files")