Uploading and Download Model Errors

Leon68 · July 14, 2023, 12:32am

I first fine tune a model using qlora, similar to this notebook here. Google Colab

I then save it using trainer.push_to_hub()

Next, I open a new notebook with this code here:

model_name = “Leon68/falcon-7b-openassistant”
#“tiiuae/falcon-7b-instruct”

model = AutoModelForCausalLM.from_pretrained(model_name,device_map=‘auto’,trust_remote_code=True)

model_name = “tiiuae/falcon-7b-instruct”
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
tokenizer.pad_token = tokenizer.eos_token

input_text = “teach me how to fly”
input_ids = tokenizer(input_text, return_tensors=“pt”).input_ids.to(“cuda”)

next_input = input_ids
max_length = 80 # Change this to your desired output length. Too long could cause an OOM Out of Memory error!
current_length = input_ids.shape[1]

while True:
if current_length >= max_length: # Check if we’ve reached the length limit
break

output = model(next_input)
next_token_logits = output.logits[:, -1, :]
next_token_id = torch.argmax(next_token_logits, dim=-1).unsqueeze(0)
print(tokenizer.decode(next_token_id[0].cpu().tolist(), skip_special_tokens=True), end='', flush=True)

next_input = torch.cat([next_input, next_token_id.to("cuda")], dim=-1)

current_length += 1

if next_token_id[0].item() == tokenizer.eos_token_id:
    break

And when I run inference, it spits out nonsense:

Vel Educational这样"… Fond visitors bangs ClassesINSenciasoney Bills analyzed ll quere Fond表 Fond QB lips Sociology asegur betray Killer birthplace geb"…表 Fond表

Any idea what is going on?

Topic		Replies	Views
OOM issues with exported vs. model card models Models	1	300	March 9, 2021
Is this CUDA memory error on Inference API coming from HuggingFace or Google Collab? Beginners	0	613	July 20, 2021
OOM issues with save_pretrained models 🤗Transformers	0	1063	March 9, 2021
PushToHubCallback not uploading the model on huggingface automatically 🤗Transformers	10	1430	May 12, 2022
CUDA out of memory for Longformer Beginners	6	1281	October 22, 2021

Uploading and Download Model Errors

Related topics