OSError: Error no file named pytorch_model.bin, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory gpt2-finetuned-science-20240111T135646Z-001

pranavsid · January 11, 2024, 4:59pm

Hello folks, I am facing this issue and I don’t know how to deal with it. I finetuned gpt2 model on google colab and then installed pretrained model files locally but now when I am trying to use it locally it keeps giving me this error. The path provided is absolutely correct. Here are files in my directory.

Here is how code looks like:(I also tried using absolute path still no difference)

from transformers import GPT2LMHeadModel, GPT2Tokenizer
import requests
from googlesearch import search
# Load the fine-tuned model
fine_tuned_model = GPT2LMHeadModel.from_pretrained('gpt2-finetuned-science-20240111T135646Z-001/gpt2-finetuned-science/')
# Initialize GPT-2 tokenizer and model

tokenizer = GPT2Tokenizer.from_pretrained('gpt2-finetuned-science-20240111T135646Z-001/gpt2-finetuned-science/', pad_token='<pad>')

# Google Custom Search API configuration
API_KEY = 'AIzaSyCXjFWc4SsycKTTQHELH20_2fjsaB8n6UE'
CX = '922c9da5c6a21421d'

import spacy
from transformers import GPT2LMHeadModel, GPT2Tokenizer

def google_search(query, api_key, cx):
    base_url = "https://www.googleapis.com/customsearch/v1"
    params = {
        'key': api_key,
        'cx': cx,
        'q': query,
    }

    response = requests.get(base_url, params=params)
    results = response.json()

    return results
# Load the spaCy English NLP model
nlp = spacy.load("en_core_web_sm")

# Function to extract keywords from a question
def extract_keywords(question):
    doc = nlp(question)
    named_entities = [ent.text for ent in doc.ents]
    nouns = [token.text for token in doc if token.pos_ == "NOUN"]
    keywords = named_entities + nouns
    return keywords

# Function to generate response
def generate_response(prompt, max_length=50):
    # Extract keywords from the prompt
    keywords = extract_keywords(prompt)
    # print(keywords)
    input_ids = tokenizer.encode(prompt, return_tensors='pt', max_length=max_length, truncation=True)
    attention_mask = input_ids.ne(tokenizer.pad_token_id).long()

    output = fine_tuned_model.generate(input_ids, attention_mask=attention_mask, max_length=max_length, num_return_sequences=1)
    generated_text = tokenizer.decode(output[0], skip_special_tokens=True)

    # Check if the main word is not present in the generated response
    # main_word = prompt.split()[0].lower()
    for keyword in keywords:
      if keyword.lower() not in generated_text.lower().split(" "):

        #Perform Google search and use the top result
        results = google_search(prompt,'AIzaSyCXjFWc4SsycKTTQHELH20_2fjsaB8n6UE','922c9da5c6a21421d')
        if 'items' in results and len(results['items']) > 0:
          top_result = results['items'][0]['snippet']
          return f"Sorry, I don't have information on that. Here's what I found on the web: {top_result}"
    return generated_text

# Example usage
user_query = "What is banach space?"
bot_response = generate_response(user_query)
print(bot_response)

17bj5 · May 2, 2024, 9:42pm

Im also facing the same problem please let me know if you have found any solution!

sdelargy · September 19, 2024, 7:42pm

Was there ever a solution to this? I seem to have the same issue from the UI.

Topic		Replies	Views
OSError: Model name 'gpt2' was not found in tokenizers model name list (gpt2,...) 🤗Tokenizers	8	7374	August 10, 2023
OSError: dggokul21/Testcase_Generator does not appear to have a file named pytorch_model.bin, tf_model.h5, model.ckpt or flax_model.msgpack Intermediate	0	208	February 21, 2024
OSError When Trying to Load Model from Local Disk (Offline) Models	2	3395	August 28, 2024
Can't load weights for gpt2 error Beginners	0	1601	July 13, 2020
Loading pre-trained BERT model error - Error no file named ['pytorch_model.bin', 'tf_model.h5'] found Beginners	0	4080	December 1, 2020

OSError: Error no file named pytorch_model.bin, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory gpt2-finetuned-science-20240111T135646Z-001

Related topics