Model crashing with a 1.6 MB txt file?

Hi,

When I run the model below with a very small dataset (a 600 KB .txt file), it works fine.

But when I load the dataset that contains the 1.6 MB .txt file ("Aurelie123/chatbotdatxt"), the model crashes on Google Colab with an out-of-RAM error, even though the file is only 1.6 MB!
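For reference, here is a small check I was thinking of running (it reuses the docs list built in the snippet at the end of this post) to see how many Documents actually get created and how much text they hold; if one example contains the whole file, the reader would be running the model over a single very long document:

# Quick sanity check on the loaded documents (uses the docs list from the snippet below)
print(f"Number of documents: {len(docs)}")
print(f"Total characters: {sum(len(d.content) for d in docs)}")
print(f"Longest document: {max(len(d.content) for d in docs)} characters")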

Has anyone come across this before?

Thanks


from datasets import load_dataset
from haystack import Document
from haystack.components.readers import ExtractiveReader

# Trying on a 4-page .txt dataset, as the full file crashes due to lack of RAM
dataset = load_dataset("Aurelie123/data2")

# Convert the dataset into a list of Documents, each with a string content
docs = [Document(content=example["text"]) for example in dataset["train"]]

reader = ExtractiveReader(model="deepset/roberta-base-squad2")
reader.warm_up()

question = "Can I get more information about computer vision?"
result = reader.run(query=question, documents=docs)
print(result)
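One thing I was also wondering about is whether splitting the documents into smaller chunks before passing them to the reader would avoid the RAM spike, so the model never sees one very long document at once. A minimal sketch of what I mean (the split_length / split_overlap / top_k values are just guesses on my part):

from haystack.components.preprocessors import DocumentSplitter

# Split each long Document into smaller overlapping word chunks
# (split_length and split_overlap values here are only guesses)
splitter = DocumentSplitter(split_by="word", split_length=200, split_overlap=20)
split_docs = splitter.run(documents=docs)["documents"]

# Run the reader on the smaller chunks instead of the full documents
result = reader.run(query=question, documents=split_docs, top_k=3)
print(result)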
