Hello, I’m currently using the mixedbread-ai/mxbai-embed-large-v1 model through a dedicated endpoint (the endpoint configuration is shown in the attached screenshot). My goal is to use this embedding model via the endpoint to convert a PDF file into vectors and store them in a vector database with LangChain. However, when I run the code below I get an error saying “maximum allowed batch size 32”. I would greatly appreciate any help resolving this error.
from langchain_community.document_loaders import PyPDFLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain_community.vectorstores import Chroma
from langchain_community.embeddings import HuggingFaceHubEmbeddings
embeddings = HuggingFaceHubEmbeddings(
    model="https://sze1pr91t48e3kuu.us-east-1.aws.endpoints.huggingface.cloud",
    huggingfacehub_api_token="hf_xxxxx",
)
loader = PyPDFLoader("aws.pdf")
docs = loader.load()
MARKDOWN_SEPARATORS = [
    "\n#{1,6} ",
    "```\n",
    "\n\\*\\*\\*+\n",
    "\n---+\n",
    "\n___+\n",
    "\n\n",
    "\n",
    " ",
    "",
]
text_splitter = RecursiveCharacterTextSplitter(
    chunk_size=5000,  # maximum number of characters per chunk (value chosen arbitrarily)
    chunk_overlap=500,  # number of characters shared between consecutive chunks
    add_start_index=True,  # include each chunk's start index in its metadata
    strip_whitespace=True,  # strip whitespace from the start and end of every chunk
    separators=MARKDOWN_SEPARATORS,
)
docs_processed = []
for doc in docs:
    docs_processed += text_splitter.split_documents([doc])
db = Chroma.from_documents(docs_processed, embeddings, persist_directory="./chroma_db")
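The only workaround I have thought of so far is to create the Chroma store first and then add the split documents in batches of at most 32, so that each embedding request stays under the limit mentioned in the error. Here is a rough sketch of that idea (the batch size of 32 is just taken from the error message, and I haven't confirmed this is the right fix):

batch_size = 32  # guessed from the "maximum allowed batch size 32" error message
db = Chroma(embedding_function=embeddings, persist_directory="./chroma_db")
for i in range(0, len(docs_processed), batch_size):
    # each add_documents call embeds one batch, so no single request sends more than 32 texts
    db.add_documents(docs_processed[i:i + batch_size])

Is batching like this the intended approach, or is there a setting on the endpoint or on the LangChain side that I'm missing?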