I am using the “BAAI/bge-reranker-large” model via the
AutoModelForSequenceClassification class to rerank documents relevant to a question in my RAG setup.
Here is the sample code that I am using.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained('BAAI/bge-reranker-large')
model = AutoModelForSequenceClassification.from_pretrained('BAAI/bge-reranker-large')
model.eval()

# Score every (question, document) pair with the reranker.
pairs = [[user_input, doc] for doc in documents]
with torch.no_grad():
    inputs = tokenizer(pairs, padding=True, truncation=True,
                       return_tensors='pt', max_length=512)
    scores = model(**inputs, return_dict=True).logits.view(-1).float()
print(scores)
Here, documents is a list of documents relevant to
user_input, which is the user's question.
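For completeness, I then use these scores to order the documents. A minimal sketch (the query, documents, and score values below are placeholders, not real reranker output):

```python
import torch

# Placeholder inputs standing in for the real query and candidate documents.
user_input = "what is the capital of France?"
documents = ["Paris is the capital of France.", "Bananas are yellow."]

# Placeholder logits, one score per (question, document) pair.
scores = torch.tensor([7.2, -3.1])

# Rerank: sort documents by descending relevance score.
ranked = [doc for _, doc in sorted(zip(scores.tolist(), documents), reverse=True)]
print(ranked[0])  # highest-scoring document comes first
```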
Most of the time I get the expected results, i.e. the document most relevant to the user question is ranked at the top with the highest score.
But sometimes an irrelevant document is ranked at the top for the same question that produced correct results earlier.
How can I reproduce the same results each time?
Is there any parameter (like a seed) that I can set to make the results reproducible?
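For reference, this is the kind of seeding I have in mind (a sketch using PyTorch's standard reproducibility controls; whether these actually affect the reranker at inference time is exactly what I'm unsure about):

```python
import torch

# Standard PyTorch reproducibility knobs (sketch): seed the RNGs and
# ask PyTorch to fail loudly if a nondeterministic op is used.
torch.manual_seed(0)
torch.use_deterministic_algorithms(True)

# Sanity check: re-seeding before the same random computation
# yields identical results.
torch.manual_seed(0)
a = torch.randn(3, 3)
torch.manual_seed(0)
b = torch.randn(3, 3)
print(torch.equal(a, b))  # expect: True
```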