I am running a RAG pipeline with LlamaIndex and a quantized Llama-3-8B-Instruct model. I just installed these libraries:
!pip install --upgrade huggingface_hub
!pip install --upgrade peft
!pip install llama-index bitsandbytes accelerate llama-index-llms-huggingface llama-index-embeddings-huggingface
!pip install --upgrade transformers
!pip install --upgrade sentence-transformers
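To pin down which of these upgrades changed things, I print the installed versions afterwards (a small diagnostic sketch; the package names are the PyPI distribution names, and llama-index sub-packages would be checked the same way):

```python
import importlib.metadata

# Print the installed version of each relevant distribution,
# or note that it is missing from the environment.
for pkg in ("huggingface_hub", "transformers", "llama-index"):
    try:
        print(pkg, importlib.metadata.version(pkg))
    except importlib.metadata.PackageNotFoundError:
        print(pkg, "not installed")
```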
Then I tried to set up the quantization config like this:
import torch
from llama_index.llms.huggingface import HuggingFaceLLM
from transformers import BitsAndBytesConfig

# 4-bit NF4 quantization with double quantization and fp16 compute
quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.float16,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
)
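For context, this config is then passed into the LLM wrapper roughly like this (a sketch of my setup; the model ID is the gated Meta repo, and the other arguments are illustrative, not my exact values):

```python
# Assumes quantization_config from above; downloads the gated
# meta-llama model, so this needs an authenticated HF token and a GPU.
llm = HuggingFaceLLM(
    model_name="meta-llama/Meta-Llama-3-8B-Instruct",
    tokenizer_name="meta-llama/Meta-Llama-3-8B-Instruct",
    model_kwargs={"quantization_config": quantization_config},
    device_map="auto",
)
```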
However, I got this error: ModuleNotFoundError: No module named 'huggingface_hub.inference._types'. The last time I worked with this pipeline, two months ago, the code ran fine, so I think something changed on the LlamaIndex side; especially since, when I clicked through the traceback, it pointed to the line: from huggingface_hub.inference._types import ConversationalOutput. But ConversationalOutput no longer appears anywhere in the huggingface_hub docs.
So, what should I do to fix this error and be able to run this RAG pipeline?