Loading the model fails for sentence-transformers/sentence-t5-xl

Sujyothi · March 29, 2025, 9:16pm

I am trying to load the model 'sentence-transformers/sentence-t5-xl'. First I save it locally and upload it to S3:

import os
import boto3
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("sentence-transformers/sentence-t5-xl")
tmp_dir = "sentence-t5-xl"
model.save(tmp_dir)

s3 = boto3.client("s3")
bucket = "bucket_name"
s3_prefix = "models/sentence-t5-xl"

# Upload every saved file, keeping its path relative to tmp_dir in the S3 key
for root, _, files in os.walk(tmp_dir):
    for file in files:
        local_path = os.path.join(root, file)
        s3_path = os.path.join(s3_prefix, os.path.relpath(local_path, tmp_dir))
        s3.upload_file(local_path, bucket, s3_path)

print(f"Model uploaded to s3://{bucket}/{s3_prefix}")

The model is saved, but loading it back fails:

import os

def download_s3_directory(bucket_name, s3_prefix, local_dir):
    # Create an S3 client
    s3_client = boto3.client("s3")

    # List objects in the S3 directory (prefix)
    response = s3_client.list_objects_v2(Bucket=bucket_name, Prefix=s3_prefix)

    # Ensure the local directory exists
    if not os.path.exists(local_dir):
        os.makedirs(local_dir)

    # Iterate over the files and download them
    for obj in response.get('Contents', []):
        s3_key = obj['Key']
        file_name = os.path.basename(s3_key)  # Get the file name from the S3 key
        local_file_path = os.path.join(local_dir, file_name)

        if not file_name:  # Skip directories in S3
            continue

        print(f"Downloading {s3_key} -> {local_file_path}...")
        s3_client.download_file(bucket_name, s3_key, local_file_path)

import os
import torch
from sentence_transformers import SentenceTransformer

def load_model_from_s3(bucket, model_path, local_dir="/tmp/sentence-t5-xl"):
    # Download only if there is no local copy yet
    if not os.path.exists(local_dir):
        download_s3_directory(bucket, model_path, local_dir)
    # Put the Hugging Face libraries into offline mode
    os.environ["TRANSFORMERS_OFFLINE"] = "1"
    os.environ["HF_DATASETS_OFFLINE"] = "1"
    os.environ["HF_HUB_OFFLINE"] = "1"
    torch.set_grad_enabled(False)
    return SentenceTransformer(local_dir, local_files_only=True)

model = load_model_from_s3(bucket, sbert_model_path)

This shows the error:

    modules, self.module_kwargs = self._load_sbert_model(
  File "/home/hadoop/environment/lib64/python3.9/site-packages/sentence_transformers/SentenceTransformer.py", line 1736, in _load_sbert_model
    module_path = load_dir_path(
  File "/home/hadoop/environment/lib64/python3.9/site-packages/sentence_transformers/util.py", line 1399, in load_dir_path
    repo_path = snapshot_download(**download_kwargs)
  File "/home/hadoop/environment/lib64/python3.9/site-packages/huggingface_hub/utils/_validators.py", line 114, in _inner_fn
    return fn(*args, **kwargs)
  File "/home/hadoop/environment/lib64/python3.9/site-packages/huggingface_hub/_snapshot_download.py", line 219, in snapshot_download
    raise LocalEntryNotFoundError(
huggingface_hub.errors.LocalEntryNotFoundError: Cannot find an appropriate cached snapshot folder for the specified revision on the local disk and outgoing traffic has been disabled. To enable repo look-ups and downloads online, pass 'local_files_only=False' as input.

I'm not sure what the problem is. I am using sentence-transformers==3.3.1.

John6666 · March 30, 2025, 5:15am

I can’t reproduce the bug… Is it a problem specific to S3, or is it a problem with dependencies…?

import os
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("sentence-transformers/sentence-t5-xl")
tmp_dir = "sentence-t5-xl"
tmp_dir_dummy = "sentence-t5-xl-dummy"
model.save(tmp_dir)
print(model)
# SentenceTransformer(
#  (0): Transformer({'max_seq_length': 256, 'do_lower_case': False}) with Transformer model: T5EncoderModel ...

model2 = SentenceTransformer(tmp_dir, local_files_only=True)
print(model2)
# SentenceTransformer(
#  (0): Transformer({'max_seq_length': 256, 'do_lower_case': False}) with Transformer model: T5EncoderModel ...

model3 = SentenceTransformer(tmp_dir_dummy, local_files_only=True)
# OSError: We couldn't connect to 'https://huggingface.co' to load this file, couldn't find it in the cached files and it looks like sentence-transformers/sentence-t5-xl-dummy is not the path to a directory containing a file named config.json.

os.makedirs(tmp_dir_dummy, exist_ok=True)
model4 = SentenceTransformer(tmp_dir_dummy, local_files_only=True)
# OSError: Error no file named pytorch_model.bin, model.safetensors, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory sentence-t5-xl-dummy.
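
One way to check whether a local copy is complete is to compare it against the modules.json file that model.save() writes. The sketch below assumes the usual saved layout, in which modules.json is a list of entries that each carry a "path" relative to the model directory. The helper name find_missing_module_dirs is made up for illustration and is not part of sentence-transformers.

```python
import json
import os


def find_missing_module_dirs(model_dir):
    """Return module paths listed in modules.json that are missing on disk.

    Assumes the layout written by SentenceTransformer.save(): a modules.json
    file whose entries each carry a "path" relative to model_dir.
    """
    modules_file = os.path.join(model_dir, "modules.json")
    if not os.path.isfile(modules_file):
        return ["modules.json"]

    with open(modules_file, encoding="utf-8") as fh:
        modules = json.load(fh)

    missing = []
    for module in modules:
        rel_path = module.get("path", "")
        # An empty path means the module lives in model_dir itself.
        target = os.path.join(model_dir, rel_path) if rel_path else model_dir
        if not os.path.isdir(target):
            missing.append(rel_path)
    return missing
```

A non-empty result for the directory downloaded from S3 would suggest that subfolders were lost along the way, which could explain why the loader falls back to looking the module up on the Hub.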

It seems that in some cases, the problem can be fixed by updating the huggingface_hub library.

github.com/huggingface/huggingface_hub

Issue with SentenceTransformer Model Validation for Local Paths

opened 09:43AM - 30 Jan 24 UTC

AndreaArmx

bug

### Describe the bug

Currently, there is an issue in the **validate_hf_hub_args** function within the **utils._validators.py** file that prevents the proper loading of models from local paths. The problem lies in the following check:

```
if arg_name in ["repo_id", "from_id", "to_id"]:
    validate_repo_id(arg_value)
```

The problem is that this condition is checked without considering the value of the **local_files_only** field. This creates a conflict, as validation erroneously fails for local paths, which can contain multiple / characters. It is recommended to update the above condition to also consider the value of **local_files_only**. The condition should be modified similar to the following:

```
if not kwargs["local_files_only"] and arg_name in ["repo_id", "from_id", "to_id"]:
    validate_repo_id(arg_value)
```

otherwise the **validate_repo_id** function will be called and perform the following check which leads to the failure:

```
if repo_id.count("/") > 1:
    raise HFValidationError(
        "Repo id must be in the form 'repo_name' or 'namespace/repo_name':"
        f" '{repo_id}'. Use `repo_type` argument if needed."
    )
```

### Reproduction

```
from sentence_transformers import SentenceTransformer

sentence_transformer_model = SentenceTransformer(
    model_name_or_path="/Users/john/PycharmProjects/LLM/llm-embedding/models/all-MiniLM-L6-v2",
    device="cpu",
)
```

### Logs

_No response_

### System info

```shell
- huggingface_hub version: 0.20.3
- Platform: macOS-12.3-arm64-arm-64bit
- Python version: 3.10.5
- Running in iPython ?: No
- Running in notebook ?: No
- Running in Google Colab ?: No
- Token path ?: /Users/john/.cache/huggingface/token
- Has saved token ?: False
- Configured git credential helpers: osxkeychain
- FastAI: N/A
- Tensorflow: N/A
- Torch: 2.1.2
- Jinja2: 3.1.3
- Graphviz: N/A
- Pydot: N/A
- Pillow: 10.2.0
- hf_transfer: N/A
- gradio: N/A
- tensorboard: N/A
- numpy: 1.26.3
- pydantic: 1.10.14
- aiohttp: N/A
- ENDPOINT: https://huggingface.co
- HF_HUB_CACHE: /Users/john/.cache/huggingface/hub
- HF_ASSETS_CACHE: /Users/john/.cache/huggingface/assets
- HF_TOKEN_PATH: /Users/john/.cache/huggingface/token
- HF_HUB_OFFLINE: False
- HF_HUB_DISABLE_TELEMETRY: False
- HF_HUB_DISABLE_PROGRESS_BARS: None
- HF_HUB_DISABLE_SYMLINKS_WARNING: False
- HF_HUB_DISABLE_EXPERIMENTAL_WARNING: False
- HF_HUB_DISABLE_IMPLICIT_TOKEN: False
- HF_HUB_ENABLE_HF_TRANSFER: False
- HF_HUB_ETAG_TIMEOUT: 10
- HF_HUB_DOWNLOAD_TIMEOUT: 10
```

pip install -U huggingface_hub
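
Before and after upgrading, it can help to record which versions are actually installed in the failing environment. This is a small sketch using only the standard library; the function name installed_versions and the default package list are choices made here for illustration.

```python
from importlib import metadata


def installed_versions(packages=("sentence-transformers", "huggingface_hub", "transformers")):
    """Map each package name to its installed version, or None if absent."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

Comparing this output between a working and a failing run is a cheap way to confirm or rule out a dependency change.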

Sujyothi · March 30, 2025, 12:18pm

It had been working for a few days and has now started failing. Can you show me a way to load the model from S3 instead of calling SentenceTransformer("sentence-transformers/sentence-t5-xl")?
If I save it to S3 and load it from S3, I get a lot of errors and cannot load the model properly.
If I use SentenceTransformer("sentence-transformers/sentence-t5-xl") for inference, I get: requests.exceptions.ReadTimeout: (ReadTimeoutError("HTTPSConnectionPool(host='cdn-lfs.hf.co', port=443): Read timed out. (read timeout=10)"), '(Request ID: 0af29d71-521b-4a4e-bbfc-eed6c171e671)')

John6666 · March 30, 2025, 12:47pm

Although we don’t know which is the real cause, the actual process of downloading from the Hugging Face Hub is usually handled by the huggingface_hub library, so if you can solve or avoid the error in that part, you can often get around it.

Well, there are also individual library cache issues from time to time…
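
For the read timeout specifically, two things that may be worth trying are raising the huggingface_hub timeouts and retrying the load. The sketch below assumes the HF_HUB_DOWNLOAD_TIMEOUT and HF_HUB_ETAG_TIMEOUT environment variables (both default to 10 seconds) are set before the Hugging Face libraries are imported. The with_retries helper and the value 60 are illustrative choices, not something recommended in this thread.

```python
import os
import time

# Assumption: set these before importing huggingface_hub, transformers, or
# sentence_transformers, since the values are read when the library loads.
os.environ.setdefault("HF_HUB_DOWNLOAD_TIMEOUT", "60")
os.environ.setdefault("HF_HUB_ETAG_TIMEOUT", "60")


def with_retries(fn, attempts=3, delay_seconds=5.0):
    """Call fn() until it succeeds or attempts run out, then re-raise."""
    for attempt in range(1, attempts + 1):
        try:
            return fn()
        except Exception as exc:  # narrow this to the errors you actually see
            if attempt == attempts:
                raise
            print(f"Attempt {attempt} failed ({exc!r}); retrying in {delay_seconds}s")
            time.sleep(delay_seconds)
```

It could be used as with_retries(lambda: SentenceTransformer("sentence-transformers/sentence-t5-xl")). This only works around a slow or flaky connection; it does nothing if outgoing traffic is blocked.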

github.com/huggingface/huggingface_hub

requests.exceptions.ConnectionError: HTTPSConnectionPool(host='cdn-lfs.huggingface.co', port=443): Read timed out.

opened 06:51PM - 08 Oct 23 UTC

closed 02:27PM - 10 Oct 23 UTC

gauraviiita

Hi there I am running a command as given below, but each time, I get an error as maintained below. Please help to deal with this error. Thanking you

The command is as follows:

```py
python conceptgraph/scripts/generate_gsa_results.py \
    --dataset_root /home/gaurav/Desktop/Robotics/concept-graphs/Datasets/Replica \
    --dataset_config /home/gaurav/Desktop/Robotics/concept-graphs/conceptgraph/dataset/dataconfigs/replica/replica.yaml \
    --scene_id room0 \
    --class_set none \
    --stride 5
```

The error is as follows:

```sh
(conceptgraph) gaurav@gaurav-GF65-Thin-10UE:~/Desktop/Robotics/concept-graphs$ python conceptgraph/scripts/generate_gsa_results.py \
    --dataset_root /home/gaurav/Desktop/Robotics/concept-graphs/Datasets/Replica \
    --dataset_config /home/gaurav/Desktop/Robotics/concept-graphs/conceptgraph/dataset/dataconfigs/replica/replica.yaml \
    --scene_id room0 \
    --class_set ram \
    --box_threshold 0.2 \
    --text_threshold 0.2 \
    --stride 5 \
    --add_bg_classes \
    --accumu_classes \
    --exp_suffix withbg_allclasses
/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/torchvision/io/image.py:13: UserWarning: Failed to load image Python extension: /home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/torchvision/image.so: undefined symbol: _ZN3c106detail19maybe_wrap_dim_slowEllb
  warn(f"Failed to load image Python extension: {e}")
[2023-10-08 23:43:44,596] [INFO] [real_accelerator.py:110:get_accelerator] Setting ds_accelerator to cuda (auto detect)
/home/gaurav/Desktop/Robotics/concept-graphs/conceptgraph/groundingdino/models/GroundingDINO/ms_deform_attn.py:31: UserWarning: Failed to load custom C++ ops. Running on CPU mode Only!
  warnings.warn("Failed to load custom C++ ops. Running on CPU mode Only!")
final text_encoder_type: bert-base-uncased
Downloading (…)ip_pytorch_model.bin: 87%|██████████▋ | 3.44G/3.94G [09:54<01:24, 5.98MB/s]
Traceback (most recent call last):
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/urllib3/response.py", line 444, in _error_catcher
    yield
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/urllib3/response.py", line 567, in read
    data = self._fp_read(amt) if not fp_closed else b""
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/urllib3/response.py", line 533, in _fp_read
    return self._fp.read(amt) if amt is not None else self._fp.read()
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/http/client.py", line 466, in read
    s = self.fp.read(amt)
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/socket.py", line 705, in readinto
    return self._sock.recv_into(b)
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/ssl.py", line 1307, in recv_into
    return self.read(nbytes, buffer)
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/ssl.py", line 1163, in read
    return self._sslobj.read(len, buffer)
TimeoutError: The read operation timed out

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/requests/models.py", line 816, in generate
    yield from self.raw.stream(chunk_size, decode_content=True)
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/urllib3/response.py", line 628, in stream
    data = self.read(amt=amt, decode_content=decode_content)
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/urllib3/response.py", line 566, in read
    with self._error_catcher():
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/urllib3/response.py", line 449, in _error_catcher
    raise ReadTimeoutError(self._pool, None, "Read timed out.")
urllib3.exceptions.ReadTimeoutError: HTTPSConnectionPool(host='cdn-lfs.huggingface.co', port=443): Read timed out.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/gaurav/Desktop/Robotics/concept-graphs/conceptgraph/scripts/generate_gsa_results.py", line 652, in <module>
    main(args)
  File "/home/gaurav/Desktop/Robotics/concept-graphs/conceptgraph/scripts/generate_gsa_results.py", line 382, in main
    clip_model, _, clip_preprocess = open_clip.create_model_and_transforms(
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/open_clip/factory.py", line 308, in create_model_and_transforms
    model = create_model(
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/open_clip/factory.py", line 222, in create_model
    checkpoint_path = download_pretrained(pretrained_cfg, cache_dir=cache_dir)
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/open_clip/pretrained.py", line 425, in download_pretrained
    target = download_pretrained_from_hf(model_id, cache_dir=cache_dir)
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/open_clip/pretrained.py", line 395, in download_pretrained_from_hf
    cached_file = hf_hub_download(model_id, filename, revision=revision, cache_dir=cache_dir)
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/huggingface_hub/utils/_validators.py", line 118, in _inner_fn
    return fn(*args, **kwargs)
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 1364, in hf_hub_download
    http_get(
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 541, in http_get
    for chunk in r.iter_content(chunk_size=10 * 1024 * 1024):
  File "/home/gaurav/conda/envs/conceptgraph/lib/python3.10/site-packages/requests/models.py", line 822, in generate
    raise ConnectionError(e)
requests.exceptions.ConnectionError: HTTPSConnectionPool(host='cdn-lfs.huggingface.co', port=443): Read timed out.
Downloading (…)ip_pytorch_model.bin: 87%|██████████▋ | 3.44G/3.94G [10:06<01:29, 5.67MB/s]
```

github.com/huggingface/transformers

SSLError: HTTPSConnectionPool(host='huggingface.co', port=443)

opened 03:46PM - 08 Jun 22 UTC

closed 03:02PM - 15 Aug 22 UTC

alexsomoza

I'm trying in python: from sentence_transformers import SentenceTransformer …sbert_model = SentenceTransformer('all-MiniLM-L6-v2') and I get this error: SSLError: HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url: /api/models/sentence-transformers/all-MiniLM-L6-v2 (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: self signed certificate in certificate chain (_ssl.c:1091)'))) I have no proxy, just getting direct to internet !!!

github.com/huggingface/huggingface_hub

Unable to use HF cache on a readonly filesystem

opened 03:04AM - 29 Jan 25 UTC

stodoran

bug

### Describe the bug

Even if all relevant files are downloaded to the folder specified by `HF_HUB_CACHE`, the `hf_hub_download` function will always fail due to [these lines](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/file_download.py#L977-L978).

### Reproduction

* Create directory `/hf_cache`
* Download some asset to said directory, e.g. `huggingface-cli download Qwen/Qwen2.5-32B-Instruct`
* Change directory `/hf_cache` to readonly mode
* Try to load the asset, e.g. `AutoModel.from_pretrained("Qwen/Qwen2.5-32B-Instruct")`

### Logs

```shell
[rank5]: Traceback (most recent call last):
...
[rank5]:     config_new = AutoConfig.from_pretrained(model_name_or_path, token=access_token)
[rank5]:                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank5]:   File ".venv/lib/python3.12/site-packages/transformers/models/auto/configuration_auto.py", line 1006, in from_pretrained
[rank5]:     config_dict, unused_kwargs = PretrainedConfig.get_config_dict(pretrained_model_name_or_path, **kwargs)
[rank5]:                                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank5]:   File ".venv/lib/python3.12/site-packages/transformers/configuration_utils.py", line 570, in get_config_dict
[rank5]:     config_dict, kwargs = cls._get_config_dict(pretrained_model_name_or_path, **kwargs)
[rank5]:                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank5]:   File ".venv/lib/python3.12/site-packages/transformers/configuration_utils.py", line 629, in _get_config_dict
[rank5]:     resolved_config_file = cached_file(
[rank5]:                            ^^^^^^^^^^^^
[rank5]:   File ".venv/lib/python3.12/site-packages/transformers/utils/hub.py", line 403, in cached_file
[rank5]:     resolved_file = hf_hub_download(
[rank5]:                     ^^^^^^^^^^^^^^^^
[rank5]:   File ".venv/lib/python3.12/site-packages/huggingface_hub/utils/_validators.py", line 114, in _inner_fn
[rank5]:     return fn(*args, **kwargs)
[rank5]:            ^^^^^^^^^^^^^^^^^^^
[rank5]:   File ".venv/lib/python3.12/site-packages/huggingface_hub/file_download.py", line 860, in hf_hub_download
[rank5]:     return _hf_hub_download_to_cache_dir(
[rank5]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank5]:   File ".venv/lib/python3.12/site-packages/huggingface_hub/file_download.py", line 977, in _hf_hub_download_to_cache_dir
[rank5]:     os.makedirs(os.path.dirname(blob_path), exist_ok=True)
[rank5]:   File "<frozen os>", line 215, in makedirs
[rank5]:   File "<frozen os>", line 215, in makedirs
[rank5]:   File "<frozen os>", line 225, in makedirs
[rank5]: OSError: [Errno 30] Read-only file system: '/app/hub_cache'
```

### System info

```shell
- huggingface_hub version: 0.28.0
- Platform: Linux-6.8.0-1020-gcp-x86_64-with-glibc2.35
- Python version: 3.12.8
- Running in iPython ?: No
- Running in notebook ?: No
- Running in Google Colab ?: No
- Running in Google Colab Enterprise ?: No
- Has saved token ?: True
- Configured git credential helpers:
- FastAI: N/A
- Tensorflow: N/A
- Torch: 2.5.1
- Jinja2: 3.1.5
- Graphviz: N/A
- keras: N/A
- Pydot: 3.0.3
- Pillow: 10.4.0
- hf_transfer: N/A
- gradio: N/A
- tensorboard: N/A
- numpy: 1.26.4
- pydantic: 2.10.4
- aiohttp: 3.11.11
- ENDPOINT: https://huggingface.co
- HF_HUB_CACHE: /app/hf_cache
- HF_HUB_OFFLINE: False
- HF_HUB_DISABLE_TELEMETRY: False
- HF_HUB_DISABLE_PROGRESS_BARS: None
- HF_HUB_DISABLE_SYMLINKS_WARNING: False
- HF_HUB_DISABLE_EXPERIMENTAL_WARNING: False
- HF_HUB_DISABLE_IMPLICIT_TOKEN: False
- HF_HUB_ENABLE_HF_TRANSFER: False
- HF_HUB_ETAG_TIMEOUT: 10
- HF_HUB_DOWNLOAD_TIMEOUT: 10
```

Sujyothi · March 30, 2025, 12:57pm

I would appreciate it if you could show me the right way to load the model from S3 instead of
model = SentenceTransformer("sentence-transformers/sentence-t5-xl"). It would be like loading a pretrained model:

Save the model to S3
Download the model from S3
Load it for inference

John6666 · March 30, 2025, 1:09pm

I don’t know how S3 behaves…

There are also the following ways to download the Hugging Face model.

by Hugging Chat

To load a Sentence Transformer model from Amazon S3, you can follow these steps:

Step 1: Save the Model to a Directory

First, save your pretrained Sentence Transformer model to a local directory.

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("sentence-transformers/sentence-t5-xl")
model.save("saved_model")

Step 2: Upload the Model to S3

Use boto3 to upload the saved model files to an S3 bucket.

import boto3
import os
from botocore.exceptions import NoCredentialsError

def upload_to_s3(bucket_name, local_path, s3_prefix):
    s3 = boto3.client('s3')
    for root, dirs, files in os.walk(local_path):
        for file in files:
            local_file = os.path.join(root, file)
            s3_key = os.path.join(s3_prefix, os.path.relpath(local_file, local_path))
            try:
                s3.upload_file(local_file, bucket_name, s3_key)
                print(f"Uploaded {local_file} to s3://{bucket_name}/{s3_key}")
            except NoCredentialsError:
                print("Credentials not available")
                return False
    return True

# Replace with your bucket name and S3 path
upload_to_s3("your-bucket-name", "saved_model", "path/to/model")

Step 3: Download the Model from S3

When you need to load the model, download the files from S3 into a temporary directory.

import os
import shutil
import tempfile
from contextlib import contextmanager

import boto3

@contextmanager
def download_model_from_s3(bucket_name, s3_prefix, temp_dir):
    s3 = boto3.client('s3')
    try:
        # List all objects under the prefix
        paginator = s3.get_paginator('list_objects_v2')
        pages = paginator.paginate(Bucket=bucket_name, Prefix=s3_prefix)
        
        for page in pages:
            for obj in page.get('Contents', []):
                key = obj['Key']
                local_path = os.path.join(temp_dir, os.path.relpath(key, s3_prefix))
                os.makedirs(os.path.dirname(local_path), exist_ok=True)
                s3.download_file(bucket_name, key, local_path)
                print(f"Downloaded {key} to {local_path}")
        yield temp_dir
    finally:
        # Clean up the temporary directory
        shutil.rmtree(temp_dir, ignore_errors=True)

Step 4: Load the Model for Inference

Use the temporary directory to load the model.

with tempfile.TemporaryDirectory() as temp_dir:
    with download_model_from_s3("your-bucket-name", "path/to/model", temp_dir) as model_dir:
        model = SentenceTransformer.from_pretrained(model_dir)
        # Use the model for inference
        embeddings = model.encode("Your text here")
        print(embeddings)

Summary

Save the Model: Use model.save() to save the model to a local directory.
Upload to S3: Use boto3 to upload the entire directory to your S3 bucket.
Download from S3: When needed, download the model files into a temporary directory.
Load the Model: Use the temporary directory to load the model for inference.

This method ensures that all necessary model files are correctly handled, and the temporary directory is cleaned up automatically. Make sure your AWS credentials are properly configured to allow access to the S3 bucket.
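
One detail in the upload step may matter on some platforms: os.path.join builds the S3 key with the local path separator, which is a backslash on Windows, while S3 keys conventionally use "/". The sketch below shows one way to build keys that always use forward slashes; the helper name s3_key_for is hypothetical.

```python
import os
import posixpath


def s3_key_for(local_file, local_root, s3_prefix):
    """Build an S3 key that mirrors local_file's position under local_root."""
    rel_path = os.path.relpath(local_file, local_root)
    # S3 keys use "/", so split on the local separator and rejoin.
    parts = rel_path.split(os.sep)
    return posixpath.join(s3_prefix, *parts)
```

Inside the os.walk loop, this could replace the os.path.join call that computes s3_key. On Linux the result is the same as before, so this is a portability precaution and not a fix for the error reported here.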

John6666 · March 30, 2025, 1:16pm

Also by Hugging Chat.

The error occurs because the model cannot find the necessary files in the local cache and is prohibited from downloading online. To resolve this, you need to ensure all required files are present in the correct directory and adjust how the model is loaded. Here’s how to fix it:

Inspect the Model Structure: Verify that all necessary files (e.g., config.json, the weights file such as pytorch_model.bin or model.safetensors, and the tokenizer files such as tokenizer.json) are included when saving the model.
Modify Cache Directory: Set the TRANSFORMERS_CACHE environment variable to point to the local directory where the model is saved.
Adjust Loading Code: Ensure that the model loads from the correct local directory without relying on the default cache.

Here’s the fixed code:

import os
from sentence_transformers import SentenceTransformer

def load_model_from_s3(bucket, model_path, local_dir="/tmp/sentence-t5-xl"):
    # Ensure necessary imports and settings
    import boto3
    from sentence_transformers import SentenceTransformer

    # Create S3 client
    s3_client = boto3.client('s3')

    # Download the model from S3 to local_dir
    if not os.path.exists(local_dir):
        os.makedirs(local_dir)
    
    # List objects in the S3 directory
    response = s3_client.list_objects_v2(Bucket=bucket, Prefix=model_path)
    
    for obj in response.get('Contents', []):
        s3_key = obj['Key']
        file_name = os.path.basename(s3_key)
        local_file_path = os.path.join(local_dir, file_name)
        
        if not file_name:
            continue
        
        print(f"Downloading {s3_key} -> {local_file_path}...")
        s3_client.download_file(bucket, s3_key, local_file_path)
    
    # Set environment variables
    os.environ["TRANSFORMERS_OFFLINE"] = "1"
    os.environ["HF_DATASETS_OFFLINE"] = "1"
    os.environ["HF_HUB_OFFLINE"] = "1"
    os.environ["TRANSFORMERS_CACHE"] = local_dir  # Point cache to the local directory
    
    # Load the model from the local directory
    return SentenceTransformer(local_dir, local_files_only=True)

# Usage example
bucket_name = "your-bucket-name"
s3_model_path = "models/sentence-t5-xl"
model_load_dir = "/tmp/sentence-t5-xl"

model = load_model_from_s3(bucket_name, s3_model_path, model_load_dir)

Explanation:

Environment Variable Adjustment: By setting TRANSFORMERS_CACHE to point to the local directory, we ensure the model looks for files in the correct location.
Downloading All Files: The code ensures all necessary files are downloaded from S3 to the local directory.
Loading Correctly: The model is loaded with local_files_only=True to prevent online downloads and using the specified directory.

This should resolve the LocalEntryNotFoundError by ensuring the model finds all required files locally.
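
One thing that may be worth checking, although it is not confirmed anywhere in this thread: the download helpers that use os.path.basename write every object directly into local_dir, so any subfolder in the S3 key is dropped. A model written by model.save() usually contains subfolders such as 1_Pooling, and flattening them would both remove those folders and let files with the same name overwrite each other. The sketch below keeps the relative path of each key. The function name download_prefix is made up, and s3_client is assumed to behave like boto3.client("s3").

```python
import os


def download_prefix(s3_client, bucket, s3_prefix, local_dir):
    """Download every object under s3_prefix into local_dir, keeping subfolders.

    s3_client is expected to behave like boto3.client("s3").
    """
    prefix = s3_prefix.rstrip("/") + "/"
    downloaded = []
    # The paginator also handles prefixes with more than 1000 objects.
    paginator = s3_client.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        for obj in page.get("Contents", []):
            key = obj["Key"]
            if key.endswith("/"):
                continue  # skip "directory" placeholder objects
            rel_key = key[len(prefix):]
            local_path = os.path.join(local_dir, *rel_key.split("/"))
            os.makedirs(os.path.dirname(local_path), exist_ok=True)
            s3_client.download_file(bucket, key, local_path)
            downloaded.append(local_path)
    return downloaded
```

After a call such as download_prefix(boto3.client("s3"), bucket_name, s3_model_path, model_load_dir), the directory can be passed to SentenceTransformer(model_load_dir, local_files_only=True) as in the code above.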

Sujyothi · March 30, 2025, 1:18pm

SentenceTransformer.from_pretrained throws an error saying that SentenceTransformer has no method named from_pretrained. I also tried Hugging Chat, but I have not managed to load the model.

John6666 · March 30, 2025, 1:22pm

I think it’s probably a version problem with the huggingface_hub library. It’s not the version of sentence_transformers but the version of huggingface_hub that changes how saving and loading behave (serialization, deserialization, downloading, uploading, cache management, token management…)…

In that case, the fastest thing to do is to downgrade.

The next thing to consider is changing the default permission settings on the cloud service side. In this case, you can sometimes deal with it by changing the temporary folder by changing things like HF_HOME.

Anyway, it’s certain that something has changed between when it was working and now when it’s not working. The only thing that could have changed within the scope of what we can do easily here is the versions of the libraries.

If it’s a change on the cloud service side, it’s going to be quite difficult, but in that case, I think someone else might have noticed…

John6666 · March 30, 2025, 1:33pm

So if I read the first error message here, it tries to download and fails…
I think it did not try to download before, if the code was the same.

Incidentally, if you follow the source, it seems that the original download method uses hf_hub_download.

github.com/UKPLab/sentence-transformers

sentence_transformers/SentenceTransformer.py

master


      
              is_sentence_transformer_model,
              load_dir_path,
              load_file_path,
              save_to_hub_args_decorator,
              truncate_embeddings,
          )
          
          logger = logging.getLogger(__name__)
          
          
          class SentenceTransformer(nn.Sequential, FitMixin, PeftAdapterMixin):
              """
              Loads or creates a SentenceTransformer model that can be used to map sentences / text to embeddings.
          
              Args:
                  model_name_or_path (str, optional): If it is a filepath on disc, it loads the model from that path. If it is not a path,
                      it first tries to download a pre-trained SentenceTransformer model. If that fails, tries to construct a model
                      from the Hugging Face Hub with that name.
                  modules (Iterable[nn.Module], optional): A list of torch Modules that should be called sequentially, can be used to create custom
                      SentenceTransformer models from scratch.
                  device (str, optional): Device (like "cuda", "cpu", "mps", "npu") that should be used for computation. If None, checks if a GPU

github.com/UKPLab/sentence-transformers

sentence_transformers/util.py

master


      
                      model_name_or_path,
                      "modules.json",
                      token=token,
                      cache_folder=cache_folder,
                      revision=revision,
                      local_files_only=local_files_only,
                  )
              )
          
          
          def load_file_path(
              model_name_or_path: str,
              filename: str,
              token: bool | str | None = None,
              cache_folder: str | None = None,
              revision: str | None = None,
              local_files_only: bool = False,
          ) -> str | None:
              """
              Loads a file from a local or remote location.

github.com/huggingface/huggingface_hub

src/huggingface_hub/file_download.py

main


      
                              "Not enough free disk space to download the file. "
                              f"The expected file size is: {expected_size / 1e6:.2f} MB. "
                              f"The target location {target_dir} only has {target_dir_free / 1e6:.2f} MB free disk space."
                          )
                      return
                  except OSError:  # raise on anything: file does not exist or space disk cannot be checked
                      pass
          
          
          @validate_hf_hub_args
          def hf_hub_download(
              repo_id: str,
              filename: str,
              *,
              subfolder: Optional[str] = None,
              repo_type: Optional[str] = None,
              revision: Optional[str] = None,
              library_name: Optional[str] = None,
              library_version: Optional[str] = None,
              cache_dir: Union[str, Path, None] = None,
              local_dir: Union[str, Path, None] = None,

Sujyothi · March 30, 2025, 1:40pm

There are two ways to download →

model = SentenceTransformer('sentence-transformers/sentence-t5-xl')
tmp_dir = "sentence-t5-xl"
model.save(tmp_dir)

huggingface-cli download sentence-transformers/sentence-t5-xl --local-dir sentence-t5-xl

I tried uploading these files to S3 and referring to them at inference time, to avoid connection issues with Hugging Face.

The two methods save the model differently.
download_s3_directory(bucket, model_path, local_dir)
model = SentenceTransformer(local_dir)
shows the error for cache files; even if I point the cache at the local dir, it doesn’t work:
return SentenceTransformer(local_dir, cache_folder=local_dir, local_files_only=True)

John6666 · March 30, 2025, 1:47pm

shows the error for cache files, even if I specify cache to refer local dir it doesn’t work

Actually, a similar error was reported on HF Discord yesterday, and in that case, only the Transformer model downloaded from git did not work. He had apparently worked around the problem by doing from_pretrained and save_pretrained beforehand…

Could it be that the cause of the error is the same…?

My suspicion is that it doesn’t work properly if there are any additional files, because it doesn’t happen with a repo that has a simple structure with only a single model…

Sujyothi · March 30, 2025, 1:53pm

Can you give the reference ?

John6666 · March 30, 2025, 1:55pm

The one with the error: deepseek-ai/DeepSeek-R1
The one without problems: HuggingFaceTB/SmolLM2-135M-Instruct

John6666 · March 30, 2025, 2:06pm

Hmm… Perhaps this commit? You may be able to get around this problem for the time being by setting the cache_folder argument or the SENTENCE_TRANSFORMERS_HOME environment variable.

github.com/huggingface/huggingface_hub

src/huggingface_hub/_snapshot_download.py

main


      
              [`~utils.RevisionNotFoundError`]
                  If the revision to download from cannot be found.
              [`EnvironmentError`](https://docs.python.org/3/library/exceptions.html#EnvironmentError)
                  If `token=True` and the token cannot be found.
              [`OSError`](https://docs.python.org/3/library/exceptions.html#OSError) if
                  ETag cannot be determined.
              [`ValueError`](https://docs.python.org/3/library/exceptions.html#ValueError)
                  if some parameter value is invalid.
          """
          if cache_dir is None:
              cache_dir = constants.HF_HUB_CACHE
          if revision is None:
              revision = constants.DEFAULT_REVISION
          if isinstance(cache_dir, Path):
              cache_dir = str(cache_dir)
          
          if repo_type is None:
              repo_type = "model"
          if repo_type not in constants.REPO_TYPES:
              raise ValueError(f"Invalid repo type: {repo_type}. Accepted repo types are: {str(constants.REPO_TYPES)}")

github.com/UKPLab/sentence-transformers

sentence_transformers/util.py

master


      
          dir_path = os.path.join(model_name_or_path, directory)
          if os.path.exists(dir_path):
              return dir_path
          
          download_kwargs = {
              "repo_id": model_name_or_path,
              "revision": revision,
              "allow_patterns": f"{directory}/**" if directory not in ["", "."] else None,
              "library_name": "sentence-transformers",
              "token": token,
              "cache_dir": cache_folder,
              "local_files_only": local_files_only,
              "tqdm_class": disabled_tqdm,
          }
          # Try to download from the remote
          try:
              repo_path = snapshot_download(**download_kwargs)
          except Exception:
              # Otherwise, try local (i.e. cache) only
              download_kwargs["local_files_only"] = True
              repo_path = snapshot_download(**download_kwargs)
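
Reading the load_dir_path excerpt above: when the module sub-folder is not found under the local model path, the function falls through to snapshot_download, which might explain a download attempt (and a cache error when offline) even though a local directory was passed. A quick check for that case, assuming the usual modules.json layout written by model.save() (a list of entries, each with a "path" field). missing_module_dirs is a hypothetical helper, not part of sentence-transformers:

```python
import json
import os


def missing_module_dirs(model_dir):
    """Return module sub-folders listed in modules.json that are absent on disk."""
    with open(os.path.join(model_dir, "modules.json"), encoding="utf-8") as f:
        modules = json.load(f)

    missing = []
    for module in modules:
        path = module.get("path", "")
        # An empty path means the module lives in the model root itself.
        if path and not os.path.isdir(os.path.join(model_dir, path)):
            missing.append(path)
    return missing
```

If this returns a non-empty list for the directory downloaded from S3, the folder structure was probably lost somewhere between upload and download.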

Sujyothi · March 31, 2025, 2:29am

Thank you John6666
os.environ['TRANSFORMERS_OFFLINE'] = '1'
os.environ['HF_DATASETS_OFFLINE'] = '1'
os.environ['HF_HUB_OFFLINE'] = '1'
helped me continue.
Instead of downloading it, I created a zipped version of the model, uploaded it to S3, and referred to it using
--conf spark.archives={model_path}/sentence-t5-xl.zip#sentence-t5-xl
--conf spark.driverEnv.SENTENCE_TRANSFORMERS_HOME=./sentence-t5-xl
--conf spark.executorEnv.SENTENCE_TRANSFORMERS_HOME=./sentence-t5-xl
Then
model = SentenceTransformer(model_name_or_path='sentence-t5-xl', device='cpu', local_files_only=True)
resolved the issue.
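
For the zip step, a rough sketch, assuming the archive should hold the model files at its top level so that the folder unpacked under the #sentence-t5-xl alias contains modules.json directly. That layout is an assumption, not something confirmed in this thread, and zip_model_dir is an illustrative name:

```python
import os
import shutil


def zip_model_dir(model_dir, archive_base):
    """Zip the contents of model_dir and return the path of the created .zip file."""
    if not os.path.isfile(os.path.join(model_dir, "modules.json")):
        raise FileNotFoundError(f"modules.json not found in {model_dir}")
    # root_dir makes the paths inside the archive relative to model_dir,
    # so sub-folders such as 1_Pooling/ are kept as-is.
    return shutil.make_archive(archive_base, "zip", root_dir=model_dir)
```

Keep archive_base outside model_dir so the archive does not end up inside the folder being zipped; the resulting file can then be uploaded with the same s3.upload_file call used earlier.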
