| Topic | Replies | Views | Activity |
|---|---|---|---|
| Deploying Sentence Transformer as sagemaker endpoint | 18 | 7956 | March 26, 2024 |
| Calling Sagemaker Endpoint for fine-tuned summarization model | 15 | 5027 | March 22, 2024 |
| Deploy model with prompt-tuned adapter saved in S3 | 0 | 199 | March 21, 2024 |
| [SOLVED] Error of input when requesting batch-transform job of zero-shot-text-classification on SageMaker | 1 | 253 | March 20, 2024 |
| ValueError: Could not load model /opt/ml/model with any of the following classes: (<class 'transformers.models.auto.modeling_auto.AutoModelForCausalLM'>, <class 'transformers.models.llama.modeling_llama.LlamaForCausalLM'>) | 0 | 382 | March 13, 2024 |
| Is there a difference between Llama-2-7b-chat-hf and the Sagemaker version? | 0 | 235 | March 11, 2024 |
| When deployed meta-llama/Llama-2-7b-chat-hf on sagemaker, it resulted in complete hallucinations | 0 | 295 | March 11, 2024 |
| Pre-trained models that can handle text data, numerical, and categorical data | 0 | 188 | March 5, 2024 |
| Multi-Model Endpoint with Hugging Face | 6 | 2373 | March 3, 2024 |
| Volume Size Parameter in HuggingFace Model Class | 1 | 682 | February 29, 2024 |
| Keep getting error '400' status code | 0 | 363 | February 29, 2024 |
| Is it necessary to create model in model.tar.gz format for deployment over amazon sagemaker | 1 | 1212 | February 28, 2024 |
| Sagemaker gpt-j train file error | 27 | 2902 | February 22, 2024 |
| CUDA error when deploying model with custom inference | 0 | 299 | February 21, 2024 |
| How to train KenLM on AWS Sagemaker? | 3 | 1096 | February 11, 2024 |
| Distributed Data Parallel in SageMaker | 0 | 290 | February 5, 2024 |
| How to deploy Whisper for other languages to Sagemaker? | 0 | 293 | February 5, 2024 |
| Cannot invoke sagemaker endpoint, keep getting OS error | 3 | 2764 | February 2, 2024 |
| Multi-task instruction fine-tuning | 1 | 1029 | February 2, 2024 |
| HF_TASK Environment Variable error | 1 | 446 | January 29, 2024 |
| How to make an inference for HuggingFaceModel of type 'image-to-text' | 0 | 485 | January 27, 2024 |
| HuggingFaceModel loading model data from us-east-2 (?) | 4 | 670 | January 27, 2024 |
| Deploy distiluse-base-multilingual-cased-v2 on Sagemaker | 1 | 479 | January 25, 2024 |
| Truncated un-finished response after deploying hugging-face models | 0 | 369 | January 19, 2024 |
| Sentence similarity models on Sagemaker | 6 | 2647 | January 12, 2024 |
| HuggingFace Model hyperparameter search with ray as backend not saving best trial hyperparameters | 0 | 286 | January 8, 2024 |
| Deploying Fine-Tune Falcon 40B with QLoRA on Sagemaker Inference Error | 29 | 6766 | January 8, 2024 |
| Creating Sagemaker Endpoint for 2 models (Segment Anything & YOLOv8) and Invoking it | 0 | 403 | January 6, 2024 |
| Deploying custom inference script with llama2 finetuned model | 6 | 1222 | January 4, 2024 |
| "OS Error: No space left on device" when trying to load a trained model from S3 | 1 | 1285 | December 28, 2023 |