I spoke to AWS support and they told me that I should set the RAM of the container runtime to 3 GB. With 3GB assigned I the endpoint returns:
{"ErrorCode":"INTERNAL_FAILURE_FROM_MODEL","LogStreamArn":null,"Message":"Received server error (0) from model with message \"Amazon SageMaker could not get a response from the gottbert-job-class-endpoint endpoint.\". .........
.
I checked the CloudWatchLogs and there I see:
python: can't open file '/usr/local/bin/deep_learning_container.py': [Errno 13] Permission denied
It seems that the issue is comparable to this discussion… ?