Integrate with SageMaker #66
Comments
Afaict, it shouldn't be very hard to support SageMaker; it would just require going one level deeper in the Truss design. Truss creates a standardized model representation that can be operated on with a This could be a great contribution to Truss. I'd love for someone to pick this up. Otherwise, I can take a stab at it if/when there's enough demand for it.
Hey @Sam152! We have come up with a new spec for pushing directly to SageMaker and are hoping to roll this issue into that work. We've already made the server and Docker image compatible with SageMaker; the next step is coordinating the push itself: pushing the image to ECR, then creating a SageMaker service. If you're interested in contributing, please reach out. We'll try to keep this post updated.
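For anyone who wants to try the "push to ECR, then create a SageMaker service" step by hand in the meantime, here is a rough sketch of the three boto3 calls involved (`create_model`, `create_endpoint_config`, `create_endpoint`). This is not the spec mentioned above, only an illustration. It assumes the image is already in ECR; the function names, the `-config` suffix, the variant name, and the default instance type are all placeholders I made up.

```python
from typing import Any, Dict


def build_endpoint_requests(
    name: str,
    image_uri: str,
    role_arn: str,
    instance_type: str = "ml.m5.large",  # assumed default, pick what fits the model
) -> Dict[str, Dict[str, Any]]:
    """Build keyword arguments for the three SageMaker calls, keyed by method name."""
    config_name = f"{name}-config"  # hypothetical naming convention
    return {
        "create_model": {
            "ModelName": name,
            "PrimaryContainer": {"Image": image_uri},
            "ExecutionRoleArn": role_arn,
        },
        "create_endpoint_config": {
            "EndpointConfigName": config_name,
            "ProductionVariants": [
                {
                    "VariantName": "AllTraffic",
                    "ModelName": name,
                    "InstanceType": instance_type,
                    "InitialInstanceCount": 1,
                }
            ],
        },
        "create_endpoint": {
            "EndpointName": name,
            "EndpointConfigName": config_name,
        },
    }


def deploy(requests: Dict[str, Dict[str, Any]]) -> None:
    """Send the requests in order. Requires boto3 and AWS credentials."""
    import boto3  # imported lazily so the builder above works without AWS access

    client = boto3.client("sagemaker")
    for method in ("create_model", "create_endpoint_config", "create_endpoint"):
        getattr(client, method)(**requests[method])
```

Keeping the request building separate from `deploy` means the payloads can be inspected or unit-tested without touching an AWS account.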
Hi @bolasim, I've been trying to assess whether it's possible to configure Truss to use model weights hosted in an S3 bucket (manually pushed) and then use these instructions to deploy to SageMaker (
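On the SageMaker side, S3-hosted weights are usually passed through the `ModelDataUrl` field of the container definition given to `create_model`: SageMaker downloads the gzipped tarball at that URI and extracts it under `/opt/ml/model` inside the container. Whether Truss can be configured to load from that path is the open question here, so the following is only a sketch of the packaging half. The function names are hypothetical, and the upload to S3 is left as the manual step described above.

```python
import tarfile
from pathlib import Path
from typing import Dict


def package_weights(weights_dir: str, archive_path: str) -> str:
    """Bundle a weights directory into a gzipped tarball, files at the archive root."""
    root = Path(weights_dir)
    with tarfile.open(archive_path, "w:gz") as tar:
        for path in sorted(root.rglob("*")):
            if path.is_file():
                # Store paths relative to the weights directory so they land
                # directly under /opt/ml/model after extraction.
                tar.add(str(path), arcname=path.relative_to(root).as_posix())
    return archive_path


def container_definition(image_uri: str, model_data_url: str) -> Dict[str, str]:
    """Build a PrimaryContainer dict that points SageMaker at weights in S3."""
    if not model_data_url.startswith("s3://"):
        raise ValueError("model_data_url must be an s3:// URI")
    return {"Image": image_uri, "ModelDataUrl": model_data_url}
```

The returned dict would be passed as `PrimaryContainer` to the boto3 SageMaker client's `create_model` call once the archive has been uploaded.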
Really like the look of this project. I saw the AWS integration guide, but I was wondering what it'd be like to integrate with SageMaker (https://aws.amazon.com/sagemaker/).
I suspect there might be quite a bit of additional work in creating training and serving containers in the format that SageMaker can consume: https://docs.aws.amazon.com/sagemaker/latest/dg/your-algorithms-inference-code.html
In addition to that, I believe the officially supported containers are framework-specific and have specific drivers that connect the frameworks to the available GPUs, so I don't know how easy it'd be to even maintain a single set of containers that worked for all frameworks.
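For context on the serving side, the hosting contract in the linked docs comes down to a web server listening on port 8080 that answers `GET /ping` for health checks and `POST /invocations` for inference requests. Below is a minimal stdlib-only sketch of that contract. It says nothing about how Truss's own server is built; `predict` is a hypothetical stand-in that just echoes its input, and the JSON-only request handling is an assumption.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def predict(payload):
    """Hypothetical stand-in for a real model call: echoes the input back."""
    return {"echo": payload}


class SageMakerHandler(BaseHTTPRequestHandler):
    def _send(self, status, body=b""):
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def do_GET(self):
        # Health check: SageMaker expects HTTP 200 when the container is ready.
        self._send(200 if self.path == "/ping" else 404)

    def do_POST(self):
        if self.path != "/invocations":
            self._send(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        try:
            payload = json.loads(self.rfile.read(length) or b"null")
        except json.JSONDecodeError:
            self._send(400, b'{"error": "invalid JSON"}')
            return
        self._send(200, json.dumps(predict(payload)).encode("utf-8"))

    def log_message(self, format, *args):
        pass  # silence per-request logging in this sketch


def make_server(port=8080):
    """Create (but do not start) a server bound to all interfaces."""
    return HTTPServer(("0.0.0.0", port), SageMakerHandler)
```

Calling `make_server().serve_forever()` as the container's entrypoint would be enough to exercise the two routes locally with curl.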