tchaton/sagemaker-pytorch-boilerplate

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
conf		conf
container		container
deployement		deployement
local_test		local_test
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build_and_push.sh		build_and_push.sh
build_local_env.sh		build_local_env.sh
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
train		train
workflow.ipynb		workflow.ipynb

Repository files navigation

sagemaker-pytorch-boilerplate

Production ML, as a field, has matured. It’s increasingly common for companies to have at least one model in production. As more teams deploy models, the conversation around tooling has shifted from "What gets the job done?" to "What does it take to deploy a model at production scale?"

This project is a boilerplate codebase to train / serve / publish Pytorch Model using AWS Sagemaker.

We aim at simplifying MLOps worflow by providing a template for production ready development, allowing ML engineer to focus uniquely on their models and datasets.

We rely on Hydra for elegantly configuring our application and Pytorch Lightning, a lightweight PyTorch wrapper for ML researchers to scale their experiments with less boilerplate.

How to use this project

This project implements a 1-layer MLP on iris dataset as a baby demo.

sh build_local_env.sh 3.7.8 # It will create a local env to ease local dev

sh build_and_push.sh {IMAGE_NAME} {MODEL} {DATASET}
# It will build the folder container and push the image to AWS Elastic Container Registry (ECR)

Local development

## Training

Used to make quick dev.

source .venv/bin/activate
python src/train model={MODEL} dataset={DATASET}

or within docker image

Used to make sure the docker image is correcly working

sh local_test/train_local.sh ${IMAGE_NAME} ${ARGS_1} ${ARGS_2} ${ARGS_3} ...

## Local Serving

Terminal 1

In:
sh build_and_push.sh {IMAGE_NAME} {MODEL} {DATASET}.
cd local_test
sh serve_local.sh {IMAGE_NAME}

Out:
Starting the inference server with 4 workers.
[2020年08月19日 11:41:31 +0000] [9] [INFO] Starting gunicorn 20.0.4
[2020年08月19日 11:41:31 +0000] [9] [INFO] Listening at: unix:/tmp/gunicorn.sock (9)
[2020年08月19日 11:41:31 +0000] [9] [INFO] Using worker: gevent
[2020年08月19日 11:41:31 +0000] [13] [INFO] Booting worker with pid: 13
[2020年08月19日 11:41:31 +0000] [14] [INFO] Booting worker with pid: 14
[2020年08月19日 11:41:31 +0000] [15] [INFO] Booting worker with pid: 15
[2020年08月19日 11:41:31 +0000] [16] [INFO] Booting worker with pid: 16

Terminal 2

In:
cd local_test
sh predict.sh {SAMPLE_DATA} # Currently support only 'text/csv'

Train on AWS: Run workflow.ipynb notebook

jupyter lab

CAREFUL: Work in progress

About

No description, website, or topics provided.

Releases

No releases published

Packages

No packages published

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

tchaton/sagemaker-pytorch-boilerplate

Folders and files

Latest commit

History

Repository files navigation

sagemaker-pytorch-boilerplate

How to use this project

Local development

CAREFUL: Work in progress

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Languages

License

tchaton/sagemaker-pytorch-boilerplate

Folders and files

Latest commit

History

Repository files navigation

sagemaker-pytorch-boilerplate

How to use this project

Local development

CAREFUL: Work in progress

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages