In this case, we used docker container technologies to create ML platform from scratch.
It is consists of four different docker containers (mlflow, notebook, postgres, tensorboard) that are already built in docker-compose.yml
The details of containers could be found under ./platform directory.
Each container service has a specific dockerfile corresponding to the directories (mlflow, notebook, postgres, tensorboard)
under platform directory
Please, note that you need to install compatible docker and docker-compose version before using the service.
You could see the version of docker and docker-compose, below.
 Client:
 Version: 18.06.1-ce
 API version: 1.38
 Go version: go1.10.3
 Git commit: e68fc7a
 Built: Tue Aug 21 17:24:56 2018
 OS/Arch: linux/amd64
 Experimental: false
 
 Server:
 Engine:
 Version: 18.06.1-ce
 API version: 1.38 (minimum version 1.12)
 Go version: go1.10.3
 Git commit: e68fc7a
 Built: Tue Aug 21 17:23:21 2018
 OS/Arch: linux/amd64
 Experimental: false
docker-compose version 1.22.0, build f46880fe
docker-py version: 3.4.1
CPython version: 3.6.6
OpenSSL version: OpenSSL 1.1.0f 25 May 2017
- 
Please, note that download the dataset (fashion-mnist_train.csv, fashion-mnist_test.csv) via the link below before staring use this project. 
- 
Add the file under datadirectory.
- 
You will find makefileto kick off the platform. It has three different commands to build, start, and stop platform.
- 
To build platform make build
- 
To start platform make start
- 
To stop platform make stop
- 
To visit Notebook type the following address on your favorite browser: (You may find the token by this command: docker logs jupyter_notebook)http://localhost:8888/?token=<YOUR_TOKEN>
- 
To visit TensorBoard type the following address on your favorite browser: http://localhost:6006/
- 
To visit Mlflow type the following address on your favorite browser: http://localhost:5000/
We built CNN network to train dataset with four different metrics (accuracy, precision, recall, f1) We are logging those metrics mlflow and tensorboard sides. In addition, we plotted confusion matrix on tensorboard side rather than notebook.
You may find the model we built in method def __create_model in fmnist/fmnistexperiment.py.
input_shape = (self.__params.img_height, self.__params.img_width, 1) self.__model = Sequential() self.__model.add(Conv2D(32, kernel_size=(3, 3), activation='relu', kernel_initializer='he_normal', input_shape=input_shape)) self.__model.add(BatchNormalization()) self.__model.add(MaxPooling2D((2, 2))) self.__model.add(Dropout(0.25)) self.__model.add(BatchNormalization()) self.__model.add(Conv2D(64, (3, 3), activation='relu')) self.__model.add(MaxPooling2D(pool_size=(2, 2))) self.__model.add(Dropout(0.25)) self.__model.add(BatchNormalization()) self.__model.add(Conv2D(128, (3, 3), activation='relu')) self.__model.add(Dropout(0.4)) self.__model.add(Flatten()) self.__model.add(BatchNormalization()) self.__model.add(Dense(128, activation='relu')) self.__model.add(Dropout(0.3)) self.__model.add(Dense(self.__params.number_classes, activation='softmax')) self.__model.compile(loss=keras.losses.categorical_crossentropy, optimizer=keras.optimizers.Adam(), metrics=self.__params.metrics) self.__model.summary()
This service handles client side request via internet browser. We created a module (fmnist) for training and testing.
Whenever we changed any codes in the module, Notebook are loading that module automatically macros in the first cell on the notebook.
This service is being used to keep track confusion matrix and metrics' history of mode with respect to epoch. You may see the screenshots for TensorBoard, below.
This service is being used to keep track and manage model parameters, metrics, model artifacts. Postgres database is a database that is storing MLflow logs related to parameters, metrics. Model artifacts is stored local disk. To manage flexibly storage, another options (S3, NFS etc.) could be used rather than local disk. However, we preferred local disk in this implementation.