In this episode, Global Black Belt and Technical Architect in Big Data and Advanced Analytics Team at Microsoft, Alex Zeltov, is our guest and he explains the in’s and out’s of MLOps though various tools like mlflow and kubeflow
In this first episode, Alex talks on a more theoretical level about MLOps and the benefits it can deliver.
For more from Alex on MLOps and mlflow, check out his presentation at the Washington DC DataWorks Summit a couple, of weeks ago. The slides are now available on SlideShare and the video is available on YouTube:
Just like DataOps follows on to DevOps, one may say that MLOps continues after DataOps. While there is a wikipedia page on the subject, there is not that much “prior art” available just yet.
The main advantages that MLOps can deliver, according to Alex, are a much improved move to production of trained algorithmes, even allowing for CI/CD, and a more structured approach to training models where multiple data scientists can work together to achieve better results.
One of the main tools emerging at the moment is the DataBricks backed mlflow project. Though not an Apache project, it has been open sourced under the Apache License now and shows much promise.
In the episode, Alex explains how mlflow integrates with your data science notebooks to allow for reliable model management with minimal disruption.
The second contender to reach for the MLOps crown is Kubeflow.
Even though less mature than mlflow, it is backed by the very popular Kubernetes framework and that brings a large community together working on this project.