Course Overview
This course builds upon and extends the DevOps methodology prevalent in software development to build, train, and deploy machine learning (ML) models. The course is based on the four-level MLOPs maturity framework. The course focuses on the first three levels, including the initial, repeatable, and reliable levels. The course stresses the importance of data, model, and code to successful ML deployments. It demonstrates the use of tools, automation, processes, and teamwork in addressing the challenges associated with handoffs between data engineers, data scientists, software developers, and operations. The course also discusses the use of tools and processes to monitor and take action when the model prediction in production drifts from agreed-upon key performance indicators.
Who should attend
This course is intended for:
- MLOps engineers who want to productionize and monitor ML models in the AWS cloud
- DevOps engineers who will be responsible for successfully deploying and maintaining ML models in production
Prerequisites
We recommend that attendees of this course have
- AWS Technical Essentials course (classroom or digital)
- DevOps Engineering on AWS (AWSDEVOPS), or equivalent experience
- Practical Data Science with Amazon SageMaker (PDSASM), or equivalent experience
[/list]
Course Objectives
In this course, you will learn to:
- Explain the benefits of MLOps
- Compare and contrast DevOps and MLOps
- Evaluate the security and governance requirements for an ML use case and describe possible solutions and mitigation strategies
- Set up experimentation environments for MLOps with Amazon SageMaker
- Explain best practices for versioning and maintaining the integrity of ML model assets (data, model, and code)
- Describe three options for creating a full CI/CD pipeline in an ML context
- Recall best practices for implementing automated packaging, testing and deployment. (Data/model/code)
- Demonstrate how to monitor ML based solutions
- Demonstrate how to automate an ML solution that tests, packages, and deploys a model in an automated fashion; detects performance degradation; and re-trains the model on top of newly acquired data