DP-100: Designing and Implementing a Data Science Solution



Kursarrangør: Glasspaper AS
Sted: Oslo, Helsfyr
Oslo
Kursadresse: Brynsveien 12, 0667 Oslo (kart)
Type:Åpent kurs / gruppeundervisning
Undervisningstid: kl 09:00 - 16:00
Varighet: 4 dager
Pris: 28.500
Neste kurs: 17.06.2024 | Vis alle kursdatoer

DP-100: Designing and Implementing a Data Science Solution on Azure.This course teaches you to leverage your existing knowledge of Python and machine learning to manage data ingestion and preparation, model training and deployment, and machine learning solution monitoring in Microsoft Azure.

Learn how to operate machine learning solutions at cloud scale using Azure Machine Learning.

IMPORTANT NOTICE!
This course is focused on Azure and does not teach the student how to do data science. It is assumed students already know that.

Audience
This course is designed for data scientists with existing knowledge of Python and machine learning frameworks like Scikit-Learn, PyTorch, and Tensorflow, who want to build and operate machine learning solutions in the cloud.

Prerequisites
Before attending this course, students must have fundamental knowledge of cloud computing concepts, and experience in general data science and machine learning tools and techniques.

Specifically:
• Creating cloud resources in Microsoft Azure
• Using Python to explore and visualize data
• Training and validating machine learning models using common frameworks like Scikit-Learn, PyTorch, and TensorFlow
• Working with containers
If you are completely new to data science and machine learning, please complete Microsoft Azure AI Fundamentals first.

Course content
Module 1: Getting Started with Azure Machine Learning

In this module, you will learn how to provision an Azure Machine Learning workspace and use it to manage machine learning assets such as data, compute, model training code, logged metrics, and trained models. You will learn how to use the web-based Azure Machine Learning studio interface as well as the Azure Machine Learning SDK and developer tools like Visual Studio Code and Jupyter Notebooks to work with the assets in your workspace.
Lessons:
• Introduction to Azure Machine Learning
• Working with Azure Machine Learning

Module 2: No-Code Machine Learning
This module introduces the Automated Machine Learning and Designer visual tools, which you can use to train, evaluate, and deploy machine learning models without writing any code.
Lessons:
• Automated Machine Learning
• Azure Machine Learning Designer

Module 3: Running Experiments and Training Models
In this module, you will get started with experiments that encapsulate data processing and model training code, and use them to train machine learning models.
Lessons:
• Introduction to Experiments
• Training and Registering Models

Module 4: Working with Data
Data is a fundamental element in any machine learning workload, so in this module, you will learn how to create and manage datastores and datasets in an Azure Machine Learning workspace, and how to use them in model training experiments.
Lessons:
• Working with Datastores
• Working with Datasets

Module 5: Working with Compute
One of the key benefits of the cloud is the ability to leverage compute resources on demand, and use them to scale machine learning processes to an extent that would be infeasible on your own hardware. In this module, you'll learn how to manage experiment environments that ensure consistent runtime consistency for experiments, and how to create and use compute targets for experiment runs.
Lessons:
• Working with Environments
• Working with Compute Targets

Module 6: Orchestrating Operations with Pipelines
Now that you understand the basics of running workloads as experiments that leverage data assets and compute resources, it's time to learn how to orchestrate these workloads as pipelines of connected steps. Pipelines are key to implementing an effective Machine Learning Operationalization (ML Ops) solution in Azure, so you'll explore how to define and run them in this module.
Lessons:
• Introduction to Pipelines

Module 7: Deploying and Consuming Models
Models are designed to help decision making through predictions, so they're only useful when deployed and available for an application to consume. In this module learn how to deploy models for real-time inferencing, and for batch inferencing.
Lessons:
• Real-time Inferencing
• Batch Inferencing
• Continuous Integration and Delivery

Module 8: Training Optimal Models
By this stage of the course, you've learned the end-to-end process for training, deploying, and consuming machine learning models; but how do you ensure your model produces the best predictive outputs for your data? In this module, you'll explore how you can use hyperparameter tuning and automated machine learning to take advantage of cloud-scale compute and find the best model for your data.
Lessons:
• Hyperparameter Tuning
• Automated Machine Learning

Module 9: Responsible Machine Learning
Data scientists have a duty to ensure they analyze data and train machine learning models responsibly; respecting individual privacy, mitigating bias, and ensuring transparency. This module explores some considerations and techniques for applying responsible machine learning principles.
Lessons:
• Differential Privacy
• Model Interpretability
• Fairness

Module 10: Monitoring Models
After a model has been deployed, it's important to understand how the model is being used in production, and to detect any degradation in its effectiveness due to data drift. This module describes techniques for monitoring models and their data.
Lessons:
• Monitoring Models with Application Insights
• Monitoring Data Drift

Certification
This course is recommended as preparation for exam DP-100, which leads to the new Azure Data Scientist Associate certification.