MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle

MLOps Virtual Event: Building
Machine Learning Platforms
Matei Zaharia
Chief Technologist, Databricks
@matei_zaharia

Even After Deploying, Operating ML is Complex!
 Monitoring performance of the model
 Data drift
 Governance and security
Many ML teams spend >50% of their time maintaining existing
models

Why is ML Hard to Operationalize?
 Dependence on data
 Multiple, application-specific
ways to evaluate performance
 Many teams and systems
involved
Data Prep
Training
Deployment
Raw Data
ML
ENGINEER
APPLICATIO
N
DEVELOPER
DATA
ENGINEE
R

Response: ML Platforms
 Software platforms to manage ML applications, from development to
production
 Most companies that use ML at scale are building one
 Tech examples: Facebook FBLearner, Google TFX, Uber
Michelangelo

Common Components in an ML Platform
 Data management, in development and at scoring time
▪ Data transformation, quality monitoring, data versioning
▪ Feature stores
 Model management
▪ Packaging, review, quality assurance, versioning
 Code and deployment management
▪ Reproducibility, deployment, monitoring, experimentation
ModelDB

Our Approach at Databricks
 Every team’s requirements will be different, and will change over
time
 Provide a general platform that is easy to integrate with diverse
tools
Open source machine
learning platform
Transactional, versioned
data lake storage
Data science & ML workspace

In This Webinar
 How we and other organizations handle the different components of
a machine learning platform
 Demos and experience from 4 different companies

End-to-End Data Science
and Machine Learning
on Databricks
Clemens Mewald
Director of Product Management, Databricks

End-to-End Data Science and ML on
AutoML
End-to-End ML Lifecycle
ML Runtime and
Environments
Batch Scoring
Online
Serving
Data Science Workspace
Prep Data Build Model Deploy/Monitor Model
Open,pluggable
architecture

End-to-End Data Science and ML on
AutoML
Batch Scoring
Online
Serving
Open,pluggable
architecture
ML Runtime and
Environments

Projects
Packaging format
for reproducible runs
on any compute
platform
Components

Models
General model
format
that standardizes
deployment options
Projects
Packaging format
on any compute
platform
Components

Tracking
Record and query
experiments: code,
metrics, parameters,
artifacts, models
Models
General model
format
that standardizes
deployment options
Projects
Packaging format
on any compute
platform
Components

Tracking
Record and query
experiments: code,
metrics, parameters,
artifacts, models
Models
General model
format
that standardizes
deployment options
Model Registry
Centralized and
collaborative
model lifecycle
management
Projects
Packaging format
on any compute
platform
Components

Model Lifecycle
Models
Flavor
2
Flavor 1
Custom
Models

Model Lifecycle
Models Tracking
Flavor
2
Flavor 1
Custom
Models
Parameter
s
Metrics Artifacts
ModelsMetadata

Model Lifecycle
Staging Production Archived
Data Scientists Deployment Engineers
v1
v2
v3
Models Tracking
Flavor
2
Flavor 1
Model Registry
Custom
Models
Parameter
s
Metrics Artifacts
ModelsMetadata

Model Lifecycle
v1
v2
v3
Models Tracking
Flavor
2
Flavor 1
Model Registry
Custom
Models
In-Line Code
Containers
Batch & Stream
Scoring
Cloud Inference
Services
OSS Serving
Solutions
Serving
Parameter
s
Metrics Artifacts
ModelsMetadata

Parameters and (a time series of) metrics Artifacts (including model)
Auto-logging for ML Frameworks: A single line of code logs parameters, metrics, and artifacts.
mlflow.keras.autolog() # or: mlflow.tensorflow.autolog()
Auto-Logging

Enterprise Ready
Enterprise grade access
controls, identity pass-through,
and auditability
Collaborative
Realtime co-editing
and commenting
Reproducible
Auto-logged revision history
and Git integration for
version control
Visualizations
Built-in visualizations and
support for the most popular
visualization libraries
(e.g. matplotlib, ggplot)
Experiment Tracking
Built-in tracking of Data
Science and ML experiments,
with metrics, parameters,
artifacts, and more
Multi-Language
Scala, SQL, Python, R:
All in one notebook
Databricks Notebooks
Provide a collaborative environment for Unified Data Analytics

Databricks Notebooks for Collaborative Data
Science
Data Engineers, Data Scientists, ML Engineers, and Data Analysts can all collaborate in one shared environment
using modern collaboration patterns.
Co-Presence / Co-Editing CommentingVersioning

Integration with Databricks Notebooks
● Runs Sidebar integrated with MLflow
Tracking
● Track runs, sort by metrics and parameters
● Linked to revision history of the notebook

Your Existing Data Lake
Ingestion
Tables
Data
Catalog
Feature
Store
Azure Data
Lake Storage
Amazon S3
Streaming
Batch
3rd Party Data
Marketplace
Files
for Data Science and ML
● Schema enforced high
quality data
● Optimized
performance
● Full data lineage /
governance
● Reproducibility
through time travel
ML Runtime

Ingest data and visualize data distribution

Data versioning and time travel

Integration with Delta
Auto-Logging for any Spark Datasource

Packages and optimizes most common ML
Frameworks
...
Machine Learning Runtime

Frameworks
...
Built-in Optimization for Distributed Deep
Learning
Distribute and Scale any Single-Machine
ML Code to 1,000’s of machines

Built-In AutoML and Experiment
Tracking
Frameworks
...
Learning
AutoML and Tracking /
Visualizations with MLflow

Machine
Learning
Pre-configured
Environment
Customizatio
n
requirements.txt
Built-In AutoML and Experiment
Tracking
conda.yaml
Frameworks
...
Learning
Customized Environments using
Conda
Conda-
BasedAutoML and Tracking /
Visualizations with MLflow

Integration with ML Runtime
Hyperopt autlogging to MLflow

Model Deployment
v1
v2
v3
Model Registry
In-Line Code
Containers
Batch & Stream
Scoring
Cloud Inference
Services
OSS Serving
Solutions
Serving

Model Deployment
model_udf =
mlflow.pyfunc.spark_udf(
spark,
model_uri='models:/forecast/production')
v1
v2
v3
Model Registry
In-Line Code
Containers
Batch & Stream
Scoring
Cloud Inference
Services
OSS Serving
Solutions
Serving

In summary, Databri cks accelerates the full ML Lifecycle
AutoML
Batch Scoring
Online
Serving
Open,pluggable
architecture
ML Runtime and
Environments

MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle

MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle

More Related Content

MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle