How To Deploy Machine Learning Model As Microservices

FastAPI is a popular Python framework for building high-performance microservices. This document discusses how to deploy a machine learning model as a microservice using FastAPI. It involves: 1. Training a logistic regression model on an iris dataset and saving it as a pickle file. 2. Creating a FastAPI app and defining an IrisSpecies data model for the API endpoint. 3. Implementing a predict endpoint that loads the model, makes predictions, and returns results. 4. Running the app and viewing interactive documentation on localhost to test the API.

Uploaded by

prudvi

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

124 views

How To Deploy Machine Learning Model As Microservices

Uploaded by

prudvi

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

How to deploy Machine Learning models as a

Microservice using FastAPI

towardsdatascience.com/how-to-deploy-machine-learning-models-as-a-microservice-using-fastapi-b3a6002768af

Microservice imlementation using FastAPI | Ashutosh Tripathi | Data Science Duniya

As of today, FastAPI is the most popular web framework for building microservices with
python 3.6+ versions. By deploying machine learning models as microservice-based
architecture, we make code components re-usable, highly maintained, ease of testing, and of-
course the quick response time. FastAPI is built over ASGI (Asynchronous Server Gateway
Interface) instead of flask’s WSGI (Web Server Gateway Interface). This is the reason it is
faster as compared to flask-based APIs.

It has a data validation system that can detect any invalid data type at the runtime and
returns the reason for bad inputs to the user in the JSON format only which frees developers
from managing this exception explicitly.

In this post, the objective is to explain the machine learning model deployment as
microservices with the help of FastAPI. So we will focus on that part, not on the model
training.

complete source code is also available in github repository. You will get the repository link at
the end of the post.

Step 1. Make your model for which you want to create the API ready
To create API for prediction we need the model ready so I have written few lines of code that
train the model and save it as LRClassifier.pkl file in the local disk. I have not focused on
exploratory data analysis, pre-processing or feature engineering part as that is out of the
scope for this article.

import pandas as pdfrom sklearn.model_selection import train_test_splitfrom

sklearn.linear_model import LogisticRegressionimport pickle# Load dataseturl =
""names = ['sepal-length', 'sepal-width', 'petal-length', 'petal-width',
'class']dataset =
pd.read_csv(filepath_or_buffer=url,header=None,sep=',',names=names)# Split-out
validation datasetarray = dataset.valuesX = array[:,0:4]y = array[:,4]X_train,
X_test, y_train, y_test = train_test_split(X, y, test_size=0.20, random_state=1,
shuffle=True)classifier = LogisticRegression()classifier.fit(X_train,y_train)save the
model to diskpickle.dump(classifier, open('LRClassifier.pkl', 'wb'))load the model
from diskloaded_model = pickle.load(open('LRClassifier.pkl', 'rb'))result =
loaded_model.score(X_test, y_test)print(result)

Jupyter snippet of the above code:

1/7
Logistic Regression python code snippet

Step 2. Create API using FastAPI framework

Start from scratch so that you don’t get any error:

Open VS code or any other editor of your choice. I use VS code

Using file meny open the directory where you want to work
open the terminal and create the virtual environment as below:
python -m venv venv-name
Activate venv using venv-name\Scripts\activate

Install Libraries:

pip install pandas

pip install numpy
pip install sklearn
pip install pickle
pip install FastAPI

Import libraries as shown in below code.

create a FastAPI "instance" and assign it to app

Here the app variable will be an "instance" of the class FastAPI .
This will be the main point of interaction to create all your API.
This app is the same one referred by uvicorn in the command as below:

2/7
Here main is the name of file where you are writing the code. you can give any name
but same you have to use while executing in the command in place of main.
When you need to send data from a client (let’s say, a browser) to your API, you send it
as a .
A body is data sent by the client to your API. A body is the data your API sends to the
client.
Your API almost always has to send a body. But clients don’t necessarily need to send
bodies all the time.
To declare a body, you use models with all their power and benefits.
Then you declare your data model as a class that inherits from BaseModel .
Use standard Python types for all the attributes.
In our case we want to predict the Iris Species so will create a data model as class with
four parameters which are the dimensions of the species.
Now create an end point also known as route named “predict”
Add a parameter of type data model we created which is “IrisSpecies”.
Now we can post data as json and it will be accepted in iris variable.
Next, we will load the already saved model in a variable loaded_model.
Now perform the prediction the same way we do in machine learning and return the
results.
now you can run the app and see the beautiful User Interface (UI) created by FastAPI
which uses Swagger now known as openAPI as backend for designing the
documentation and UI.
Full code is given below you can simply copy and paste and it will work if you have
followed the above steps properly.

from fastapi import FastAPIfrom pydantic import BaseModelimport pickleimport numpy as

npimport pandas as pdapp = FastAPI()class IrisSpecies(BaseModel):sepal_length:
floatsepal_width: floatpetal_length: floatpetal_width:
float@app.post('/predict')async def predict_species(iris: IrisSpecies):data =
iris.dict()loaded_model = pickle.load(open('LRClassifier.pkl', 'rb'))data_in =
[[data['sepal_length'], data['sepal_width'], data['petal_length'],
data['petal_width']]]prediction = loaded_model.predict(data_in)probability =
loaded_model.predict_proba(data_in).max()return {'prediction':
prediction[0],'probability': probability}

VS-Code snippet of the API creation:

3/7
Executing the APP:

Now if you can see the nice UI created by typing the url: 127.0.0.0:8000/docs

Below you see the API end point is created as POST request.

4/7
Click on the end point and it will expand as below.

Now click on Try it out and paste the dimensions to get the prediction.

I pasted some dummy dimensions and clicked on execute.

5/7
Now you see that it has predicted it as Iris-setosa with 99% accuracy.

You can directly call this api from anywhere as below:

import requestsnew_measurement = {"sepal_length": 1.2,"sepal_width":

2.3,"petal_length": 1.4,"petal_width": 2.8}response = requests.post('',
json=new_measurement)print(response.content)>>> b'{"prediction":"Iris-
setosa","probability":0.99}'

So this was all about the API creation using the FastAPI.

FastAPI also provides nice documentation which gets created automatically. just type in the
browser 127.0.0.0:8000/redoc

6/7
That’s it for this article. Hope you enjoyed reading. Share your thoughts about your
experience with FastAPI. Also, you can ask if you get any questions during implementation
using the comments.

7/7

Paypal Accounts
86% (7)
Paypal Accounts
8 pages
SAP DataSphere Tutorial
100% (1)
SAP DataSphere Tutorial
38 pages
Iso 8589
No ratings yet
Iso 8589
24 pages
Free Download Practical Statistics For Data Scientists PDF
No ratings yet
Free Download Practical Statistics For Data Scientists PDF
4 pages
How Does A Bike-Share Navigate Speedy Success - Google Capstone Project
100% (2)
How Does A Bike-Share Navigate Speedy Success - Google Capstone Project
13 pages
Kubernetes For MLOps Engineers
No ratings yet
Kubernetes For MLOps Engineers
7 pages
Mlops 101
No ratings yet
Mlops 101
33 pages
GenAI Interview Questions-1
No ratings yet
GenAI Interview Questions-1
9 pages
Bedrock Doc 1
No ratings yet
Bedrock Doc 1
4 pages
TensorFlow Cheatsheet Zero To Mastery V1.01
No ratings yet
TensorFlow Cheatsheet Zero To Mastery V1.01
26 pages
1. Application Of Large Language
No ratings yet
1. Application Of Large Language
75 pages
Aisha A Custom AI Library Chatbot Using The ChatGPT API
No ratings yet
Aisha A Custom AI Library Chatbot Using The ChatGPT API
23 pages
Generative AI - 48 Hours TOC
No ratings yet
Generative AI - 48 Hours TOC
4 pages
RAG and LangChain Loading Documents Round1
No ratings yet
RAG and LangChain Loading Documents Round1
8 pages
Generative AI Notes
No ratings yet
Generative AI Notes
1 page
Aws Mlops Framework
No ratings yet
Aws Mlops Framework
43 pages
Mastering Chunking in RAG - Techniques and Strategies
No ratings yet
Mastering Chunking in RAG - Techniques and Strategies
12 pages
Generative AI
No ratings yet
Generative AI
2 pages
Types of RAG: @bhavishya Pandit
No ratings yet
Types of RAG: @bhavishya Pandit
15 pages
Crud Rag
No ratings yet
Crud Rag
31 pages
Hugging Face Case Study 112023
No ratings yet
Hugging Face Case Study 112023
2 pages
10 Evani Generative AI Champion
No ratings yet
10 Evani Generative AI Champion
39 pages
64 Natural Language Processing Interview Questions and Answers-18 Juli 2019
No ratings yet
64 Natural Language Processing Interview Questions and Answers-18 Juli 2019
30 pages
Machine Learning GenAI Roadma
No ratings yet
Machine Learning GenAI Roadma
36 pages
Llama3, LangGraph and Elasticsearch - Build A Local Agent For Vector Search - Search Labs
100% (1)
Llama3, LangGraph and Elasticsearch - Build A Local Agent For Vector Search - Search Labs
48 pages
Getting Started With MLOPs 21 Page Tutorial
No ratings yet
Getting Started With MLOPs 21 Page Tutorial
21 pages
LangChain_Academy_-_Introduction_to_LangGraph_-_Motivation
No ratings yet
LangChain_Academy_-_Introduction_to_LangGraph_-_Motivation
17 pages
Introduction To Parallel Computing
100% (1)
Introduction To Parallel Computing
34 pages
Edureka Python Ebook
No ratings yet
Edureka Python Ebook
21 pages
MLOps Syllabus and Weekly Schedule (June 2021) PDF
No ratings yet
MLOps Syllabus and Weekly Schedule (June 2021) PDF
5 pages
Generative AI LLM Tutorial
No ratings yet
Generative AI LLM Tutorial
25 pages
Semantic Kernel
No ratings yet
Semantic Kernel
471 pages
Little Guide To Building Large Language Models in 2024
100% (1)
Little Guide To Building Large Language Models in 2024
65 pages
Deep Learning With PyTorch: Object Classification - Filliat Et Al
No ratings yet
Deep Learning With PyTorch: Object Classification - Filliat Et Al
3 pages
PyTorch For Machine Learning
No ratings yet
PyTorch For Machine Learning
5 pages
Lab7 LLM Chains
No ratings yet
Lab7 LLM Chains
7 pages
Build An MLOps Project in 6 Steps
No ratings yet
Build An MLOps Project in 6 Steps
8 pages
Gen Ai Solutions
No ratings yet
Gen Ai Solutions
14 pages
Machine Learning + Devops Using Azure ML Services
No ratings yet
Machine Learning + Devops Using Azure ML Services
17 pages
Guide To Evaluating LLM and RAG Systems
No ratings yet
Guide To Evaluating LLM and RAG Systems
41 pages
Robot Process Automation RPA and Its Future
No ratings yet
Robot Process Automation RPA and Its Future
25 pages
Generative AI Database
No ratings yet
Generative AI Database
14 pages
MLops Concept
No ratings yet
MLops Concept
20 pages
Chapter 2. Pair Programming
No ratings yet
Chapter 2. Pair Programming
15 pages
Onnx Machine Learning in Production - Blog
No ratings yet
Onnx Machine Learning in Production - Blog
4 pages
Natural Language Processing
100% (1)
Natural Language Processing
12 pages
Large Language Model (LLM) 1
100% (1)
Large Language Model (LLM) 1
17 pages
Elevating Customer Satisfaction With LLM-Powered Chatbots
No ratings yet
Elevating Customer Satisfaction With LLM-Powered Chatbots
18 pages
Google Cloud Security Engineer Exam Prep Sheet
No ratings yet
Google Cloud Security Engineer Exam Prep Sheet
9 pages
RAG and LangChain
No ratings yet
RAG and LangChain
14 pages
MLOps Buyers Guide by Seldon
No ratings yet
MLOps Buyers Guide by Seldon
11 pages
Best Practices For Prompt Engineering With The OpenAI
No ratings yet
Best Practices For Prompt Engineering With The OpenAI
6 pages
Brief Introduction To GenAI
No ratings yet
Brief Introduction To GenAI
1 page
Data Science ML Full Stack 2022 GitHub
No ratings yet
Data Science ML Full Stack 2022 GitHub
9 pages
Rag 1708257109
No ratings yet
Rag 1708257109
5 pages
Agents in LangChain
100% (1)
Agents in LangChain
11 pages
MasterClass Agentic AI & RAG Flyer-1
No ratings yet
MasterClass Agentic AI & RAG Flyer-1
4 pages
Advances in Quantum Machine Learning
No ratings yet
Advances in Quantum Machine Learning
38 pages
Building GenAI Products and Business Outline Web
No ratings yet
Building GenAI Products and Business Outline Web
8 pages
Modified Generative AI and LLMs in Practice
No ratings yet
Modified Generative AI and LLMs in Practice
6 pages
Shreyash's Resume
No ratings yet
Shreyash's Resume
1 page
How To Use LeetCode For Data Science SQL Interviews - StrataScratch
No ratings yet
How To Use LeetCode For Data Science SQL Interviews - StrataScratch
1 page
MLOps
No ratings yet
MLOps
9 pages
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
From Everand
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
Robert Johnson
No ratings yet
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
From Everand
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
Fouad Sabry
No ratings yet
Career Track Brochure - Data Science
No ratings yet
Career Track Brochure - Data Science
39 pages
CheatSheet Python 6 - Coding Interview Questions
No ratings yet
CheatSheet Python 6 - Coding Interview Questions
1 page
Programmable Search Engine
No ratings yet
Programmable Search Engine
2 pages
Ansa Course Content New
No ratings yet
Ansa Course Content New
15 pages
Alcoa Bearing Bracket Specs
No ratings yet
Alcoa Bearing Bracket Specs
3 pages
Rizwan Khan's Answer To How Should I Study For The GATE in Aerospace Engineering - Quora
No ratings yet
Rizwan Khan's Answer To How Should I Study For The GATE in Aerospace Engineering - Quora
3 pages
DCI Stencils
100% (3)
DCI Stencils
2 pages
F HG I KJ: 8.12 Shear Stress
No ratings yet
F HG I KJ: 8.12 Shear Stress
3 pages
Shear Forces and Bending Moments in Beams: (Examples)
No ratings yet
Shear Forces and Bending Moments in Beams: (Examples)
13 pages
05_PowerScale+Upgrades-SSP+-+Participant+Guide
No ratings yet
05_PowerScale+Upgrades-SSP+-+Participant+Guide
27 pages
Konar 2011
No ratings yet
Konar 2011
10 pages
Activating X Entry XDOS OPEN SHELL 9.2020
No ratings yet
Activating X Entry XDOS OPEN SHELL 9.2020
12 pages
Team 3 ME 558 Final Report
No ratings yet
Team 3 ME 558 Final Report
9 pages
300-4L User Guide PDF
No ratings yet
300-4L User Guide PDF
14 pages
Medicine Reminder: Budget Manager
No ratings yet
Medicine Reminder: Budget Manager
3 pages
Full Download Computational Thinking (MIT Press Essential Knowledge series) Peter J. Denning PDF DOCX
100% (2)
Full Download Computational Thinking (MIT Press Essential Knowledge series) Peter J. Denning PDF DOCX
55 pages
Final Anniversary 2020 2021 Evaluation For Anush Jain
No ratings yet
Final Anniversary 2020 2021 Evaluation For Anush Jain
21 pages
N - Mme005Ma1 - B#Tsel#Taliburatselma Sec 3
No ratings yet
N - Mme005Ma1 - B#Tsel#Taliburatselma Sec 3
204 pages
Group Id: 09 Daily Expense Tracker
100% (3)
Group Id: 09 Daily Expense Tracker
16 pages
WEB Tharindu
No ratings yet
WEB Tharindu
136 pages
Unit 4
No ratings yet
Unit 4
16 pages
Shubham Kumar
No ratings yet
Shubham Kumar
1 page
MK 208
No ratings yet
MK 208
64 pages
IoT Unit 5
No ratings yet
IoT Unit 5
2 pages
CSE422 Lab Assignment 03 (Alpha beta pruning)
No ratings yet
CSE422 Lab Assignment 03 (Alpha beta pruning)
5 pages
How Do I Go Online?: Simotion & Sinamics
No ratings yet
How Do I Go Online?: Simotion & Sinamics
48 pages
Resume Yash Acadamic
No ratings yet
Resume Yash Acadamic
1 page
ONLINE COMMUNITIES
No ratings yet
ONLINE COMMUNITIES
12 pages
2024 - Personalized Equalization of Sound Pressure at Eardrum With Insert Earbuds
No ratings yet
2024 - Personalized Equalization of Sound Pressure at Eardrum With Insert Earbuds
8 pages
Bangalore - Chennai
No ratings yet
Bangalore - Chennai
17 pages
Copia de Haptics - The Science of Touch in Periodontics 2
No ratings yet
Copia de Haptics - The Science of Touch in Periodontics 2
5 pages
Believer Song
No ratings yet
Believer Song
11 pages
Downloaded From Manuals Search Engine
No ratings yet
Downloaded From Manuals Search Engine
31 pages
3.2 - WindSCADA - System - Generic - XXHZ - Network - Connectivity - Requirements - EN - Doc-0000822 - r05
No ratings yet
3.2 - WindSCADA - System - Generic - XXHZ - Network - Connectivity - Requirements - EN - Doc-0000822 - r05
16 pages
Narrative Report
No ratings yet
Narrative Report
6 pages